llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 14:02:52 +02:00

Author	SHA1	Message	Date
Paul Robinson	982abe3b3f	Add a "debugger tuning" concept that allows us to fine-tune how we emit debug info, according to the preferences of the different debuggers used on various targets. Darwin and FreeBSD default to tuning for LLDB; PS4 defaults to tuning for the SCE (Sony Computer Entertainment) debugger. All others default to GDB. Differential Revision: http://reviews.llvm.org/D8506 llvm-svn: 242338	2015-07-15 22:04:54 +00:00
JF Bastien	a8f830427c	Fix mergefunc infinite loop Self-referential constants containing references to a merged function no longer cause the MergeFunctions pass to infinite loop. Also adds a reproduction IR which would otherwise fail, which was isolated from a similar issue in Chromium. Author: jrkoenig Reviewers: nlewycky, jfb Subscribers: llvm-commits, nlewycky, jfb Differential Revision: http://reviews.llvm.org/D11208 llvm-svn: 242337	2015-07-15 21:51:33 +00:00
Rafael Espindola	cb9db3a66d	Handle the error of trying to convert a regular archive to a thin one. While at it, test that we can add to a thin archive. llvm-svn: 242330	2015-07-15 20:45:56 +00:00
Tobias Edler von Koch	2f242c1a89	Analyze recursive PHI nodes in BasicAA Summary: This patch allows phi nodes like %x = phi [ %incptr, ... ] [ %var, ... ] %incptr = getelementptr %x, 1 to be analyzed by BasicAliasAnalysis. In aliasPHI, we can detect incoming values that are recursive GEPs with a constant offset. Instead of trying to analyze a recursive GEP (and failing), we now ignore it and instead set the size of the memory referenced by the PHINode to UnknownSize. This represents all the possible memory locations the pointer represented by the PHINode could be advanced to by the GEP. For now, this new behavior is turned off by default to allow debugging of performance degradations seen with SPEC/x86 and Hexagon benchmarks. The flag -basicaa-recphi turns it on. Reviewers: hfinkel, sanjoy Subscribers: tobiasvk_caf, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D10368 llvm-svn: 242320	2015-07-15 19:32:22 +00:00
Bruno Cardoso Lopes	02108a873f	Revert "Look through PHIs to find additional register sources" Likely broke compilation on ARM: http://lab.llvm.org:8011/builders/clang-native-arm-lnt/builds/13054 This reverts commit 131ce4a838c081516cbfed039fc986b33e3979d6. llvm-svn: 242310	2015-07-15 18:10:35 +00:00
Adrian Prantl	450121ac9f	Debug Info: Add basic support for external types references. This is a necessary prerequisite for bootstrapping the emission of debug info inside modules. - Adds a FlagExternalTypeRef to DICompositeType. External types must have a unique identifier. - External type references are emitted using a forward declaration with a DW_AT_signature([DW_FORM_ref_sig8]) based on the UID. http://reviews.llvm.org/D9612 llvm-svn: 242302	2015-07-15 17:01:41 +00:00
Pete Cooper	885a149814	Add missing load/store flags to thumb2 instructions. These were the cause of a verifier error when building 7zip with -verify-machineinstrs. Running 'make check' with the verifier triggered the same error on the test here so i've updated the test to run the verifier on one of its runs instead of adding a new one. While looking at this code, there was a stale comment that these instructions were only used for disassembly. This probably used to be the case, but they are now used in the 'ARM load / store optimization pass' too. llvm-svn: 242300	2015-07-15 16:36:38 +00:00
Bruno Cardoso Lopes	3cf14dd30a	Look through PHIs to find additional register sources - Teaches the ValueTracker in the PeepholeOptimizer to look through PHI instructions. - Add findNextSourceAndRewritePHI method to lookup into multiple sources returnted by the ValueTracker and rewrite PHIs with new sources. With these changes we can find more register sources and rewrite more copies to allow coaslescing of bitcast instructions. Hence, we eliminate unnecessary VR64 <-> GR64 copies in x86, but it could be extended to other archs by marking "isBitcast" on target specific instructions. The x86 example follows: A: psllq %mm1, %mm0 movd %mm0, %r9 jmp C B: por %mm1, %mm0 movd %mm0, %r9 jmp C C: movd %r9, %mm0 pshufw $238, %mm0, %mm0 Becomes: A: psllq %mm1, %mm0 jmp C B: por %mm1, %mm0 jmp C C: pshufw $238, %mm0, %mm0 Differential Revision: http://reviews.llvm.org/D11197 rdar://problem/20404526 llvm-svn: 242295	2015-07-15 15:35:23 +00:00
Benjamin Kramer	49e3c66d0c	[PPC] Disassemble little endian ppc instructions in the right byte order PR24122. The test is simply a byte swapped version of ppc64-encoding.txt. llvm-svn: 242288	2015-07-15 12:56:19 +00:00
Alexey Bataev	1b8ab2f134	[SDAG] Optimize unordered comparison in soft-float mode (patch by Anton Nadolskiy) Current implementation handles unordered comparison poorly in soft-float mode. Consider (a ULE b) which is a <= b. It is lowered to (ledf2(a, b) <= 0 \|\| unorddf2(a, b) != 0) (in general). We can do better job by lowering it to (__gtdf2(a, b) <= 0). Such replacement is true for other CMP's (ult, ugt, uge). In general, we just call same function as for ordered case but negate comparison against zero. Differential Revision: http://reviews.llvm.org/D10804 llvm-svn: 242280	2015-07-15 08:39:35 +00:00
Hal Finkel	db4f85d64e	[PowerPC] Use the MachineCombiner to reassociate fadd/fmul This is a direct port of the code from the X86 backend (r239486/r240361), which uses the MachineCombiner to reassociate (floating-point) adds/muls to increase ILP, to the PowerPC backend. The rationale is the same. There is a lot of copy-and-paste here between the X86 code and the PowerPC code, and we should extract at least some of this into CodeGen somewhere. However, I don't want to do that until this code is enhanced to handle FMAs as well. After that, we'll be in a better position to extract the common parts. llvm-svn: 242279	2015-07-15 08:23:05 +00:00
Petr Pavlu	06ccb9b21b	[AArch64] Fix problems in decoding generic MSR instructions Bitpatterns rejected by the decoder method of `MSR (immediate)` should be decoded as the `extended MSR (register)` instruction. Differential Revision: http://reviews.llvm.org/D7174 llvm-svn: 242276	2015-07-15 08:10:30 +00:00
Petr Pavlu	4f9907e752	[TableGen] Improve decoding options for non-orthogonal instructions When FixedLenDecoder matches an input bitpattern of form [01]+ with an instruction bitpattern of form [01?]+ (where 0/1 are static bits and ? are mixed/variable bits) it passes the input bitpattern to a specific instruction decoder method which then makes a final decision whether the bitpattern is a valid instruction or not. This means the decoder must handle all possible values of the variable bits which sometimes leads to opcode rewrites in the decoder method when the instructions are not fully orthogonal. The patch provides a way for the decoder method to say that when it returns Fail it does not necessarily mean the bitpattern is invalid, but rather that the bitpattern is definitely not an instruction that is recognized by the decoder method. The decoder can then try to match the input bitpattern with other possible instruction bitpatterns. For example, this allows to solve a situation on AArch64 where the `MSR (immediate)` instruction has form: 1101 0101 0000 0??? 0100 ???? ???1 1111 but not all values of the ? bits are allowed. The rejected values should be handled by the `extended MSR (register)` instruction: 1101 0101 000? ???? ???? ???? ???? ???? The decoder will first try to decode an input bitpattern that matches both bitpatterns as `MSR (immediate)` but currently this puts the decoder method of `MSR (immediate)` into a situation when it must be able to decode all possible values of the ? bits, i.e. it would need to rewrite the instruction to `MSR (register)` when it is not `MSR (immediate)`. The patch allows to specify that the decoder method cannot determine if the instruction is valid for all variable values. The decoder method can simply return Fail when it knows it is definitely not `MSR (immediate)`. The decoder will then backtrack the decoding and find that it can match the input bitpattern with the more generic `MSR (register)` bitpattern too. Differential Revision: http://reviews.llvm.org/D7174 llvm-svn: 242274	2015-07-15 08:04:27 +00:00
Simon Pilgrim	bcccac638f	[X86][SSE] Added i686/SSE2 vector shift tests. We were only testing on x86-64, but we should be ensuring decent code gen of i64 shifts on 32-bit targets. llvm-svn: 242273	2015-07-15 08:04:07 +00:00
Igor Breger	7e12f17211	AVX : Fix ISA disabling in case AVX512VL , some instructions should be disabled only if AVX512BW present. Tests added. Differential Revision: http://reviews.llvm.org/D11122 llvm-svn: 242270	2015-07-15 07:08:10 +00:00
Rafael Espindola	71571d65a9	Initial support for writing thin archives. llvm-svn: 242269	2015-07-15 05:47:46 +00:00
Michael Zolotukhin	4f6efcf5b5	Tidy-up test case from r242257. llvm-svn: 242268	2015-07-15 01:51:51 +00:00
Michael Zolotukhin	51cd406a34	[LoopUnrolling] Handle cast instructions. During estimation of unrolling effect we should be able to propagate constants through casts. Differential Revision: http://reviews.llvm.org/D10207 llvm-svn: 242257	2015-07-15 00:19:51 +00:00
JF Bastien	24e01d0990	WebAssembly: fix build breakage. Summary: processFunctionBeforeCalleeSavedScan was renamed to determineCalleeSaves and now takes a BitVector parameter as of rL242165, reviewed in http://reviews.llvm.org/D10909 WebAssembly is still marked as experimental and therefore doesn't build by default. It does, however, grep by default! I notice that processFunctionBeforeCalleeSavedScan is still mentioned in a few comments and error messages, which I also fixed. Reviewers: qcolombet, sunfish Subscribers: jfb, dsanders, hfinkel, MatzeB, llvm-commits Differential Revision: http://reviews.llvm.org/D11199 llvm-svn: 242242	2015-07-14 23:06:07 +00:00
Hal Finkel	f7a4ebfecb	[PowerPC] Support symbolic targets in patchpoints Follow-up r235483, with the corresponding support in PPC. We use a regular call for symbolic targets (because they're much cheaper than indirect calls). llvm-svn: 242239	2015-07-14 22:53:11 +00:00
Rafael Espindola	92d8a3ae9e	Accept lower case to handle windows error messages. llvm-svn: 242236	2015-07-14 22:42:21 +00:00
David Majnemer	c31a55656e	[InstCombine] Generalize sub of selects optimization to all BinaryOperators This exposes further optimization opportunities if the selects are correlated. llvm-svn: 242235	2015-07-14 22:39:23 +00:00
Hal Finkel	2a88231c40	[PowerPC] Use the ABI indirect-call protocol for patchpoints We used to take the address specified as the direct target of the patchpoint and did no TOC-pointer handling. This, however, as not all that useful, because MCJIT tends to create a lot of modules, and they have their own TOC sections. Thus, to call from the generated code to other generated code, you really need to switch TOC pointers. Make this work as expected, and under ELFv1, tread the address as the function descriptor address so that the correct TOC pointer can be loaded. llvm-svn: 242217	2015-07-14 22:26:06 +00:00
Rafael Espindola	fadf8cbec7	Add support for reading members out of thin archives. For now the Archive owns the buffers of the thin archive members. This makes for a simple API, but all the buffers are destructed only when the archive is destructed. This should be fine since we close the files after mmap so we should not hit an open file limit. llvm-svn: 242215	2015-07-14 22:18:43 +00:00
Alex Lorenz	427be4c561	MIR Serialization: Serialize the machine basic block live in registers. llvm-svn: 242204	2015-07-14 21:24:41 +00:00
Tim Northover	e8970888a4	GVN: tolerate an instruction being replaced without existing in the leaderboard Sometimes an incidentally created instruction can duplicate a Value used elsewhere. It then often doesn't end up in the leader table. If it's later removed, we attempt to remove it from the leader table and segfault. Instead we should just ignore the removal request, which won't cause any problems. The reverse situation, where the original instruction is replaced by the new one (which you might think could leave the leader table empty) cannot occur, because the incidental instruction will never be found in the first place. llvm-svn: 242199	2015-07-14 21:03:18 +00:00
Hal Finkel	7783df1bb4	[PowerPC] Fix the PPCInstrInfo::getInstrLatency implementation PowerPC uses itineraries to describe processor pipelines (and dispatch-group restrictions for P7/P8 cores). Unfortunately, the target-independent implementation of TII.getInstrLatency calls ItinData->getStageLatency, and that looks for the largest cycle count in the pipeline for any given instruction. This, however, yields the wrong answer for the PPC itineraries, because we don't encode the full pipeline. Because the functional units are fully pipelined, we only model the initial stages (there are no relevant hazards in the later stages to model), and so the technique employed by getStageLatency does not really work. Instead, we should take the maximum output operand latency, and that's what PPCInstrInfo::getInstrLatency now does. This caused some test-case churn, including two unfortunate side effects. First, the new arrangement of copies we get from function parameters now sometimes blocks VSX FMA mutation (a FIXME has been added to the code and the test cases), and we have one significant test-suite regression: SingleSource/Benchmarks/BenchmarkGame/spectral-norm 56.4185% +/- 18.9398% In this benchmark we have a loop with a vectorized FP divide, and it with the new scheduling both divides end up in the same dispatch group (which in this case seems to cause a problem, although why is not exactly clear). The grouping structure is hard to predict from the bottom of the loop, and there may not be much we can do to fix this. Very few other test-suite performance effects were really significant, but almost all weakly favor this change. However, in light of the issues highlighted above, I've left the old behavior available via a command-line flag. llvm-svn: 242188	2015-07-14 20:02:02 +00:00
Krzysztof Parzyszek	8d49f92601	[Hexagon] Generate instructions for operations on predicate registers Convert logical operations on general-purpose registers to the correspon- ding operations on predicate registers. llvm-svn: 242186	2015-07-14 19:30:21 +00:00
Keno Fischer	400c1358d0	[CodeGen] Force emission of personality directive if explicitly specified Summary: Before this change, personality directives were not emitted if there was no invoke left in the function (of course until recently this also meant that we couldn't know what the personality actually was). This patch forces personality directives to still be emitted, unless it is known to be a noop in the absence of invokes, or the user explicitly specified `nounwind` (and not `uwtable`) on the function. Reviewers: majnemer, rnk Subscribers: rnk, llvm-commits Differential Revision: http://reviews.llvm.org/D10884 llvm-svn: 242185	2015-07-14 19:22:51 +00:00
Matt Arsenault	efaaa8cb7c	AMDGPU: Avoid using 64-bit shift for i64 (shl x, 32) This can be done only with moves which theoretically will optimize better later. Although this transform increases the instruction count, it should be code size / cycle count neutral in the worst VALU case. It also seems to slightly improve a couple of testcases due to other DAG combines this exposes. This is probably slightly worse for the SALU case, so it might be better to handle this during moveToVALU, although then you lose some simplifications like the load width reducing in the simple testcase. llvm-svn: 242177	2015-07-14 18:20:33 +00:00
Matt Arsenault	988dd980cf	AMDGPU/SI: Fix read2 merging into a super register. If the read2 produced was supposed to be writing into a super register, it would use the wrong subregister indices. Fix this by inserting copies, so we only ever write to a vreg_64. Run the register coalescer again to clean this up, although this isn't ideal and often does result in an extra move. Also remove the assert that offset1 > offset0. There isn't a real reason to not allow this other than a minor convenience in the compiler, and it doesn't seem worth the effort of avoiding it. llvm-svn: 242174	2015-07-14 17:57:36 +00:00
Nemanja Ivanovic	a397908a1f	Add missing builtins to the PPC back end for ABI compliance (vol. 4) This patch corresponds to review: http://reviews.llvm.org/D11183 Back end portion of the fourth round of additions to altivec.h. llvm-svn: 242167	2015-07-14 17:25:20 +00:00
Tim Northover	b5aedb0d34	ARM: add at least one real test for r242123. The ones committed were orthogonal to the change and would have passed before that revision. What it did do was prevent an assertion failure when generating object files. llvm-svn: 242166	2015-07-14 17:23:55 +00:00
Matthias Braun	14b971e075	PrologEpilogInserter: Rewrite API to determine callee save regsiters. This changes TargetFrameLowering::processFunctionBeforeCalleeSavedScan(): - Rename the function to determineCalleeSaves() - Pass a bitset of callee saved registers by reference, thus avoiding the function-global PhysRegUsed bitset in MachineRegisterInfo. - Without PhysRegUsed the implementation is fine tuned to not save physcial registers which are only read but never modified. Related to rdar://21539507 Differential Revision: http://reviews.llvm.org/D10909 llvm-svn: 242165	2015-07-14 17:17:13 +00:00
Tim Northover	9509e14e0a	AArch64: add rev64 alias for 64-bit rev instruction. It could be useful to assembly programmers and makes the permitted variants a little more uniform. llvm-svn: 242164	2015-07-14 17:07:29 +00:00
Krzysztof Parzyszek	38a34d7651	[Hexagon] Generate "extract" instructions more aggressively Generate extract instructions (via intrinsics) before the DAG combiner folds shifts into unrecognizable forms. llvm-svn: 242163	2015-07-14 17:07:24 +00:00
Rafael Espindola	1893c5e76b	llvm-ar: Don't try to extract from thin archives. This matches the gnu ar behavior. llvm-svn: 242162	2015-07-14 16:55:13 +00:00
Rafael Espindola	1724f6ab47	Sleep for 2.1 seconds to see if that makes the test stable on windows. Might fix pr24106. llvm-svn: 242158	2015-07-14 16:34:23 +00:00
Rafael Espindola	e5753ae917	llvm-ar: print an error when the requested member is not found. llvm-svn: 242156	2015-07-14 16:02:40 +00:00
Rafael Espindola	926701ff8f	Rename a test. NFC. llvm-svn: 242151	2015-07-14 15:06:18 +00:00
Tom Stellard	a3220fa789	AMDGPU/SI: Add support for shrinking v_cndmask_b32_e32 instructions Reviewers: arsenm Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11061 llvm-svn: 242146	2015-07-14 14:15:03 +00:00
Daniel Sanders	1c28346cdb	[mips] Fix li/la differences between IAS and GAS. Summary: - Signed 16-bit should have priority over unsigned. - For la, unsigned 16-bit must use ori+addu rather than directly use ori. - Correct tests on 32-bit immediates with 64-bit predicates by sign-extending the immediate beforehand. For example, isInt<16>(0xffff8000) should be true and use addiu. Also split li/la testing into separate files due to their size. Reviewers: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10967 llvm-svn: 242139	2015-07-14 12:24:22 +00:00
David Majnemer	52d04298e1	[SROA] Don't de-atomic volatile loads and stores Volatile loads and stores are made visible in global state regardless of what memory is involved. It is not correct to disregard the ordering and synchronization scope because it is possible to synchronize with memory operations performed by hardware. This partially addresses PR23737. llvm-svn: 242126	2015-07-14 06:19:58 +00:00
Yaron Keren	a300c4d7e5	Generate correct asm info for mingw and cygwin ARM targets. http://reviews.llvm.org/D11075 Patch by Martell Malone Reviewed by Reid Kleckner llvm-svn: 242123	2015-07-14 05:51:05 +00:00
NAKAMURA Takumi	aa66e39b4d	Give an explicit triple to llvm/test/CodeGen/X86/pr13577.ll. llvm-svn: 242111	2015-07-14 03:07:06 +00:00
Matthias Braun	e1bcc14506	Revert "LegalizeDAG: Fix and improve FCOPYSIGN/FABS legalization" Accidental commit, needs review first. This reverts commit r242107. llvm-svn: 242108	2015-07-14 02:09:57 +00:00
Matthias Braun	e8ffab6ec5	LegalizeDAG: Fix and improve FCOPYSIGN/FABS legalization - Factor out code to query and modify the sign bit of a floatingpoint value as an integer. This also works if none of the targets integer types is big enough to hold all bits of the floatingpoint value. - Legalize FABS(x) as FCOPYSIGN(x, 0.0) if FCOPYSIGN is available, otherwise perform bit manipulation on the sign bit. The previous code used "x >u 0 ? x : -x" which is incorrect for x being -0.0! It also takes 34 instructions on ARM Cortex-M4. With this patch we only require 5: vldr d0, LCPI0_0 vmov r2, r3, d0 lsrs r2, r3, #31 bfi r1, r2, #31, #1 bx lr (This could be further improved if the compiler would recognize that r2, r3 is zero). - Only lower FCOPYSIGN(x, y) = sign(x) ? -FABS(x) : FABS(x) if FABS is available otherwise perform bit manipulation on the sign bit. - Perform the sign(x) test by masking out the sign bit and comparing with 0 rather than shifting the sign bit to the highest position and testing for "<s 0". For x86 copysignl (on 80bit values) this gets us: testl $32768, %eax rather than: shlq $48, %rax sets %al testb %al, %al llvm-svn: 242107	2015-07-14 02:08:26 +00:00
Matthias Braun	cc6f791568	X86: Check output of x86 copysignl testcase. This makes the changes in an upcoming patch visible. llvm-svn: 242106	2015-07-14 02:08:23 +00:00
Alex Lorenz	262fad804e	MIR Serialization: Serialize the variable sized stack objects. llvm-svn: 242095	2015-07-14 00:26:26 +00:00
Reid Kleckner	15ae77989b	Update enforceKnownAlignment after the isWeakForLinker semantic change Previously we would refrain from attempting to increase the linkage of available_externally globals because they were considered weak for the linker. Now they are treated more like a declaration instead of a weak definition. This was causing SSE alignment faults in Chromuim, when some code assumed it could increase the alignment of a dllimported global that it didn't control. http://crbug.com/509256 llvm-svn: 242091	2015-07-14 00:11:08 +00:00
Alex Lorenz	cddac3ca8f	MIR Serialization: Serialize the sub register indices. This commit serializes the sub register indices from the register machine operands. Reviewers: Duncan P. N. Exon Smith llvm-svn: 242084	2015-07-13 23:24:34 +00:00
Rafael Espindola	59e6986755	Add missing file. Sorry about that. llvm-svn: 242083	2015-07-13 23:14:26 +00:00
Rafael Espindola	61c7cb197d	Fix reading archive members with / in the name. This is important for thin archives. llvm-svn: 242082	2015-07-13 23:07:05 +00:00
Bill Schmidt	381b5e4a38	[PPC64LE] More improvements to VSX swap optimization This patch allows VSX swap optimization to succeed more frequently. Specifically, it is concerned with common code sequences that occur when copying a scalar floating-point value to a vector register. This patch currently handles cases where the floating-point value is already in a register, but does not yet handle loads (such as via an LXSDX scalar floating-point VSX load). That will be dealt with later. A typical case is when a scalar value comes in as a floating-point parameter. The value is copied into a virtual VSFRC register, and then a sequence of SUBREG_TO_REG and/or COPY operations will convert it to a full vector register of the class required by the context. If this vector register is then used as part of a lane-permuted computation, the original scalar value will be in the wrong lane. We can fix this by adding a swap operation following any widening SUBREG_TO_REG operation. Additional COPY operations may be needed around the swap operation in order to keep register assignment happy, but these are pro forma operations that will be removed by coalescing. If a scalar value is otherwise directly referenced in a computation (such as by one of the many XS* vector-scalar operations), we currently disable swap optimization. These operations are lane-sensitive by definition. A MentionsPartialVR flag is added for use in each swap table entry that mentions a scalar floating-point register without having special handling defined. A common idiom for PPC64LE is to convert a double-precision scalar to a vector by performing a splat operation. This ensures that the value can be referenced as V[0], as it would be for big endian, whereas just converting the scalar to a vector with a SUBREG_TO_REG operation leaves this value only in V[1]. A doubleword splat operation is one form of an XXPERMDI instruction, which takes one doubleword from a first operand and another doubleword from a second operand, with a two-bit selector operand indicating which doublewords are chosen. In the general case, an XXPERMDI can be permitted in a lane-swapped region provided that it is properly transformed to select the corresponding swapped values. This transformation is to reverse the order of the two input operands, and to reverse and complement the bits of the selector operand (derivation left as an exercise to the reader ;). A new test case that exercises the scalar-to-vector and generalized XXPERMDI transformations is added as CodeGen/PowerPC/swaps-le-5.ll. The patch also requires a change to CodeGen/PowerPC/swaps-le-3.ll to use CHECK-DAG instead of CHECK for two independent instructions that now appear in reverse order. There are two small unrelated changes that are added with this patch. First, the XXSLDWI instruction was incorrectly omitted from the list of lane-sensitive instructions; this is now fixed. Second, I observed that the same webs were being rejected over and over again for different reasons. Since it's sufficient to reject a web only once, I added a check for this to speed up the compilation time slightly. llvm-svn: 242081	2015-07-13 22:58:19 +00:00
Pete Cooper	f529cd51f8	Remove unnecessary lines from the test in r242068. This test case was breaking the hexagon elf bot. The failing lines were actually unnecessary as checking that the store still reads the correct value demonstrates that everything is working fine now. llvm-svn: 242073	2015-07-13 21:50:35 +00:00
Pete Cooper	20cab35e27	Loop idiom recognizer was replacing too many uses of popcount. When spotting that a loop can use ctpop, we were incorrectly replacing all uses of a value with a value derived from ctpop. The bug here was exposed because we were replacing a use prior to the ctpop with the ctpop value and so we have a use before def, i.e., we changed %tobool.5 = icmp ne i32 %num, 0 store i1 %tobool.5, i1* %ptr br i1 %tobool.5, label %for.body.lr.ph, label %for.end to store i1 %1, i1* %ptr %0 = call i32 @llvm.ctpop.i32(i32 %num) %1 = icmp ne i32 %0, 0 br i1 %1, label %for.body.lr.ph, label %for.end Even if we inserted the ctpop so that it dominates the store here, that would still be incorrect. The store doesn’t want the result of ctpop. The fix is very simple, and involves replacing only the branch condition with the ctpop instead of all uses. Reviewed by Hal Finkel. llvm-svn: 242068	2015-07-13 21:25:33 +00:00
Reid Kleckner	e4008eaff3	[WinEH] Emit the LSDA even if no lpads remain but outlining occurred The outlined funclets call intrinsics which reference labels from the LSDA. This situation can easily arise in small functions with a single cleanup at -O0, where Clang marks a definition as nounwind, and then WinEHPrepare "discovers" that the landingpad is dead by accident and deletes it. We now need to ask the LLVM IR Function for it's personality directly, rather than going through MachineModuleInfo. Fixes PR23892. llvm-svn: 242063	2015-07-13 20:41:46 +00:00
Rafael Espindola	8be47bcf22	Add support deterministic output in llvm-ar and make it the default. llvm-svn: 242061	2015-07-13 20:38:09 +00:00
David Majnemer	31202c91fa	[MC] Correctly escape .safeseh's symbol This fixes PR24107. llvm-svn: 242050	2015-07-13 18:51:15 +00:00
Mark Heffernan	cab25e5143	Enable runtime unrolling with unroll pragma metadata Enable runtime unrolling for loops with unroll count metadata ("#pragma unroll N") and a runtime trip count. Also, do not unroll loops with unroll full metadata if the loop has a runtime loop count. Previously, such loops would be unrolled with a very large threshold (pragma-unroll-threshold) if runtime unrolled happened to be enabled resulting in a very large (and likely unwise) unroll factor. llvm-svn: 242047	2015-07-13 18:26:27 +00:00
Alex Lorenz	f891b70313	MIR Serialization: Serialize the fixed stack objects. This commit serializes the fixed stack objects, including fixed spill slots. The fixed stack objects are serialized using a YAML sequence of YAML inline mappings. Each mapping has the object's ID, type, size, offset, and alignment. The objects that aren't spill slots also serialize the isImmutable and isAliased flags. The fixed stack objects are a part of the machine function's YAML mapping. Reviewers: Duncan P. N. Exon Smith llvm-svn: 242045	2015-07-13 18:07:26 +00:00
Reid Kleckner	20e3005c59	[WinEH] Strip the \01 character from the __CxxFrameHandler3 thunk name Add another C++ 32-bit EH table test. llvm-svn: 242044	2015-07-13 17:55:14 +00:00
James Y Knight	b1184c5419	Fix handling of the 'n' asm constraint with invalid operands. It had accidently accepted a symbol+offset value (and emitted incorrect code for it, keeping only the offset part) instead of properly reporting the constraint as invalid. Differential Revision: http://reviews.llvm.org/D11039 llvm-svn: 242040	2015-07-13 16:36:22 +00:00
Tom Stellard	24371a62d2	AMDGPU/SI: Select mad patterns to v_mac_f32 The two-address instruction pass will convert these back to v_mad_f32 if necessary. Differential Revision: http://reviews.llvm.org/D11060 llvm-svn: 242038	2015-07-13 15:47:57 +00:00
Logan Chien	33021e9c0e	ARM: Fix cttz expansion on vector types. The 64/128-bit vector types are legal if NEON instructions are available. However, there was no matching patterns for @llvm.cttz.*() intrinsics and result in fatal error. This commit fixes the problem by lowering cttz to: a. ctpop((x & -x) - 1) b. width - ctlz(x & -x) - 1 llvm-svn: 242037	2015-07-13 15:37:30 +00:00
Scott Douglass	f2de6e8d36	[ARM] Handle commutativity when converting to tADDhirr in Thumb2 Also, run thumb_rewrite.s tests in Thumb2 now that they pass. Differential Revision: http://reviews.llvm.org/D11132 llvm-svn: 242036	2015-07-13 15:31:48 +00:00
Scott Douglass	265de7fb23	[ARM] Add Thumb2 ADD with SP narrowing from 3 operand to 2 Differential Revision: http://reviews.llvm.org/D11131 llvm-svn: 242035	2015-07-13 15:31:40 +00:00
Scott Douglass	4d198ab5b3	[ARM] Small refactor of tryConvertingToTwoOperandForm (nfc) Also, add more Thumb2 ADD tests requested during review of http://reviews.llvm.org/D11053. Differential Revision: http://reviews.llvm.org/D11130 llvm-svn: 242034	2015-07-13 15:31:33 +00:00
Silviu Baranga	ae2a209a5b	Cleanup after r241809 - remove uncessary call to std::sort Summary: The iteration order within a member of DepCands is deterministic and therefore we don't have to sort the accesses within a member. We also don't have to copy the indices of the pointers into a vector, since we can iterate over the members of the class. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11145 llvm-svn: 242033	2015-07-13 14:48:24 +00:00
Rafael Espindola	5af0439872	Don't change the visibility when converting a definition to a declaration. llvm-svn: 242030	2015-07-13 14:18:22 +00:00
Rafael Espindola	122834dffd	Print the visibility of available_externally functions. We were already printing it for declarations, but not available_externally. llvm-svn: 242027	2015-07-13 13:55:18 +00:00
Manuel Klimek	7a05b84c26	Revert r241981 "Revert "Revert r236894 "[BasicAA] Fix zext & sext handling""" The repros from PR23626 still fail. llvm-svn: 242025	2015-07-13 13:50:55 +00:00
Elena Demikhovsky	618bae6f38	AVX-512: Added all AVX-512 forms of Vector Convert for Float/Double/Int/Long types. In this patch I have only encoding. Intrinsics and DAG lowering will be in the next patch. I temporary removed the old intrinsics test (just to split this patch). Half types are not covered here. Differential Revision: http://reviews.llvm.org/D11134 llvm-svn: 242023	2015-07-13 13:26:20 +00:00
Jingyue Wu	248d90dab7	[LSR] don't attempt to promote ephemeral values to indvars Summary: This at least saves compile time. I also encountered a case where ephemeral values affect whether other variables are promoted, causing performance issues. It may be a bug in LSR, but I didn't manage to reduce it yet. Anyhow, I believe it's in general not worth considering ephemeral values in LSR. Reviewers: atrick, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11115 llvm-svn: 242011	2015-07-13 03:28:53 +00:00
David Majnemer	31685d95b1	[InstSimplify] Teach InstSimplify how to simplify extractelement llvm-svn: 242008	2015-07-13 01:15:53 +00:00
David Majnemer	bbc2fd617f	[InstSimplify] Teach InstSimplify how to simplify extractvalue llvm-svn: 242007	2015-07-13 01:15:46 +00:00
Renato Golin	943837bfb6	[ARM] Add support for nest attribute using r12 Register r12 ('ip') is used by GCC for this purpose and hence is used here. As discussed on the GCC mailing list, the register choice is an ABI issue and so choosing the same register as GCC means __builtin_call_with_static_chain is compatible. A similar patch has just gone in the AArch64 backend, so this is just the ARM counterpart, following the same discussion. Patch by Stephen Cross. llvm-svn: 241996	2015-07-12 18:16:40 +00:00
Simon Pilgrim	e3726fba14	[X86][SSE] Tidied up vector extend/truncation tests. NFCI. llvm-svn: 241995	2015-07-12 17:40:49 +00:00
Simon Pilgrim	88d91ba7dc	[X86][SSE] Vectorized v4i32 non-uniform shifts. While the v4i32 shl operation is already vectorized using a cvttps2dq/pmulld pattern, the lshr/ashr opeations are still scalarized. This patch adds vectorization support for non-uniform v4i32 shift operations - it splats constant shift amounts to allow them to use the immediate sse shift instructions, or extracts/zero-extends non-constant shift amounts. The individual results are then blended together. Differential Revision: http://reviews.llvm.org/D11063 llvm-svn: 241989	2015-07-12 11:15:19 +00:00
David Majnemer	243b5516a7	[LICM] Don't try to sink values out of loops without any exits There is no suitable basic block to sink instructions in loops without exits. The only way an instruction in a loop without exits can be used is as an incoming value to a PHI. In such cases, the incoming block for the corresponding value is unreachable. This fixes PR24013. Differential Revision: http://reviews.llvm.org/D10903 llvm-svn: 241987	2015-07-12 03:53:05 +00:00
Hal Finkel	568c3a41af	[PowerPC] Make use of the TargetRecip system r238842 added the TargetRecip system for controlling use of reciprocal estimates for sqrt and division using a set of parameters that can be set by the frontend. Clang now supports a sophisticated -mrecip option, and this will allow that option to effectively control the relevant code-generation functionality of the PPC backend. llvm-svn: 241985	2015-07-12 02:33:57 +00:00
Hal Finkel	f91045042a	[PowerPC] Support the nest parameter attribute This adds support for the 'nest' attribute, which allows the static chain register to be set for functions calls under non-Darwin PPC/PPC64 targets. r11 is the chain register (which the PPC64 ELF ABI calls the "environment pointer"). For indirect calls under PPC64 ELFv1, this would normally be loaded from the function descriptor, but providing an explicit 'nest' parameter will override that process and use the value provided. This allows __builtin_call_with_static_chain to work as expected on PowerPC. llvm-svn: 241984	2015-07-12 00:37:44 +00:00
Hal Finkel	d776deb484	Revert "Revert r236894 "[BasicAA] Fix zext & sext handling"" r236894 caused PR23626 (Clang miscompiles webkit's base64 decoder), and was reverted in r237984. This reapplies the patch with an additional test case for PR23626 and the associated fix (both scales and offsets in the BasicAliasAnalysis::constantOffsetHeuristic should initially be zero). Patch by Nick White, thanks! llvm-svn: 241981	2015-07-11 11:04:54 +00:00
Igor Laevsky	05bff16edd	Add argmemonly attribute. This change adds new attribute called "argmemonly". Function marked with this attribute can only access memory through it's argument pointers. This attribute directly corresponds to the "OnlyAccessesArgumentPointees" ModRef behaviour in alias analysis. Differential Revision: http://reviews.llvm.org/D10398 llvm-svn: 241979	2015-07-11 10:30:36 +00:00
Tyler Nowicki	21a81c1187	Renamed some uses of unroll to interleave in the vectorizer. llvm-svn: 241971	2015-07-11 00:31:11 +00:00
Alex Lorenz	3307a878a0	MIR Serialization: Serialize the virtual register operands. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D11005 llvm-svn: 241959	2015-07-10 22:51:20 +00:00
Bjorn Steinbrink	4de6eb776a	[InstCombine] Actually combine AA metadata when replacing one load with another Fixes PR24083 llvm-svn: 241955	2015-07-10 22:30:17 +00:00
Reid Kleckner	1acdc5b15e	[SEH] Push reloads of the SEH code past phi nodes This in turn would sometimes introduce new cleanupblocks that didn't previously exist. The uses were being introduced by SSA value demotion. We actually want to promote uses of EH pointers and selectors, so I added some spcecial casing to avoid demoting such instructions. This is getting overly complicated, but hopefully we'll come along and delete it in the new representation. llvm-svn: 241950	2015-07-10 22:21:54 +00:00
Matt Arsenault	e1819eb4e3	DAGCombiner: Assume invariant load cannot alias a store The motivation is to allow GatherAllAliases / FindBetterChain to not give up on dependent loads of a pointer from constant memory. This is important for AMDGPU, because most loads are pointers derived from a load of a kernel argument from constant memory. llvm-svn: 241948	2015-07-10 22:17:40 +00:00
Quentin Colombet	74a81286b5	[ShrinkWrap][PEI] Do not insert epilogue for unreachable blocks. Although this is not incorrect to insert such code, it is useless and it hurts the binary size. llvm-svn: 241946	2015-07-10 22:09:55 +00:00
Evgeniy Stepanov	b389e3c7f6	Fix AArch64 prologue for empty frame with dynamic allocas. Fixes PR23804: assertion failure in emitPrologue in the case of a function with an empty frame and a dynamic alloca that needs stack realignment. This is a typical case for AddressSanitizer. llvm-svn: 241943	2015-07-10 21:24:07 +00:00
Michael J. Spencer	2a3efb265a	[Object][ELF] Handle the dynamic string table in files without a section table. llvm-svn: 241937	2015-07-10 20:11:57 +00:00
Matthias Braun	bab8638132	ARMLoadStoreOpt: Merge subs/adds into LDRD/STRD; Factor out common code This commit factors out common code from MergeBaseUpdateLoadStore() and MergeBaseUpdateLSMultiple() and introduces a new function MergeBaseUpdateLSDouble() which merges adds/subs preceding/following a strd/ldrd instruction into an strd/ldrd instruction with writeback where possible. Differential Revision: http://reviews.llvm.org/D10676 llvm-svn: 241928	2015-07-10 18:37:33 +00:00
Fiona Glaser	7d7d370b56	ComputeKnownBits: be a bit smarter about ADDs If our two inputs have known top-zero bit counts M and N, we trivially know that the output cannot have any bits set in the top (min(M, N)-1) bits, since nothing could carry past that point. llvm-svn: 241927	2015-07-10 18:29:02 +00:00
Matthias Braun	5c16e27f27	ARMLoadStoreOptimizer: Create LDRD/STRD on thumb2 Differential Revision: http://reviews.llvm.org/D10623 llvm-svn: 241926	2015-07-10 18:28:49 +00:00
Alex Lorenz	e7f6afffde	MIR Serialization: Initial serialization of stack objects. This commit implements the initial serialization of stack objects from the MachineFrameInfo class. It can only serialize the ordinary stack objects (including ordinary spill slots), but it doesn't serialize variable sized or fixed stack objects yet. The stack objects are serialized using a YAML sequence of YAML inline mappings. Each mapping has the object's ID, type, size, offset and alignment. The stack objects are a part of machine function's YAML mapping. Reviewers: Duncan P. N. Exon Smith llvm-svn: 241922	2015-07-10 18:13:57 +00:00
Matthias Braun	699544cc62	ARMLoadStoreOptimizer: Rewrite LDM/STM matching logic. This improves the logic in several ways and is a preparation for followup patches: - First perform an analysis and create a list of merge candidates, then transform. This simplifies the code in that you have don't have to care to much anymore that you may be holding iterators to MachineInstrs that get removed. - Analyze/Transform basic blocks in reverse order. This allows to use LivePhysRegs to find free registers instead of the RegisterScavenger. The RegisterScavenger will become less precise in the future as it relies on the deprecated kill-flags. - Return the newly created node in MergeOps so there's no need to look around in the schedule to find it. - Rename some MBBI iterators to InsertBefore to make their role clear. - General code cleanup. Differential Revision: http://reviews.llvm.org/D10140 llvm-svn: 241920	2015-07-10 18:08:49 +00:00
Eli Bendersky	711d658342	Actually support volatile memcpys in NVPTX lowering Differential Revision: http://reviews.llvm.org/D11091 llvm-svn: 241914	2015-07-10 15:40:33 +00:00
Benjamin Kramer	90b7dad7cf	[InstSimplify] Fold away ord/uno fcmps when nnan is present. This is important to fold away the slow case of complex multiplies emitted by clang. llvm-svn: 241911	2015-07-10 14:02:02 +00:00
James Molloy	4008b8e3dc	Add support for fast-math flags to the FCmp instruction. FCmp behaves a lot like a floating-point binary operator in many ways, and can benefit from fast-math information. Flags such as nsz and nnan can affect if this fcmp (in combination with a select) can be treated as a fminnum/fmaxnum operation. This adds backwards-compatible bitcode support, IR parsing and writing, LangRef changes and IRBuilder changes. I'll need to audit InstSimplify and InstCombine in a followup to find places where flags should be copied. llvm-svn: 241901	2015-07-10 12:52:00 +00:00
Alexey Bataev	d7edeff54f	Disable loop re-rotation for -Oz (patch by Andrey Turetsky) After changes in rL231820 loop re-rotation is performed even in -Oz mode. Since loop rotation is disabled for -Oz, it seems loop re-rotation should be disabled too. Differential Revision: http://reviews.llvm.org/D10961 llvm-svn: 241897	2015-07-10 10:37:09 +00:00
David Majnemer	80ac5e60bf	Revert the new EH instructions This reverts commits r241888-r241891, I didn't mean to commit them. llvm-svn: 241893	2015-07-10 07:15:17 +00:00
David Majnemer	18c9feac36	Tighten the verifier check for catchblock. llvm-svn: 241891	2015-07-10 07:01:07 +00:00
David Majnemer	6310e08ce2	New EH representation for MSVC compatibility Summary: This introduces new instructions neccessary to implement MSVC-compatible exception handling support. Most of the middle-end and none of the back-end haven't been audited or updated to take them into account. Reviewers: rnk, JosephTremoulet, reames, nlewycky, rjmccall Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11041 llvm-svn: 241888	2015-07-10 07:00:44 +00:00
Bjorn Steinbrink	ecb205fc74	[InstCombine] Employ AliasAnalysis in FindAvailableLoadedValue llvm-svn: 241887	2015-07-10 06:55:49 +00:00
Bjorn Steinbrink	1a4f8db997	[InstCombine] Properly combine metadata when replacing a load with another Not doing this can lead to misoptimizations down the line, e.g. because of range metadata on the replacing load excluding values that are valid for the load that is being replaced. llvm-svn: 241886	2015-07-10 06:55:44 +00:00
Jingyue Wu	7084152bd9	[NVPTX] declare no vector registers Summary: Without this patch, LoopVectorizer in certain cases (see loop-vectorize.ll) produces code with complex control flow which hurts later optimizations. Since NVPTX doesn't have vector registers in LLVM's sense (NVPTXTTI::getRegisterBitWidth(true) == 32), we for now declare no vector registers to effectively disable loop vectorization. Reviewers: jholewinski Subscribers: jingyue, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D11089 llvm-svn: 241884	2015-07-10 04:31:56 +00:00
Reid Kleckner	b2ba390775	[WinEH] Make sure LSDA tables are 4 byte aligned Apparently this is important, otherwise _except_handler3 assumes that the registration node is corrupted and ignores it. Also fix a bug in WinEHPrepare where we would insert code after a terminator instruction. llvm-svn: 241877	2015-07-10 00:08:49 +00:00
Sanjay Patel	76dbdc8f6e	[x86] enable machine combiner reassociations for scalar double-precision multiplies llvm-svn: 241873	2015-07-09 22:58:39 +00:00
Sanjay Patel	2245b46f4f	[x86] enable machine combiner reassociations for scalar double-precision adds llvm-svn: 241871	2015-07-09 22:48:54 +00:00
Michael J. Spencer	bb17a74bbd	[Object][ELF] Support dumping hash-tables from files with no section table. This time without breaking the bots. llvm-svn: 241869	2015-07-09 22:32:24 +00:00
Alex Lorenz	5c443ee383	MIR Serialization: Serialize the virtual register definitions. The virtual registers are serialized using a YAML sequence of YAML inline mappings. Each mapping has the id of the virtual register and the register class. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10981 llvm-svn: 241868	2015-07-09 22:23:13 +00:00
Reid Kleckner	733508d4e8	[WinEH] Give up on using CSRs across 32-bit invokes for now The runtime does not restore CSRs when transferring control back to the function handling the exception. According to the experts on IRC, LLVM's register allocator has no way to model register clobbers that only happen on one edge of the CFG. For now, don't worry about trying to use the meager three CSRs available on 32-bit X86 and just say that such invokes preserve nothing. llvm-svn: 241865	2015-07-09 22:09:41 +00:00
Alex Lorenz	e56a7b90eb	MIR Parser: Report an error when parsing machine function with an empty body. This commit adds a new error which is reported when the MIR Parser encounters a machine function without any machine basic blocks. The machine verifier expects that the machine functions have at least one MBB, and this error will prevent machine functions without MBBs from reaching the machine verifier and crashing with an assertion. llvm-svn: 241862	2015-07-09 21:21:33 +00:00
Sanjoy Das	fefc8af84a	[ImplicitNullChecks] Be smarter in picking the memory op. Summary: Before this change ImplicitNullChecks would only pick loads of the form: ``` test Reg, Reg jz elsewhere fallthrough: movl 32(Reg), Reg2 ``` but not (say) ``` test Reg, Reg jz elsewhere fallthrough: inc Reg3 movl 32(Reg), Reg2 ``` This change teaches ImplicitNullChecks to look through "unrelated" instructions like `inc Reg3` when searching for a load instruction to convert to a trapping load. Reviewers: atrick, JosephTremoulet, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11044 llvm-svn: 241850	2015-07-09 20:13:25 +00:00
Rafael Espindola	111ba6bab3	Create BSD archives by default on OS X. They should probably be created on anything that is not windows or linux, but I will test on freebsd before changing that. With this it is possible to bootstrap with llvm-ar instead of ar+ranlib on OS X. llvm-svn: 241849	2015-07-09 20:12:50 +00:00
Alex Lorenz	60440f3a04	MIR Serialization: Serialize the simple MachineFrameInfo attributes. This commit serializes the 13 scalar boolean and integer attributes from the MachineFrameInfo class: IsFrameAddressTaken, IsReturnAddressTaken, HasStackMap, HasPatchPoint, StackSize, OffsetAdjustment, MaxAlignment, AdjustsStack, HasCalls, MaxCallFrameSize, HasOpaqueSPAdjustment, HasVAStart, and HasMustTailInVarArgFunc. These attributes are serialized as part of the frameInfo YAML mapping, which itself is a part of the machine function's YAML mapping. llvm-svn: 241844	2015-07-09 19:55:27 +00:00
Rafael Espindola	282d12d41c	llvm-ar: Pad the symbol table to 4 bytes. It looks like ld64 requires it. With this we seem to be able to bootstrap using llvm-ar+/usr/bin/true instead of ar+ranlib (currently on stage2). llvm-svn: 241842	2015-07-09 19:48:06 +00:00
Sanjoy Das	6b9a30bed3	[IndVars] Try to use existing values in RewriteLoopExitValues. Summary: In RewriteLoopExitValues, before expanding out an SCEV expression using SCEVExpander, try to see if an existing LLVM IR expression already computes the value we're interested in. If so use that existing expression. Apart from reducing IndVars' reliance on the rest of the compilation pipeline, this also prevents IndVars from concluding some expressions as "high cost" when they're not. For instance, `InductiveRangeCheckElimination` often emits code of the following form: ``` len = umin(len_A, len_B) loop: ... if (i++ < len) goto loop outside_loop: use(i) ``` `SCEVExpander` refuses to rewrite the use of `i` in `outside_loop`, since it thinks the value of `i` on loop exit, `len`, is a high cost expansion since it contains an `umax` in it. With this change, `IndVars` can see that it can re-use `len` instead of creating a new expression to compute `umin(len_A, len_B)`. I considered putting this cleverness in `SCEVExpander`, but I was worried that it may then have a deterimental effect on other passes that use it. So I decided it was better to just do this in the one place where it seems like an obviously good idea, with the intent of generalizing later if needed. Reviewers: atrick, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10782 llvm-svn: 241838	2015-07-09 18:46:12 +00:00
Simon Atanasyan	a847de6456	[llvm-readobj] Print MIPS PLT table Now the -mips-plt-got prints both MIPS GOT and PLT tables. llvm-svn: 241836	2015-07-09 18:23:10 +00:00
Simon Atanasyan	901c8a500a	[yaml2obj] Initialize sh_addralign field for the .symtab section llvm-svn: 241835	2015-07-09 18:23:02 +00:00
Simon Atanasyan	2f77b9cd0e	[ELF] Explicitly configure sections alignment in the tests. NFC llvm-svn: 241834	2015-07-09 18:22:55 +00:00
David Majnemer	40f539becd	[llvm-objdump] Require that jump targets shown in -d are functions Don't let the disassembler pick call <.text> if a function happens to live at the start of the section by only using function symbols. llvm-svn: 241830	2015-07-09 18:11:40 +00:00
Pat Gavlin	a6d3ba4544	Allow {e,r}bp as the target of {read,write}_register. This patch allows the read_register and write_register intrinsics to read/write the RBP/EBP registers on X86 iff the targeted register is the frame pointer for the containing function. Differential Revision: http://reviews.llvm.org/D10977 llvm-svn: 241827	2015-07-09 17:40:29 +00:00
Sanjay Patel	6ce96b0ff0	fix an invisible bug when combining repeated FP divisors This patch fixes bugs that were exposed by the addition of fast-math-flags in the DAG: r237046 ( http://reviews.llvm.org/rL237046 ): 1. When replacing a division node, it's not enough to RAUW. We should call CombineTo() to delete dead nodes and combine again. 2. Because we are changing the DAG, we can't return an empty SDValue after the transform. As the code comments say: Visitation implementation - Implement dag node combining for different node types. The semantics are as follows: Return Value: SDValue.getNode() == 0 - No change was made SDValue.getNode() == N - N was replaced, is dead and has been handled. otherwise - N should be replaced by the returned Operand. The new test case shows no difference with or without this patch, but it will crash if we re-apply r237046 or enable FMF via the current -enable-fmf-dag cl::opt. Differential Revision: http://reviews.llvm.org/D9893 llvm-svn: 241826	2015-07-09 17:28:37 +00:00
Silviu Baranga	a5bff005e9	Add a test of a regression discovered during testing of r241673 Summary: We were missing a corner case where DepCands was not available, but we were using DepCands to compute the checking pointer groups. This adds a test for that regression. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11068 llvm-svn: 241818	2015-07-09 16:40:25 +00:00
Rafael Espindola	bc80c9fa30	Add missing file from previous commit. llvm-svn: 241815	2015-07-09 15:58:22 +00:00
Rafael Espindola	70e52a227c	Basic support for BSD symbol tables in archives. This could be optimized and for now we only produce __.SYMDEF and not "__.SYMDEF SORTED". llvm-svn: 241814	2015-07-09 15:56:23 +00:00
Silviu Baranga	3a8eca654a	Don't rely on the DepCands iteration order when constructing checking pointer groups Summary: The checking pointer group construction algorithm relied on the iteration on DepCands. We would need the same leaders across runs and the same iteration order over the underlying std::set for determinism. This changes the algorithm to process the pointers in the order in which they were added to the runtime check, which is deterministic. We need to update the tests, since the order in which pointers appear has changed. No new tests were added, since it is impossible to test for non-determinism. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11064 llvm-svn: 241809	2015-07-09 15:18:25 +00:00
Pawel Bylica	b5caea461d	Reapply fixed r241790: Fix shift legalization and lowering for big constants. Summary: If shift amount is a constant value > 64 bit it is handled incorrectly during type legalization and X86 lowering. This patch the type of shift amount argument in function DAGTypeLegalizer::ExpandShiftByConstant from unsigned to APInt. Reviewers: nadav, majnemer, sanjoy, RKSimon Subscribers: RKSimon, llvm-commits Differential Revision: http://reviews.llvm.org/D10767 llvm-svn: 241806	2015-07-09 14:58:04 +00:00
Krzysztof Parzyszek	cac9b5847a	[Hexagon] Add support for atomic RMW operations llvm-svn: 241804	2015-07-09 14:51:21 +00:00
Arnaud A. de Grandmaison	90c89b61da	[AArch64] Select SBFIZ or UBFIZ instead of left + right shifts And rename LSB to Immr / MSB to Imms to match the ARM ARM terminology. llvm-svn: 241803	2015-07-09 14:33:38 +00:00
Scott Douglass	08f31271aa	[ARM] Thumb1 3 to 2 operand convertion for commutative operations Differential Revision: http://reviews.llvm.org/D11057 llvm-svn: 241802	2015-07-09 14:13:55 +00:00
Scott Douglass	480340a7cb	[ARM] Don't be overzealous converting Thumb1 3 to 2 operands Differential Revision: http://reviews.llvm.org/D11056 llvm-svn: 241801	2015-07-09 14:13:48 +00:00
Scott Douglass	82d04ef2eb	[ARM] Add Thumb2 ADD with PC narrowing from 3 operand to 2 Differential Revision: http://reviews.llvm.org/D11055 llvm-svn: 241800	2015-07-09 14:13:41 +00:00
Scott Douglass	a3566efdd9	[ARM] Refactor converting Thumb1 from 3 to 2 operand (nfc) Also adds some test cases. Differential Revision: http://reviews.llvm.org/D11054 llvm-svn: 241799	2015-07-09 14:13:34 +00:00
Scott Douglass	7b927c9cd5	[ARM] Add ADD tests for Thumb2 narrowing (nfc) Differential Revision: http://reviews.llvm.org/D11053 llvm-svn: 241798	2015-07-09 14:13:22 +00:00
Renato Golin	fc38fe2618	Test for 241794 (nest attribute in AArch64) Forgot to git add the test. Patch by Stephen Cross. llvm-svn: 241797	2015-07-09 13:29:35 +00:00
Pawel Bylica	7aa3d79c2c	Revert r241790: Fix shift legalization and lowering for big constants. llvm-svn: 241792	2015-07-09 09:50:54 +00:00
Pawel Bylica	f94083a5ee	Fix shift legalization and lowering for big constants. Summary: If shift amount is a constant value > 64 bit it is handled incorrectly during type legalization and X86 lowering. This patch the type of shift amount argument in function DAGTypeLegalizer::ExpandShiftByConstant from unsigned to APInt. Reviewers: nadav, majnemer, sanjoy, RKSimon Subscribers: RKSimon, llvm-commits Differential Revision: http://reviews.llvm.org/D10767 llvm-svn: 241790	2015-07-09 08:01:36 +00:00
Elena Demikhovsky	88c04dfc81	Extended syntax of vector version of getelementptr instruction. The justification of this change is here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-March/082989.html According to the current GEP syntax, vector GEP requires that each index must be a vector with the same number of elements. %A = getelementptr i8, <4 x i8> %ptrs, <4 x i64> %offsets In this implementation I let each index be or vector or scalar. All vector indices must have the same number of elements. The scalar value will mean the splat vector value. (1) %A = getelementptr i8, i8 %ptr, <4 x i64> %offsets or (2) %A = getelementptr i8, <4 x i8> %ptrs, i64 %offset In all cases the %A type is <4 x i8> In the case (2) we add the same offset to all pointers. The case (1) covers C[B[i]] case, when we have the same base C and different offsets B[i]. The documentation is updated. http://reviews.llvm.org/D10496 llvm-svn: 241788	2015-07-09 07:42:48 +00:00
Adrian Prantl	e71dc1d0ed	Temporarily reverting 241765, 241768, and 241772 to unbreak the build bots. llvm-svn: 241781	2015-07-09 02:14:49 +00:00
Michael J. Spencer	b5535ebef9	[Object][ELF] Support dumping hash-tables from files with no section table. llvm-svn: 241765	2015-07-09 00:21:06 +00:00
David Majnemer	7357b5b4b1	[CodeView] Add support for emitting column information Column information is present in CodeView when the line table subsection has bit 0 set to 1 in it's flags field. The column information is represented as a pair of 16-bit quantities: a starting and ending column. This information is present at the end of the chunk, after all the line-PC pairs. llvm-svn: 241764	2015-07-09 00:19:51 +00:00
Alex Lorenz	284c4b13e7	MIR Serialization: Serialize the 'undef' register machine operand flag. llvm-svn: 241762	2015-07-08 23:58:31 +00:00
Sanjay Patel	c325a497d8	[SLPVectorizer] Try different vectorization factors for store chains ...and set max vector register size based on target This patch is based on discussion on the llvmdev mailing list: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-July/087405.html and also solves: https://llvm.org/bugs/show_bug.cgi?id=17170 Several FIXME/TODO items are noted in comments as potential improvements. Differential Revision: http://reviews.llvm.org/D10950 llvm-svn: 241760	2015-07-08 23:40:55 +00:00
Adam Nemet	24c6a55664	[LAA] Revert a small part of r239295 This commit ([LAA] Fix estimation of number of memchecks) regressed the logic a bit. We shouldn't quit the analysis if we encounter a pointer without known bounds unless we actually need to emit a memcheck for it. The original code was using NumComparisons which is now computed differently. Instead I compute NeedRTCheck from NumReadPtrChecks and NumWritePtrChecks. As side note, I find the separation of NeedRTCheck and CanDoRT confusing, so I will try to merge them in a follow-up patch. llvm-svn: 241756	2015-07-08 22:58:48 +00:00
Sanjay Patel	806e80e796	[x86] enable machine combiner reassociations for scalar single-precision multiplies llvm-svn: 241752	2015-07-08 22:35:20 +00:00
Rafael Espindola	a73aa6c568	Don't reject an archive with just a symbol table. It is pretty unambiguous how to interpret it and gnu ar accepts it too. llvm-svn: 241750	2015-07-08 22:27:54 +00:00
Eli Bendersky	374b7e43da	Add tests for the NVPTXLowerAggrCopies pass. Note: not testing memmove lowering for now, as it's broken [see https://llvm.org/bugs/show_bug.cgi?id=24056] llvm-svn: 241736	2015-07-08 21:29:28 +00:00

1 2 3 4 5 ...

31061 Commits