llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 23:42:52 +01:00

Author	SHA1	Message	Date
Michael Gottesman	a2ef7dd057	[stackprotector] Forgot to add in PR number to test case. llvm-svn: 191261	2013-09-24 02:10:55 +00:00
Michael Gottesman	2ec63d27a9	[stackprotector] Allow for copies from vreg -> vreg to be in a terminator sequence. Sometimes a copy from a vreg -> vreg sneaks into the middle of a terminator sequence. It is safe to slice this into the stack protector success bb. This fixes PR16979. llvm-svn: 191260	2013-09-24 01:50:26 +00:00
Bill Wendling	339b0f39aa	Selecting the address from a very long chain of GEPs can blow the stack. The recursive nature of the address selection code can cause the stack to explode if there is a long chain of GEPs. Convert the recursive bit into an iterative method to avoid this. <rdar://problem/12445434> llvm-svn: 191252	2013-09-24 00:13:08 +00:00
Reed Kotler	ed09a36fb5	Make nomips16 mask not repeat if it ends with a '.'. This mask is purely for debugging and testing. llvm-svn: 191231	2013-09-23 22:36:11 +00:00
Ben Langmuir	706a7ccbeb	Add sha intrinsic tests These should have been included with r190864, but I forgot to use svn add. llvm-svn: 191208	2013-09-23 16:57:52 +00:00
Daniel Sanders	ced4e4005c	[mips][msa] Added support for matching addvi, and subvi from normal IR (i.e. not intrinsics) llvm-svn: 191203	2013-09-23 14:29:55 +00:00
Daniel Sanders	34cb8f3e4d	[mips][msa] Added support for matching insert and copy from normal IR (i.e. not intrinsics) Changes to MIPS SelectionDAG: * Added nodes VEXTRACT_[SZ]EXT_ELT to represent extract and extend in a single operation and implemented the DAG combines necessary to fold sign/zero extends into the extract. llvm-svn: 191199	2013-09-23 14:03:12 +00:00
Daniel Sanders	d1df1263eb	[mips][msa] Added support for matching pcnt from normal IR (i.e. not intrinsics) llvm-svn: 191198	2013-09-23 13:40:21 +00:00
Daniel Sanders	7d945d142d	[mips][msa] Added support for matching nor from normal IR (i.e. not intrinsics) llvm-svn: 191195	2013-09-23 13:22:24 +00:00
Daniel Sanders	91c78d1d33	[mips][msa] Added support for matching and, or, and xor from normal IR (i.e. not intrinsics) llvm-svn: 191194	2013-09-23 12:57:42 +00:00
Daniel Sanders	d3c403c386	[mips][msa] Implemented build_vector using ldi, fill, and custom SelectionDAG nodes (VSPLAT and VSPLATD) Note: There's a later patch on my branch that re-implements this to select build_vector without the custom SelectionDAG nodes. The future patch avoids the constant-folding problems stemming from the custom node (i.e. it doesn't need to re-implement all the DAG combines related to BUILD_VECTOR). Changes to MIPS specific SelectionDAG nodes: * Added VSPLAT This is a special case of BUILD_VECTOR that covers the case where the BUILD_VECTOR is a splat operation. * Added VSPLATD This is a special case of VSPLAT that handles the cases when v2i64 is legal llvm-svn: 191191	2013-09-23 12:02:46 +00:00
Tim Northover	c9a7e47164	ISelDAG: spot chain cycles involving MachineNodes Previously, the DAGISel function WalkChainUsers was spotting that it had entered already-selected territory by whether a node was a MachineNode (amongst other things). Since it's fairly common practice to insert MachineNodes during ISelLowering, this was not the correct check. Looking around, it seems that other nodes get their NodeId set to -1 upon selection, so this makes sure the same thing happens to all MachineNodes and uses that characteristic to determine whether we should stop looking for a loop during selection. This should fix PR15840. llvm-svn: 191165	2013-09-22 08:21:56 +00:00
Venkatraman Govindaraju	ae9ddc5768	[Sparc] Add support for TLS in sparc. llvm-svn: 191164	2013-09-22 06:48:52 +00:00
Venkatraman Govindaraju	df68ba133b	[SPARC] Make functions with GLOBAL_OFFSET_TABLE access as non-leaf functions. llvm-svn: 191160	2013-09-22 01:40:24 +00:00
Venkatraman Govindaraju	e3ed207140	[Sparc] Emit .register directive to declare the use of global registers %g2, %g4, %g6 and %g7. llvm-svn: 191158	2013-09-22 00:42:30 +00:00
Venkatraman Govindaraju	54744c0b41	[Sparc] Fix lowering FABS on fp128 (long double) on pre-v9 targets. llvm-svn: 191154	2013-09-21 23:51:08 +00:00
Juergen Ributzka	b55735e2d8	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." This reverts commit r191130. llvm-svn: 191138	2013-09-21 15:09:46 +00:00
Juergen Ributzka	32cca125e1	[X86] Emulate AVX 256bit MIN/MAX support by splitting the vector. In AVX 256bit vectors are valid vectors and therefore the Type Legalizer doesn't split the VSELECT and SETCC nodes. AVX only supports MIN/MAX on 128bit vectors and this fix enables vector splitting for this special case in the X86 DAG Combiner. This fix is related to PR16695, PR17002, and <rdar://problem/14594431>. llvm-svn: 191131	2013-09-21 04:55:22 +00:00
Juergen Ributzka	67e5289ff2	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is too wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask for the given target. This mask usually has the same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to successfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. llvm-svn: 191130	2013-09-21 04:55:18 +00:00
NAKAMURA Takumi	81025ea333	Initialize BSSSection explicitly in InitMachOMCObjectFileInfo() to appease msvc. This can revert r191087. llvm-svn: 191128	2013-09-21 02:34:45 +00:00
Reed Kotler	c8c27affb8	Set .reorder for the stub so that gas takes care of delay slot processing. llvm-svn: 191125	2013-09-21 01:37:52 +00:00
NAKAMURA Takumi	69102d22a5	llvm/test: Mark 3 tests as XFAIL:msvc. llvm-svn: 191087	2013-09-20 12:57:34 +00:00
Kai Nacke	64a3fccd60	PR16726: extend rol/ror matching C-like languages promote types like unsigned short to unsigned int before performing an arithmetic operation. Currently the rotate matcher in the DAGCombiner does not consider this situation. This commit extends the DAGCombiner in the way that the pattern (or (shl ([az]ext x), (ext y)), (srl ([az]ext x), (ext (sub 32, y)))) is folded into ([az]ext (rotl x, y)) The matching is restricted to aext and zext because in these cases the upper bits are either undefined or known. Test case is included. This fixes PR16726. llvm-svn: 191049	2013-09-19 23:00:28 +00:00
Kai Nacke	21c0476931	Revert PR16726: extend rol/ror matching There is a buildbot failure. Need to investigate this. llvm-svn: 191048	2013-09-19 22:53:36 +00:00
Kai Nacke	fe02753846	PR16726: extend rol/ror matching C-like languages promote types like unsigned short to unsigned int before performing an arithmetic operation. Currently the rotate matcher in the DAGCombiner does not consider this situation. This commit extends the DAGCombiner in the way that the pattern (or (shl ([az]ext x), (ext y)), (srl ([az]ext x), (ext (sub 32, y)))) is folded into ([az]ext (rotl x, y)) The matching is restricted to aext and zext because in these cases the upper bits are either undefined or known. Test case is included. This fixes PR16726. llvm-svn: 191045	2013-09-19 22:36:39 +00:00
Bill Wendling	01837a249d	Add testcase to make sure we don't generate too many jumps for a une compare. <rdar://problem/7859988> llvm-svn: 191040	2013-09-19 21:58:20 +00:00
Benjamin Kramer	f939b2b330	DAGCombiner: Don't fold vector muls with constants that look like a splat of a power of 2 but differ in bit width. PR17283. llvm-svn: 191000	2013-09-19 13:28:20 +00:00
Justin Holewinski	940e5b09fe	[NVPTX] Make constant vector test case endian-independent llvm-svn: 190998	2013-09-19 13:14:44 +00:00
Justin Holewinski	69db0d8365	[NVPTX] Support constant vector globals llvm-svn: 190997	2013-09-19 12:51:46 +00:00
Amara Emerson	7ad0409c56	[ARMv8] Add support for the v8 cryptography extensions. llvm-svn: 190996	2013-09-19 11:59:01 +00:00
Tim Northover	89d57eb12b	X86: FrameIndex addressing modes do have a base register. When selecting the DAG (add (WrapperRIP ...), (FrameIndex ...)), X86 code had spotted the FrameIndex possibility and was working out whether it could fold the WrapperRIP into this. The test for forming a %rip version is notionally whether we already have a base or index register (%rip precludes both), but we were forgetting to account for the register that would be inserted later to access the frame. rdar://problem/15024520 llvm-svn: 190995	2013-09-19 11:33:53 +00:00
Reed Kotler	03a3269d15	Fix two issues regarding Got pointer (GP) setup. 1) make sure that the first two instructions of the sequence cannot separate from each other. The linker requires that they be sequential. If they get separated, it can still work but it will not work in all cases because the first of the instructions mostly involves the hi part of the pc relative offset and that part changes slowly. You would have to be at the right boundary for this to matter. 2) make sure that this sequence begins on a longword boundary. There appears to be a bug in binutils which makes some of these calculations get messed up if the instruction sequence does not begin on a longword boundary. This is being investigated with the appropriate binutils folks. llvm-svn: 190966	2013-09-18 22:46:09 +00:00
Preston Gurd	efaca57b70	Attempt to fix llvm-ppc64-linux2 buildbot failure by adding -march=x86 to SLM test. llvm-svn: 190958	2013-09-18 21:39:33 +00:00
Preston Gurd	1800994293	Verify that llvm can generate the prefetchw instruction when the CPU is Atom Silvermont. Patch by Sriram Murali. llvm-svn: 190957	2013-09-18 21:08:09 +00:00
Richard Sandiford	76d1801e90	[SystemZ] Add unsigned compare-and-branch instructions For some reason I never got around to adding these at the same time as the signed versions. No idea why. I'm not sure whether this SystemZII::BranchC* stuff is useful, or whether it should just be replaced with an "is normal" flag. I'll leave that for later though. There are some boundary conditions that can be tweaked, such as preferring unsigned comparisons for equality with [128, 256), and "<= 255" over "< 256", but again I'll leave those for a separate patch. llvm-svn: 190930	2013-09-18 09:56:40 +00:00
Craig Topper	5d022196de	Lift alignment restrictions for load/store folding on VINSERTF128/VEXTRACTF128. Fixes PR17268. llvm-svn: 190916	2013-09-18 03:55:53 +00:00
Reid Kleckner	130539949d	COFF: Ensure that objects produced by LLVM link with /safeseh Summary: We indicate that the object files are safe by emitting a @feat.00 absolute address symbol. The address is presumably interpreted as a bitfield of features that the compiler would like to enable. Bit 0 is documented in the PE COFF spec to opt in to "registered SEH", which is what /safeseh enables. LLVM's object files are safe by default because LLVM doesn't know how to produce SEH handlers. Reviewers: Bigcheese CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1691 llvm-svn: 190898	2013-09-17 23:18:05 +00:00
Bill Schmidt	f26512486a	[PowerPC] Fix problems with large code model (PR17169). Large code model on PPC64 requires creating and referencing TOC entries when using the addis/ld form of addressing. This was not being done in all cases. The changes in this patch to PPCAsmPrinter::EmitInstruction() fix this. Two test cases are also modified to reflect this requirement. Fast-isel was not creating correct code for loading floating-point constants using large code model. This also requires the addis/ld form of addressing. Previously we were using the addis/lfd shortcut which is only applicable to medium code model. One test case is modified to reflect this requirement. llvm-svn: 190882	2013-09-17 20:03:25 +00:00
Kevin Qin	3be5824550	Implement 3 AArch64 neon instructions : umov smov ins. llvm-svn: 190839	2013-09-17 02:21:02 +00:00
Quentin Colombet	475709bdc2	[SelectionDAG] Teach the vector scalarizer about TRUNCATE. When a truncate node defines a legal vector type but uses an illegal vector type, the legalization process was splitting the vector until <1 x vector> type, but then it was failing to scalarize the node because it did not know how to handle TRUNCATE. <rdar://problem/14989896> llvm-svn: 190830	2013-09-17 00:26:56 +00:00
Preston Gurd	6efe5100eb	Add Atom Silvermont (slm) tests - check that -mcpu=slm uses the call register indirect optimization - check that -mcpu=slm runs the scheduler - check that -mcpu=slm supports the movbe instruction llvm-svn: 190814	2013-09-16 22:22:07 +00:00
Richard Sandiford	ea2b1a8b94	[SystemZ] Improve extload handling The port originally had special patterns for extload, mapping them to the same instructions as sextload. It seemed neater to have patterns that match "an extension that is allowed to be signed" and "an extension that is allowed to be unsigned". This was originally meant to be a clean-up, but it does improve the handling of promoted integers a little, as shown by args-06.ll. llvm-svn: 190777	2013-09-16 09:03:10 +00:00
Peter Collingbourne	cf3b1a2910	Implement function prefix data as an IR feature. Previous discussion: http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-July/063909.html Differential Revision: http://llvm-reviews.chandlerc.com/D1191 llvm-svn: 190773	2013-09-16 01:08:15 +00:00
Hal Finkel	5bb449bca0	PPC: Don't restrict lvsl generation to after type legalization This is a re-commit of r190764, with an extra check to make sure that we're not performing the transformation on illegal types (a small test case has been added for this as well). Original commit message: The PPC backend uses a target-specific DAG combine to turn unaligned Altivec loads into a permutation-based sequence when possible. Unfortunately, the target-specific DAG combine is not always called on all loads of interest (sometimes the routines in DAGCombine call CombineTo such that the new node and users are not added to the worklist); allowing the combine to trigger early (before type legalization) mitigates this problem. Because the autovectorizers only create legal vector types, I don't expect a lot of cases where this optimization is enabled by type legalization in practice. llvm-svn: 190771	2013-09-15 22:09:58 +00:00
Hal Finkel	c45bfe85cc	Revert r190764: PPC: Don't restrict lvsl generation to after type legalization This is causing test-suite failures. Original commit message: The PPC backend uses a target-specific DAG combine to turn unaligned Altivec loads into a permutation-based sequence when possible. Unfortunately, the target-specific DAG combine is not always called on all loads of interest (sometimes the routines in DAGCombine call CombineTo such that the new node and users are not added to the worklist); allowing the combine to trigger early (before type legalization) mitigates this problem. Because the autovectorizers only create legal vector types, I don't expect a lot of cases where this optimization is enabled by type legalization in practice. llvm-svn: 190765	2013-09-15 15:41:11 +00:00
Hal Finkel	ae7feec56e	PPC: Don't restrict lvsl generation to after type legalization The PPC backend uses a target-specific DAG combine to turn unaligned Altivec loads into a permutation-based sequence when possible. Unfortunately, the target-specific DAG combine is not always called on all loads of interest (sometimes the routines in DAGCombine call CombineTo such that the new node and users are not added to the worklist); allowing the combine to trigger early (before type legalization) mitigates this problem. Because the autovectorizers only create legal vector types, I don't expect a lot of cases where this optimization is enabled by type legalization in practice. llvm-svn: 190764	2013-09-15 15:20:54 +00:00
Hal Finkel	fc7b3598ec	Prevent assert in CombinerGlobalAA with null values DAGCombiner::isAlias can be called with SrcValue1 or SrcValue2 null, and we can't use AA in this case (if we try, then the casting code in AA will assert). llvm-svn: 190763	2013-09-15 02:19:49 +00:00
Reed Kotler	0d8133f6fe	Expand the mask capability for deciding which functions are mips16 and mips32 so it can be better used for general interoperability testing between mips32 and mips16. llvm-svn: 190762	2013-09-15 02:09:08 +00:00
Joey Gouly	0af412fe63	[ARMv8] Change hasV8Fp to hasFPARMv8, and other command line options to be more consistent. llvm-svn: 190692	2013-09-13 13:46:57 +00:00
Joey Gouly	2b0127dd73	[ARMv8] Emit the proper .fpu directive. Patch by Bradley Smith! llvm-svn: 190683	2013-09-13 11:51:52 +00:00
Richard Sandiford	30374b51cb	[SystemZ] Try to fold shifts into TMxx E.g. "SRL %r2, 2; TMLL %r2, 1" => "TMLL %r2, 4". llvm-svn: 190672	2013-09-13 09:09:50 +00:00
Vincent Lejeune	439c29a29d	R600: Move code handling literal folding into R600ISelLowering. llvm-svn: 190644	2013-09-12 23:44:53 +00:00
Vincent Lejeune	82c06999cd	R600: Move fabs/fneg/sel folding logic into PostProcessIsel This move makes it possible to correctly handle multiple instructions from a single pattern. llvm-svn: 190643	2013-09-12 23:44:44 +00:00
Hal Finkel	605f51b771	Remove unnecessary TBAA metadata from r190636's test case llvm-svn: 190637	2013-09-12 23:23:12 +00:00
Hal Finkel	4b3cfb4727	Fix PPC ABI for ByVal structs with vector members When a structure is passed by value, and that structure contains a vector member, according to the PPC ABI, the structure will receive enhanced alignment (so that the vector within the structure will always be aligned). This should resolve PR16641. llvm-svn: 190636	2013-09-12 23:20:06 +00:00
Hal Finkel	47bfa9a072	Make the PPC fast-math sqrt expansion safe at 0 In fast-math mode sqrt(x) is calculated using the fast expansion of the reciprocal of the reciprocal sqrt expansion. The reciprocal and reciprocal sqrt expansions use the associated estimate instructions along with some Newton iterations. Unfortunately, as a result, sqrt(0) was being calculated as NaN, which is not correct. Now we explicitly return a result of zero if the input is zero. llvm-svn: 190624	2013-09-12 19:04:12 +00:00
Elena Demikhovsky	139f25ed2c	AVX-512: implemented extractelement with variable index. Added parsing of mask register and "zeroing" semantic, like {%k1} {z}. llvm-svn: 190595	2013-09-12 08:55:00 +00:00
Hal Finkel	6164109851	PPC: Enable aggressive anti-dependency breaking Aggressive anti-dependency breaking is enabled by default for all PPC cores. This provides a general speedup on the P7 and other platforms (among other factors, the instruction group formation for the non-embedded PPC cores is done during post-RA scheduling). In order to do this safely, the incompatibility between uses of the MFOCRF instruction and anti-dependency breaking are resolved by marking MFOCRF with hasExtraSrcRegAllocReq. As noted in the removed FIXME, the problem was that MFOCRF's output is sensitive to the identity of the source register, and always paired with a shift to undo this effect. Because anti-dependency breaking is unaware of this hidden dependency of the shift amount on the source register of the MFOCRF instruction, changing that register must be inhibited. Two test cases were adjusted: The SjLj test was made more insensitive to register choices and scheduling; the saveCR test disabled anti-dependency breaking because part of what it is testing is proper register reuse. llvm-svn: 190587	2013-09-12 05:24:49 +00:00
Tom Stellard	6a507da088	R600/SI: expose TBUFFER_STORE_FORMAT_* for OpenGL transform feedback For _XYZ, the type of VDATA is v4i32, because v3i32 doesn't exist. The ADDR64 bit is not exposed. A simpler intrinsic that doesn't take a resource descriptor might be nicer. The maximum number of input SGPRs is bumped to 17. Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 190575	2013-09-12 02:55:14 +00:00
Bill Wendling	911391c864	Try to fix the atom buildbots by adding an explicit 'cpu' to the 'llc' command. llvm-svn: 190541	2013-09-11 19:06:04 +00:00
Daniel Sanders	16a6e0ac3d	[mips][msa] Added test cases that were supposed to be part of r190507, r190509, r190512, and r190518. llvm-svn: 190522	2013-09-11 12:39:25 +00:00
Daniel Sanders	e3f2e5de18	[mips][msa] Added support for matching mulv, nlzc, sll, sra, srl, and subv from normal IR (i.e. not intrinsics) llvm-svn: 190518	2013-09-11 11:58:30 +00:00
Daniel Sanders	96466ff8b4	[mips][msa] Added support for matching fadd, fdiv, flog2, fmul, frint, fsqrt, and fsub from normal IR (i.e. not intrinsics) llvm-svn: 190512	2013-09-11 10:51:30 +00:00
Daniel Sanders	a52c7f09dc	[mips][msa] Added support for matching div_[su] from normal IR (i.e. not intrinsics) llvm-svn: 190509	2013-09-11 10:38:58 +00:00
Daniel Sanders	f68b00e629	[mips][msa] Added support for matching addv from normal IR (i.e. not intrinsics) The corresponding intrinsic is now lowered into equivalent IR (ISD::ADD) before instruction selection. llvm-svn: 190507	2013-09-11 10:28:16 +00:00
Daniel Sanders	534d28aa11	[mips][msa] Corrected the definition of the dotp_[su].[hwd] intrinsics The elements of the operands should be half the width of the elements of the result. llvm-svn: 190505	2013-09-11 09:59:17 +00:00
Richard Sandiford	bfcf129b8e	[SystemZ] Add TM and TMY The main complication here is that TM and TMY (the memory forms) set CC differently from the register forms. When the tested bits contain some 0s and some 1s, the register forms set CC to 1 or 2 based on the value of the uppermost bit. The memory forms instead set CC to 1 regardless of the uppermost bit. Until now, I've tried to make it so that a branch never tests for an impossible CC value. E.g. NR only sets CC to 0 or 1, so branches on the result will only test for 0 or 1. Originally I'd tried to do the same thing for TM and TMY by using custom matching code in ISelDAGToDAG. That ended up being very ugly though, and would have meant duplicating some of the chain checks that the common isel code does. I've therefore gone for the simpler alternative of adding an extra operand to the TM DAG opcode to say whether a memory form would be OK. This means that the inverse of a "TM;JE" is "TM;JNE" rather than the more precise "TM;JNLE", just like the inverse of "TMLL;JE" is "TMLL;JNE". I suppose that's arguably less confusing though... llvm-svn: 190400	2013-09-10 10:20:32 +00:00
Daniel Sanders	32227b7995	[mips][msa] Removed unsupported dot product instructions (dotp_[su].b) The dotp_[su].b instructions never existed in any revision of the MSA spec. llvm-svn: 190398	2013-09-10 09:51:43 +00:00
Bill Wendling	5e97475233	Another attempt to fix windows buildbots. llvm-svn: 190350	2013-09-09 20:29:32 +00:00
Bill Wendling	5dadcab742	Attempt to fix buildbots by giving an explicit output to the llvm-mc command. llvm-svn: 190349	2013-09-09 20:22:38 +00:00
Bill Wendling	10cf877e75	Expand test to make sure that we can generate compact unwind from an ASM file. llvm-svn: 190348	2013-09-09 20:12:36 +00:00
Bill Wendling	a2bc7420c8	Expand test to make sure that we can generate compact unwind from an ASM file. llvm-svn: 190347	2013-09-09 20:10:54 +00:00
Joey Gouly	03af45ccfe	[ARMv8] Prevent generation of deprecated IT blocks on ARMv8 in Thumb mode. IT blocks can only be one instruction long, and can only contain a subset of the 16 instructions. Patch by Artyom Skrobov! llvm-svn: 190309	2013-09-09 14:21:49 +00:00
Robert Lytton	b73a61715b	XCore handling of thread local lowering Fix XCoreLowerThreadLocal trying to initialise globals which have no initializer. Add handling of const expressions containing thread local variables. These need to be replaced with instructions, as the thread ID is used to access the thread local variable. llvm-svn: 190300	2013-09-09 10:42:11 +00:00
Robert Lytton	dc8d32008e	XCore target: change to Sched::Source This sidesteps a bug in PrescheduleNodesWithMultipleUses() which does not check if callResources will be affected by the transformation. llvm-svn: 190299	2013-09-09 10:42:05 +00:00
Robert Lytton	4a5772968b	XCore target: fix weak linkage attribute handling llvm-svn: 190298	2013-09-09 10:41:57 +00:00
Bill Wendling	2c532e9c9b	Generate compact unwind encoding from CFI directives. We used to generate the compact unwind encoding from the machine instructions. However, this had the problem that if the user used `-save-temps' or compiled their hand-written `.s' file (with CFI directives), we wouldn't generate the compact unwind encoding. Move the algorithm that generates the compact unwind encoding into the MCAsmBackend. This way we can generate the encoding whether the code is from a `.ll' or `.s' file. <rdar://problem/13623355> llvm-svn: 190290	2013-09-09 02:37:14 +00:00
Jiangning Liu	b2cc9767e4	Implement aarch64 neon instruction set AdvSIMD (3V Diff), covering the following 26 instructions, SADDL, UADDL, SADDW, UADDW, SSUBL, USUBL, SSUBW, USUBW, ADDHN, RADDHN, SABAL, UABAL, SUBHN, RSUBHN, SABDL, UABDL, SMLAL, UMLAL, SMLSL, UMLSL, SQDMLAL, SQDMLSL, SMULL, UMULL, SQDMULL, PMULL llvm-svn: 190288	2013-09-09 02:20:27 +00:00
Manman Ren	edc0da266c	Debug Info Testing: use null instead of an empty string in context field. llvm-svn: 190284	2013-09-09 00:12:17 +00:00
Manman Ren	fa420c3e35	Debug Info Testing: update context from empty string to null. Context should be either null or MDNode. llvm-svn: 190267	2013-09-08 03:11:54 +00:00
Akira Hatanaka	3fb22c57eb	[mips] Fix typos. llvm-svn: 190236	2013-09-07 01:14:42 +00:00
Akira Hatanaka	3eef445630	[mips] Enhance command line option "-mno-ldc1-sdc1" to expand base+index double precision loads and stores as well as reg+imm double precision loads and stores. Previously, expansion of loads and stores was done after register allocation, but now it takes place during legalization. As a result, users will see double precision stores and loads being emitted to spill and restore 64-bit FP registers. llvm-svn: 190235	2013-09-07 00:52:30 +00:00
Akira Hatanaka	b84769b3d7	[mips] Set instruction itineraries of loads, stores and conditional moves. llvm-svn: 190219	2013-09-06 23:28:24 +00:00
Manman Ren	450526b5a9	Debug Info Testing: updated to use NULL instead of "i32 0" in a few fields. Field 2 of DIType (Context), field 9 of DIDerivedType (TypeDerivedFrom), field 12 of DICompositeType (ContainingType), fields 2, 7, 12 of DISubprogram (Context, Type, ContainingType). llvm-svn: 190205	2013-09-06 21:03:58 +00:00
Aaron Watry	e4512c5eff	R600: Add support for LDS atomic subtract Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 190200	2013-09-06 20:17:42 +00:00
Manman Ren	3a3457bef0	Debug Info Testing: Updated to use null instead of "i32 0" for containing-type field of DICompositeType. This will help the follow-on patch of using DITypeRef for containing-type field. llvm-svn: 190187	2013-09-06 18:13:59 +00:00
Tim Northover	5e921518f7	SelectionDAG: create correct BooleanContent constants Occasionally DAGCombiner can spot that a SETCC operation is completely redundant and reduce it to "all true" or "all false". If this happens to a vector, the value produced has to take account of what a normal comparison would have produced, which may be an all-1s bitmask. The fix in SelectionDAG.cpp is tested, however, as far as I can see the code in TargetLowering.cpp is possibly unreachable and almost certainly irrelevant when triggered so there are no tests. However, I believe it's still clearly the right change and may save someone else some hassle if it suddenly becomes reachable. So I'm doing it anyway. llvm-svn: 190147	2013-09-06 12:38:12 +00:00
Richard Sandiford	8d6edc5218	[SystemZ] Tweak integer comparison code The architecture has many comparison instructions, including some that extend one of the operands. The signed comparison instructions use sign extensions and the unsigned comparison instructions use zero extensions. In cases where we had a free choice between signed or unsigned comparisons, we were trying to decide at lowering time which would best fit the available instructions, taking things like extension type into account. The code to do that was getting increasingly hairy and was also making some bad decisions. E.g. when comparing the result of two LLCs, it is better to use CR rather than CLR, since CR can be fused with a branch while CLR can't. This patch removes the lowering code and instead adds an operand to integer comparisons to say whether signed comparison is required, whether unsigned comparison is required, or whether either is OK. We can then leave the choice of instruction up to the normal isel code. llvm-svn: 190138	2013-09-06 11:51:39 +00:00
Richard Sandiford	ea5b4917b9	[SystemZ] Use XC for a memset of 0 llvm-svn: 190130	2013-09-06 10:25:07 +00:00
Matt Arsenault	f658af2617	Teach CodeGenPrepare about address spaces llvm-svn: 190112	2013-09-06 00:18:43 +00:00
Juergen Ributzka	4554c8ed29	[X86] Perform VSELECT DAG combines also before DAG type legalization. If the DAG already has only legal types, then the second round of DAG combines is skipped. In this case VSELECT+SETCC patterns that match a more efficient instruction (e.g. min/max) are never recognized. This fix allows VSELECT+SETCC combines if the types are already legal before DAG type legalization. Reviewer: Nadav llvm-svn: 190105	2013-09-05 23:02:56 +00:00
Matt Arsenault	071be273be	R600: Fix i64 to i32 trunc on SI llvm-svn: 190091	2013-09-05 19:41:10 +00:00
Tom Stellard	ce0432a0c3	R600: Add support for local memory atomic add llvm-svn: 190080	2013-09-05 18:38:09 +00:00
Tom Stellard	6c1db18560	R600: Expand SELECT nodes rather than custom lowering them llvm-svn: 190079	2013-09-05 18:38:03 +00:00
Tom Stellard	8f7c5a681a	R600: Fix incorrect LDS size calculation GlobalAddress nodes that appeared in more than one basic block were being counted twice. llvm-svn: 190078	2013-09-05 18:37:57 +00:00
Tom Stellard	d2fff2dd99	R600/SI: Don't emit S_WQM_B64 instruction for compute shaders llvm-svn: 190077	2013-09-05 18:37:52 +00:00
Joey Gouly	071ca2ff6d	[ARMv8] Implement the new DMB/DSB operands. This removes the custom ISD Node: MEMBARRIER and replaces it with an intrinsic. llvm-svn: 190055	2013-09-05 15:35:24 +00:00
Tilmann Scheller	31cc184566	Reverting 190043 for now. Solution is not sufficient to prevent 'mov pc, lr' being emitted for jump table code. Test case doesn't trigger the added functionality. llvm-svn: 190047	2013-09-05 11:59:43 +00:00
Tilmann Scheller	14c2ce0a1e	ARM: Add GPR register class excluding LR for use with the ADR instruction. This improves code generation for jump tables by avoiding the emission of "mov pc, lr" which could fool the processor into believing this is a return from a function causing mispredicts. The code generation logic for jump tables uses ADR to materialize the address of the jump target. Patch by Daniel Stewart! llvm-svn: 190043	2013-09-05 11:10:31 +00:00
Richard Sandiford	399318ba38	[SystemZ] Add NC, OC and XC For now these are just used to handle scalar ANDs, ORs and XORs in which all operands are memory. llvm-svn: 190041	2013-09-05 10:36:45 +00:00


8246 Commits
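
The sqrt(0) problem described in r190624 (47bfa9a072) can be illustrated with a small numeric sketch. This is not the PPC backend code. It assumes a reciprocal-sqrt estimate that returns +inf at zero, refined by Newton iterations, followed by a Newton-refined reciprocal; the function names, the 1% estimate error, and the two-step iteration count are all hypothetical choices made for illustration.

```python
import math

def rsqrt_newton(x, steps=2):
    # Hypothetical stand-in for a hardware estimate: slightly wrong on purpose,
    # and +inf for an input of zero.
    y = math.inf if x == 0.0 else (1.0 / math.sqrt(x)) * 1.01
    for _ in range(steps):
        # Newton step for 1/sqrt(x). At x == 0 this computes 0 * inf, i.e. NaN.
        y = y * (1.5 - 0.5 * x * y * y)
    return y

def recip_newton(a, steps=2):
    # Hypothetical reciprocal estimate (1% off), refined by Newton steps.
    # A NaN input simply propagates through the arithmetic.
    r = (1.0 / a) * 1.01
    for _ in range(steps):
        r = r * (2.0 - a * r)
    return r

def fast_sqrt_unsafe(x):
    # sqrt(x) computed as the reciprocal of the reciprocal sqrt.
    return recip_newton(rsqrt_newton(x))

def fast_sqrt_safe(x):
    # Explicitly return zero for a zero input, as the commit message describes.
    return 0.0 if x == 0.0 else fast_sqrt_unsafe(x)

if __name__ == "__main__":
    print(fast_sqrt_unsafe(0.0))
    print(fast_sqrt_safe(0.0))
    print(fast_sqrt_safe(4.0))
```

In this sketch the unguarded version yields NaN at zero because the first Newton step multiplies 0 by inf; the guarded version short-circuits that case before any estimate is taken.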