llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Owen Anderson	b1d0514bb6	Make RegisterOperand a subclass of DAGOperand so that RegisterOperands can be passed into multiclasses that take DAGOperands as multiclass parameters. llvm-svn: 160540	2012-07-20 03:38:19 +00:00
Bill Wendling	cdfcdb6e92	Remove tabs. llvm-svn: 160473	2012-07-19 00:01:33 +00:00
Jim Grosbach	8a78dc2600	TableGen: Allow conditional instruction pattern in multiclass. Define a 'null_frag' SDPatternOperator node, which if referenced in an instruction Pattern, results in the pattern being collapsed to be as-if '[]' had been specified instead. This allows supporting a multiclass definition where some instaniations have ISel patterns associated and others do not. For example, multiclass myMulti<RegisterClass rc, SDPatternOperator OpNode = null_frag> { def _x : myI<(outs rc:), (ins rc:), []>; def _r : myI<(outs rc:), (ins rc:), [(set rc:, (OpNode rc:))]>; } defm foo : myMulti<GRa, not>; defm bar : myMulti<GRb>; llvm-svn: 160333	2012-07-17 00:47:06 +00:00
Andrew Trick	b9c8074dcd	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraties for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex contraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Andrew Trick	39f96ecbe2	whitespace llvm-svn: 159890	2012-07-07 03:59:51 +00:00
Jakob Stoklund Olesen	db187a51eb	Add an experimental early if-conversion pass, off by default. This pass performs if-conversion on SSA form machine code by speculatively executing both sides of the branch and using a cmov instruction to select the result. This can help lower the number of branch mispredictions on architectures like x86 that don't have predicable instructions. The current implementation is very aggressive, and causes regressions on mosts tests. It needs good heuristics that have yet to be implemented. llvm-svn: 159694	2012-07-04 00:09:54 +00:00
Evan Cheng	6196c5f5f3	Target option DisableJumpTables is a gross hack. Move it to TargetLowering instead. llvm-svn: 159611	2012-07-02 22:39:56 +00:00
Bob Wilson	a848f156de	Extend TargetPassConfig to allow running only a subset of the normal passes. This is still a work in progress but I believe it is currently good enough to fix PR13122 "Need unit test driver for codegen IR passes". For example, you can run llc with -stop-after=loop-reduce to have it dump out the IR after running LSR. Serializing machine-level IR is not yet supported but we have some patches in progress for that. The plan is to serialize the IR to a YAML file, containing separate sections for the LLVM IR, machine-level IR, and whatever other info is needed. Chad suggested that we stash the stop-after pass in the YAML file and use that instead of the start-after option to figure out where to restart the compilation. I think that's a great idea, but since it's not implemented yet I put the -start-after option into this patch for testing purposes. llvm-svn: 159570	2012-07-02 19:48:45 +00:00
Andrew Trick	baf8a62800	Reapply "Make NumMicroOps a variable in the subtarget's instruction itinerary." Reapplies r159406 with minor cleanup. The regressions appear to have been spurious. llvm-svn: 159541	2012-07-02 18:10:42 +00:00
Manman Ren	125c1ee4e9	Add SrcReg2 to analyzeCompare and optimizeCompareInstr to handle Compare instructions with two register operands. llvm-svn: 159465	2012-06-29 21:33:59 +00:00
Andrew Trick	251f64f946	Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." This reverts commit r159406. I noticed a performance regression so I'll back out for now. llvm-svn: 159411	2012-06-29 07:10:41 +00:00
Andrew Trick	52238a0ce5	Make NumMicroOps a variable in the subtarget's instruction itinerary. The TargetInstrInfo::getNumMicroOps API does not change, but soon it will be used by MachineScheduler. Now each subtarget can specify the number of micro-ops per itinerary class. For ARM, this is currently always dynamic (-1), because it is used for load/store multiple which depends on the number of register operands. Zero is now a valid number of micro-ops. This can be used for nop pseudo-instructions or instructions that the hardware can squash during dispatch. llvm-svn: 159406	2012-06-29 03:23:18 +00:00
Owen Anderson	9dce23492c	Define DAGOperand, an empty base class for RegisterClass and Operand. This allows one to write multiclasses that are polymorphic over both registers and non-register operands. llvm-svn: 159162	2012-06-25 21:25:16 +00:00
Jim Grosbach	9bfc8e18b0	TableGen: AsmMatcher support for better operand diagnostics. "Invalid operand" may be a completely correct diagnostic, but it's often insufficiently specific to really help identify and fix the problem in assembly source. Allow a target to specify a more-specific diagnostic kind for each AsmOperandClass derived definition and use that to provide more detailed diagnostics when an operant of that class resulted in a match failure. rdar://8987109 llvm-svn: 159050	2012-06-22 23:56:44 +00:00
Andrew Trick	279bd30bbc	Use "NoItineraries" for processors with no itineraries. This makes it explicit when ScoreboardHazardRecognizer will be used. "GenericItineraries" would only make sense if it contained real itinerary values and still required ScoreboardHazardRecognizer. llvm-svn: 158963	2012-06-22 03:58:51 +00:00
Lang Hames	68cf87e3ef	Rename -allow-excess-fp-precision flag to -fuse-fp-ops, and switch from a boolean flag to an enum: { Fast, Standard, Strict } (default = Standard). This option controls the creation by optimizations of fused FP ops that store intermediate results in higher precision than IEEE allows (E.g. FMAs). The behavior of this option is intended to match the behaviour specified by a soon-to-be-introduced frontend flag: '-ffuse-fp-ops'. Fast mode - allows formation of fused FP ops whenever they're profitable. Standard mode - allow fusion only for 'blessed' FP ops. At present the only blessed op is the fmuladd intrinsic. In the future more blessed ops may be added. Strict mode - allow fusion only if/when it can be proven that the excess precision won't effect the result. Note: This option only controls formation of fused ops by the optimizers. Fused operations that are explicitly requested (e.g. FMA via the llvm.fma.* intrinsic) will always be honored, regardless of the value of this option. Internally TargetOptions::AllowExcessFPPrecision has been replaced by TargetOptions::AllowFPOpFusion. llvm-svn: 158956	2012-06-22 01:09:09 +00:00
Lang Hames	f0b9601a6d	Add DAG-combines for aggressive FMA formation. This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or FSUB + FMUL. The combines are performed when: (a) Either AllowExcessFPPrecision option (-enable-excess-fp-precision for llc) OR UnsafeFPMath option (-enable-unsafe-fp-math) are set, and (b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of the FADD/FSUB, and (c) The FMUL only has one user (the FADD/FSUB). If your target has fast FMA instructions you can make use of these combines by overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for types supported by your FMA instruction, and adding patterns to match ISD::FMA to your FMA instructions. llvm-svn: 158757	2012-06-19 22:51:23 +00:00
Rafael Espindola	38c45a939d	Move the support for using .init_array from ARM to the generic TargetLoweringObjectFileELF. Use this to support it on X86. Unlike ARM, on X86 it is not easy to find out if .init_array should be used or not, so the decision is made via TargetOptions and defaults to off. Add a command line option to llc that enables it. llvm-svn: 158692	2012-06-19 00:48:28 +00:00
Kay Tiong Khoo	b631f7fd59	*typo: Cyles changed to Cycles llvm-svn: 158404	2012-06-13 15:53:04 +00:00
Dmitri Gribenko	6319fd5eb8	Convert comments to proper Doxygen comments. llvm-svn: 158248	2012-06-09 00:01:45 +00:00
Andrew Trick	a1df722f41	Removing strange "using" declarations form TargetInstrInfo. I can't imagine why these were added. Trial and error. llvm-svn: 158247	2012-06-08 23:56:26 +00:00
Andrew Trick	151209d9dc	TargetInstrInfo hooks implemented in codegen should be declared pure virtual. llvm-svn: 158233	2012-06-08 21:52:38 +00:00
Andrew Trick	4fe40f02fd	Continue factoring computeOperandLatency. Use it for ARM hasHighOperandLatency. llvm-svn: 158164	2012-06-07 19:42:04 +00:00
Manman Ren	f591de61da	Revert r157755. The commit is intended to fix rdar://11540023. It is implemented as part of peephole optimization. We can actually implement this in the SelectionDAG lowering phase. llvm-svn: 158122	2012-06-06 23:53:03 +00:00
Andrew Trick	24cce40009	misched: API for minimum vs. expected latency. Minimum latency determines per-cycle scheduling groups. Expected latency determines critical path and cost. llvm-svn: 158021	2012-06-05 21:11:27 +00:00
Lang Hames	30d8fb4deb	Add a new intrinsic: llvm.fmuladd. This intrinsic represents a multiply-add expression (a * b + c) that can be implemented as a fused multiply-add (fma) if the target determines that this will be more efficient. This intrinsic will be used to implement FP_CONTRACT support and an aggressive FMA formation mode. If your target has a fast FMA instruction you should override the isFMAFasterThanMulAndAdd method in TargetLowering to return true. llvm-svn: 158014	2012-06-05 19:07:46 +00:00
Andrew Trick	e7159e6731	misched: Added MultiIssueItineraries. This allows a subtarget to explicitly specify the issue width and other properties without providing pipeline stage details for every instruction. llvm-svn: 157979	2012-06-05 03:44:40 +00:00
Andrew Trick	8b333df134	whitespace llvm-svn: 157976	2012-06-05 03:44:29 +00:00
Nadav Rotem	6969a673e9	Remove the "-promote-elements" flag. This flag is now enabled by default. llvm-svn: 157925	2012-06-04 11:27:21 +00:00
Manman Ren	82e2c9debf	X86: replace SUB with CMP if possible This patch will optimize the following movq %rdi, %rax subq %rsi, %rax cmovsq %rsi, %rdi movq %rdi, %rax to cmpq %rsi, %rdi cmovsq %rsi, %rdi movq %rdi, %rax Perform this optimization if the actual result of SUB is not used. rdar: 11540023 llvm-svn: 157755	2012-05-31 17:20:29 +00:00
Jakob Stoklund Olesen	dc9a46b651	Add a PrintRegUnit helper similar to PrintReg. Reg-units are named after their root registers, and most units have a single root, so they simply print as 'AL', 'XMM0', etc. The rare dual root reg-units print as FPSCR~FPSCR_NZCV, FP0~ST7, ... The printing piggybacks on the existing register name tables, so no extra const data space is required. llvm-svn: 157754	2012-05-31 17:18:29 +00:00
Jakob Stoklund Olesen	67b7a47013	Add MCRegisterInfo::RegListIterator. Also add subclasses MCSubRegIterator, MCSuperRegIterator, and MCRegAliasIterator. These iterators provide an abstract interface to the MCRegisterInfo register lists so the internal representation can be changed without changing all clients. llvm-svn: 157695	2012-05-30 16:36:28 +00:00
Jakob Stoklund Olesen	c83fa0a0e4	Use MCRegUnitIterator to compute regsOverlap(). The register unit lists are typically much shorter than the register overlap lists, and the backing table for register units has better cache locality because it is smaller. This makes llc about 0.5% faster. The regsOverlap() function isn't that hot. llvm-svn: 157651	2012-05-29 23:40:02 +00:00
Justin Holewinski	77c4679dae	Change interface for TargetLowering::LowerCallTo and TargetLowering::LowerCall to pass around a struct instead of a large set of individual values. This cleans up the interface and allows more information to be added to the struct for future targets without requiring changes to each and every target. NV_CONTRIB llvm-svn: 157479	2012-05-25 16:35:28 +00:00
Eli Friedman	d89582030a	Simplify code for calling a function where CanLowerReturn fails, fixing a small bug in the process. llvm-svn: 157446	2012-05-25 00:09:29 +00:00
Andrew Trick	3152745a8f	misched: Added ScoreboardHazardRecognizer. The Hazard checker implements in-order contraints, or interlocked resources. Ready instructions with hazards do not enter the available queue and are not visible to other heuristics. The major code change is the addition of SchedBoundary to encapsulate the state at the top or bottom of the schedule, including both a pending and available queue. The scheduler now counts cycles in sync with the hazard checker. These are minimum cycle counts based on known hazards. Targets with no itinerary (x86_64) currently remain at cycle 0. To fix this, we need to provide some maximum issue width for all targets. We also need to add the concept of expected latency vs. minimum latency. llvm-svn: 157427	2012-05-24 22:11:09 +00:00
Owen Anderson	03e1ab9c5b	Teach tblgen's set theory "sequence" operator to support an optional stride operand. llvm-svn: 157416	2012-05-24 21:37:08 +00:00
Evandro Menezes	740dea501d	[Hexagon] Clean up Hexagon ELF definition. llvm-svn: 156996	2012-05-17 16:46:46 +00:00
Jim Grosbach	8f241263d7	TableGen'erate mapping physical registers to encoding values. Many targets always use the same bitwise encoding value for physical registers in all (or most) instructions. Add this mapping to the .td files and TableGen'erate the information and expose an accessor in MCRegisterInfo. patch by Tom Stellard. llvm-svn: 156829	2012-05-15 17:35:57 +00:00
Dan Gohman	cc1f60a86c	Rename @llvm.debugger to @llvm.debugtrap. llvm-svn: 156774	2012-05-14 18:58:10 +00:00
Dan Gohman	ed475ad173	Define a new intrinsic, @llvm.debugger. It will be similar to __builtin_trap(), but it generates int3 on x86 instead of ud2. llvm-svn: 156593	2012-05-11 00:19:32 +00:00
Jakob Stoklund Olesen	cc0cf22b98	Add an MF argument to TRI::getPointerRegClass() and TII::getRegClass(). The getPointerRegClass() hook can return register classes that depend on the calling convention of the current function (ptr_rc_tailcall). So far, we have been able to infer the calling convention from the subtarget alone, but as we add support for multiple calling conventions per target, that no longer works. Patch by Yiannis Tsiouris! llvm-svn: 156328	2012-05-07 22:10:26 +00:00
Jakob Stoklund Olesen	d92280013d	Add TRI::getCommonSuperRegClass(). This function is a generalization of getMatchingSuperRegClass() to the symmetric case where both sides are using a sub-register index. It will find a super-register class and sub-register indexes that make this diagram commute: PreA SuperRC ----------> RCA \| \| \| \| PreB \| \| SubA \| \| \| \| V V RCB ----------> SubRC SubB This can be used to coalesce copies like: %vreg1:sub16 = COPY %vreg2:sub16; GR64:%vreg1, GR32: %vreg2 llvm-svn: 156317	2012-05-07 19:14:58 +00:00
Benjamin Kramer	7a9528b540	Add a new target hook "predictableSelectIsExpensive". This will be used to determine whether it's profitable to turn a select into a branch when the branch is likely to be predicted. Currently enabled for everything but Atom on X86 and Cortex-A9 devices on ARM. I'm not entirely happy with the name of this flag, suggestions welcome ;) llvm-svn: 156233	2012-05-05 12:49:14 +00:00
Jakob Stoklund Olesen	7bdae32bfd	Remove the SubRegClasses field from RegisterClass descriptions. This information in now computed by TableGen. llvm-svn: 156152	2012-05-04 03:30:34 +00:00
Jakob Stoklund Olesen	2c4618568d	Remove TargetRegisterClass::SuperRegClasses. This manually enumerated list of super-register classes has been superceeded by the automatically computed super-register class masks available through SuperRegClassIterator. llvm-svn: 156151	2012-05-04 03:30:28 +00:00
Jakob Stoklund Olesen	8fbea83a95	Use SuperRegClassIterator for findRepresentativeClass(). The masks returned by SuperRegClassIterator are computed automatically by TableGen. This is better than depending on the manually specified SuperRegClasses. llvm-svn: 156147	2012-05-04 02:19:22 +00:00
Jakob Stoklund Olesen	8696a6109b	Add a SuperRegClassIterator class. This iterator class provides a more abstract interface to the (Idx, Mask) lists of super-registers for a register class. The layout of the tables shouldn't be exposed to clients. llvm-svn: 156144	2012-05-04 01:48:29 +00:00
Jakob Stoklund Olesen	d75f11c7fc	Use a shared implementation of getMatchingSuperRegClass(). TargetRegisterClass now gives access to the necessary tables. llvm-svn: 156122	2012-05-03 22:49:04 +00:00
Jakob Stoklund Olesen	0b38e611c1	Add TargetRegisterClass::getSuperRegIndices(). This is a pointer into one of the tables used by getMatchingSuperRegClass(). It makes it possible to use a shared implementation of that function. llvm-svn: 156121	2012-05-03 22:49:00 +00:00
Jakob Stoklund Olesen	e5602da235	Fix the type of SubClassMask. llvm-svn: 156084	2012-05-03 18:17:32 +00:00
Jakob Stoklund Olesen	9fc9ae5ef4	Don't override subreg functions in targets without subregisters. Some targets have no sub-registers at all. Use the TargetRegisterInfo versions of composeSubRegIndices(), getSubClassWithSubReg(), and getMatchingSuperRegClass() for those targets. llvm-svn: 156075	2012-05-03 16:26:20 +00:00
Andrew Trick	4d16c1f958	Added TargetRegisterInfo::getAllocatableClass. The ensures that virtual registers always belong to an allocatable class. If your target attempts to create a vreg for an operand that has no allocatable register subclass, you will crash quickly. This ensures that targets define register classes as intended. llvm-svn: 156046	2012-05-03 01:14:37 +00:00
Jim Grosbach	7ac2ac85a8	ARM: improved assembler diagnostics for missing CPU features. When an instruction match is found, but the subtarget features it requires are not available (missing floating point unit, or thumb vs arm mode, for example), issue a diagnostic that identifies what the feature mismatch is. rdar://11257547 llvm-svn: 155499	2012-04-24 22:40:08 +00:00
Preston Gurd	0a730de3c3	This patch fixes a problem which arose when using the Post-RA scheduler on X86 Atom. Some of our tests failed because the tail merging part of the BranchFolding pass was creating new basic blocks which did not contain live-in information. When the anti-dependency code in the Post-RA scheduler ran, it would sometimes rename the register containing the function return value because the fact that the return value was live-in to the subsequent block had been lost. To fix this, it is necessary to run the RegisterScavenging code in the BranchFolding pass. This patch makes sure that the register scavenging code is invoked in the X86 subtarget only when post-RA scheduling is being done. Post RA scheduling in the X86 subtarget is only done for Atom. This patch adds a new function to the TargetRegisterClass to control whether or not live-ins should be preserved during branch folding. This is necessary in order for the anti-dependency optimizations done during the PostRASchedulerList pass to work properly when doing Post-RA scheduling for the X86 in general and for the Intel Atom in particular. The patch adds and invokes the new function trackLivenessAfterRegAlloc() instead of using the existing requiresRegisterScavenging(). It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of requiresRegisterScavenging(). It changes the all the targets that implemented requiresRegisterScavenging() to also implement trackLivenessAfterRegAlloc(). It adds an assertion in the Post RA scheduler to make sure that post RA liveness information is available when it is needed. It changes the X86 break-anti-dependencies test to use –mcpu=atom, in order to avoid running into the added assertion. Finally, this patch restores the use of anti-dependency checking (which was turned off temporarily for the 3.1 release) for Intel Atom in the Post RA scheduler. Patch by Andy Zhang! Thanks to Jakob and Anton for their reviews. llvm-svn: 155395	2012-04-23 21:39:35 +00:00
Andrew Trick	c13299f6c4	Added TargetRegisterInfo::getRegPressureSetName. llvm-svn: 155235	2012-04-20 20:45:00 +00:00
Jim Grosbach	b477f2e986	Add documentation comment. llvm-svn: 155203	2012-04-20 16:29:46 +00:00
Jim Grosbach	0cd0534390	TableGen support for auto-generating assembly two-operand aliases. Assembly matchers for instructions with a two-operand form. ARM is full of these, for example: add {Rd}, Rn, Rm // Rd is optional and is the same as Rn if omitted. The property TwoOperandAliasConstraint on the instruction definition controls when, and if, an alias will be formed. No explicit InstAlias definitions are required. rdar://11255754 llvm-svn: 155172	2012-04-19 23:59:23 +00:00
Joe Groff	cc9c07aacc	fix pr12559: mark unavailable win32 math libcalls also fix SimplifyLibCalls to use TLI rather than compile-time conditionals to enable optimizations on floor, ceil, round, rint, and nearbyint llvm-svn: 154960	2012-04-17 23:05:54 +00:00
Andrew Trick	6871e5550d	Typo in an unused field. llvm-svn: 154895	2012-04-17 01:22:58 +00:00
Andrew Trick	6d7aff8241	TableGen's regpressure: emit per-registerclass weight limits. llvm-svn: 154518	2012-04-11 18:16:28 +00:00
Duncan Sands	4f66d852d0	Comment typo fix. llvm-svn: 154488	2012-04-11 08:13:47 +00:00
Andrew Trick	eff18cab11	Added a TargetRegisterInfo interface for accessing register pressure sets. llvm-svn: 154375	2012-04-10 02:25:26 +00:00
Evan Cheng	5825e9dbf5	Fix a long standing tail call optimization bug. When a libcall is emitted legalizer always use the DAG entry node. This is wrong when the libcall is emitted as a tail call since it effectively folds the return node. If the return node's input chain is not the entry (i.e. call, load, or store) use that as the tail call input chain. PR12419 rdar://9770785 rdar://11195178 llvm-svn: 154370	2012-04-10 01:51:00 +00:00
Chandler Carruth	11c412fd2c	Teach LLVM about a PIE option which, when enabled on top of PIC, makes optimizations which are valid for position independent code being linked into a single executable, but not for such code being linked into a shared library. I discussed the design of this with Eric Christopher, and the decision was to support an optional bit rather than a completely separate relocation model. Fundamentally, this is still PIC relocation, its just that certain optimizations are only valid under a PIC relocation model when the resulting code won't be in a shared library. The simplest path to here is to expose a single bit option in the TargetOptions. If folks have different/better designs, I'm all ears. =] I've included the first optimization based upon this: changing TLS models to the *Exec models when PIE is enabled. This is the LLVM component of PR12380 and is all of the hard work. llvm-svn: 154294	2012-04-08 17:51:45 +00:00
Chandler Carruth	233e7232ae	Move the TLSModel information into the TargetMachine rather than hiding in TargetLowering. There was already a FIXME about this location being odd. The interface is simplified as a consequence. This will also make it easier to change TLS models when compiling with PIE. llvm-svn: 154292	2012-04-08 17:20:55 +00:00
Rafael Espindola	88a1aeb123	Always compute all the bits in ComputeMaskedBits. This allows us to keep passing reduced masks to SimplifyDemandedBits, but know about all the bits if SimplifyDemandedBits fails. This allows instcombine to simplify cases like the one in the included testcase. llvm-svn: 154011	2012-04-04 12:51:34 +00:00
Owen Anderson	157487e7c5	Add predicates for checking whether targets have free FNEG and FABS operations, and prevent the DAGCombiner from turning them into bitwise operations if they do. llvm-svn: 153901	2012-04-02 22:10:29 +00:00
Craig Topper	c0bc260ca3	Prune some includes and forward declarations. llvm-svn: 153414	2012-03-25 18:09:44 +00:00
Pete Cooper	df5d2a8893	Target override to allow CodeGenPrepare to sink address operands to intrinsics in the same way it current does for loads and stores llvm-svn: 152666	2012-03-13 20:59:56 +00:00
Jim Grosbach	a64613af7a	MCRegisterInfo-ize getMatchingSuperReg. llvm-svn: 152044	2012-03-05 19:33:24 +00:00
Craig Topper	a95d527c6a	Convert more GenRegisterInfo tables from unsigned to uint16_t to reduce static data size. llvm-svn: 152016	2012-03-05 05:37:41 +00:00
Craig Topper	8cc9d75c6a	Use uint16_t to store register overlaps to reduce static data. llvm-svn: 152001	2012-03-04 10:43:23 +00:00
Craig Topper	4ca8c48cc1	Use uint16_t instead of unsigned to store registers in reg classes. Reduces static data size. llvm-svn: 151998	2012-03-04 10:16:38 +00:00
Craig Topper	585b4225c3	Use uint16_t to store registers in callee saved register tables to reduce size of static data. llvm-svn: 151996	2012-03-04 03:33:22 +00:00
Benjamin Kramer	111608aae1	Move getSubRegIndex out of generated code into MCRegisterInfo, devirtualize it. llvm-svn: 151821	2012-03-01 18:16:35 +00:00
Jim Grosbach	9b3f9f1d37	Move TargetRegisterInfo::getSubReg() to MCRegisterInfo. Allows us to de-virtualize the function and provides access to it in the instruction printer, which is useful for handling composite physical registers (e.g., ARM register lists). llvm-svn: 151815	2012-03-01 17:30:39 +00:00
Benjamin Kramer	44c3c88cb7	Make TargetRegisterClasses non-virtual by making the only virtual function a function pointer. This allows us to make TRC non-polymorphic and value-initializable, eliminating a huge static initializer and a ton of cruft from the generated code. Shrinks ARMBaseRegisterInfo.o by ~100k. llvm-svn: 151806	2012-03-01 13:37:55 +00:00
Evan Cheng	c5ead6c49e	Re-commit r151623 with fix. Only issue special no-return calls if it's a direct call. llvm-svn: 151645	2012-02-28 18:51:51 +00:00
Daniel Dunbar	b448d31a6b	Revert r151623 "Some ARM implementaions, e.g. A-series, does return stack prediction. ...", it is breaking the Clang build during the Compiler-RT part. llvm-svn: 151630	2012-02-28 15:36:07 +00:00
Evan Cheng	d29a22e4b0	Some ARM implementaions, e.g. A-series, does return stack prediction. That is, the processor keeps a return addresses stack (RAS) which stores the address and the instruction execution state of the instruction after a function-call type branch instruction. Calling a "noreturn" function with normal call instructions (e.g. bl) can corrupt RAS and causes 100% return misprediction so LLVM should use a unconditional branch instead. i.e. mov lr, pc b _foo The "mov lr, pc" is issued in order to get proper backtrace. rdar://8979299 llvm-svn: 151623	2012-02-28 06:42:03 +00:00
Craig Topper	3ed929de0a	Make all pointers to TargetRegisterClass const since they are all pointers to static data that should not be modified. llvm-svn: 151134	2012-02-22 05:59:10 +00:00
James Molloy	6cbdd131b3	Improve generated code for extending loads and some trunc stores on ARM. Teach TargetSelectionDAG about lengthening loads for vector types and set v4i8 as legal. Allow FP_TO_UINT for v4i16 from v4i32. llvm-svn: 150956	2012-02-20 09:24:05 +00:00
Bill Wendling	d483464dd5	Modify the code that emits the module flags to use the new module flags accessor method. This allows the target lowering code to not have to deal with MDNodes. Also, avoid leaking memory like a sieve by not creating a global variable for the image info section, but just emitting the code directly. llvm-svn: 150624	2012-02-15 22:36:15 +00:00
Bill Wendling	493a72b2fe	Add code to the target lowering object file module to handle module flags. The MachO back-end needs to emit the garbage collection flags specified in the module flags. This is a WIP, so the front-end hasn't been modified to emit these flags just yet. Documentation and front-end switching to occur soon. llvm-svn: 150507	2012-02-14 21:28:13 +00:00
Lang Hames	70bdeac646	Rename getExceptionAddressRegister() to getExceptionPointerRegister() for consistency with setExceptionPointerRegister(...). llvm-svn: 150460	2012-02-14 04:45:49 +00:00
Nick Lewycky	0e211cfa88	Remove redundant getAnalysis<> calls in GlobalOpt. Add a few Itanium ABI calls to TargetLibraryInfo and use one of them in GlobalOpt. llvm-svn: 150323	2012-02-12 02:15:20 +00:00
Andrew Trick	c3cc8fa604	RegAlloc superpass: includes phi elimination, coalescing, and scheduling. Creates a configurable regalloc pipeline. Ensure specific llc options do what they say and nothing more: -reglloc=... has no effect other than selecting the allocator pass itself. This patch introduces a new umbrella flag, "-optimize-regalloc", to enable/disable the optimizing regalloc "superpass". This allows for example testing coalscing and scheduling under -O0 or vice-versa. When a CodeGen pass requires the MachineFunction to have a particular property, we need to explicitly define that property so it can be directly queried rather than naming a specific Pass. For example, to check for SSA, use MRI->isSSA, not addRequired<PHIElimination>. CodeGen transformation passes are never "required" as an analysis ProcessImplicitDefs does not require LiveVariables. We have a plan to massively simplify some of the early passes within the regalloc superpass. llvm-svn: 150226	2012-02-10 04:10:36 +00:00
Benjamin Kramer	260bef228a	Store just the SimpleValueType in the generated VT tables for each register class, eliminating static ctors. llvm-svn: 150173	2012-02-09 12:35:37 +00:00
Craig Topper	e18a06be4d	Convert assert(0) to llvm_unreachable llvm-svn: 149849	2012-02-05 22:14:15 +00:00
Andrew Trick	fb574d1c19	TargetPassConfig: confine the MC configuration to TargetMachine. Passes prior to instructon selection are now split into separate configurable stages. Header dependencies are simplified. The bulk of this diff is simply removal of the silly DisableVerify flags. Sorry for the target header churn. Attempting to stabilize them. llvm-svn: 149754	2012-02-04 02:56:59 +00:00
Andrew Trick	da0c52e742	Added TargetPassConfig. The first little step toward configuring codegen passes. Allows command line overrides to be centralized in LLVMTargetMachine.cpp. LLVMTargetMachine can intercept common passes and give precedence to command line overrides. Allows adding "internal" target configuration options without touching TargetOptions. Encapsulates the PassManager. Provides a good point to initialize all CodeGen passes so that Pass ID's can be used in APIs. Allows modifying the target configuration hooks without rebuilding the world. llvm-svn: 149672	2012-02-03 05:12:41 +00:00
Jakob Stoklund Olesen	b84880cf78	Require non-NULL register masks. It doesn't seem worthwhile to give meaning to a NULL register mask pointer. It complicates all the code using register mask operands. llvm-svn: 149646	2012-02-02 23:52:57 +00:00
Jakob Stoklund Olesen	230a0a4b40	Specify SubRegIndex components on the index itself. It is simpler to define a composite index directly: def ssub_2 : SubRegIndex<[dsub_1, ssub_0]>; def ssub_3 : SubRegIndex<[dsub_1, ssub_1]>; Than specifying the composite indices on each register: CompositeIndices = [(ssub_2 dsub_1, ssub_0), (ssub_3 dsub_1, ssub_1)] in ... This also makes it clear that SubRegIndex composition is supposed to be unique. llvm-svn: 149556	2012-02-01 23:16:41 +00:00
Andrew Trick	b4963dd8da	VLIW specific scheduler framework that utilizes deterministic finite automaton (DFA). This new scheduler plugs into the existing selection DAG scheduling framework. It is a top-down critical path scheduler that tracks register pressure and uses a DFA for pipeline modeling. Patch by Sergei Larin! llvm-svn: 149547	2012-02-01 22:13:57 +00:00
Anton Korobeynikov	682b2821ce	Properly emit ctors / dtors with priorities into desired sections and let linker handle the rest. This finally fixes PR5329 llvm-svn: 148990	2012-01-25 22:24:19 +00:00
Jakob Stoklund Olesen	3ff9738293	Add an (interleave A, B, ...) SetTheory operator. This will interleave the elements from two or more lists. llvm-svn: 148824	2012-01-24 18:06:05 +00:00
David Blaikie	06ecc99a56	More dead code removal (using -Wunreachable-code) llvm-svn: 148578	2012-01-20 21:51:11 +00:00
Nick Lewycky	c1e7e2eaf6	Add a TargetOption for disabling tail calls. llvm-svn: 148442	2012-01-19 00:34:10 +00:00
Jakob Stoklund Olesen	e9c53bc69b	Add a CoveredBySubRegs property to Register descriptions. When set, this bit indicates that a register is completely defined by the value of its sub-registers. Use the CoveredBySubRegs property to infer which super-registers are call-preserved given a list of callee-saved registers. For example, the ARM registers D8-D15 are callee-saved. This now automatically implies that Q4-Q7 are call-preserved. Conversely, Win64 callees save XMM6-XMM15, but the corresponding YMM6-YMM15 registers are not call-preserved because they are not fully defined by their sub-registers. llvm-svn: 148363	2012-01-18 00:16:39 +00:00
Jakob Stoklund Olesen	c5fac0a05c	Add TableGen support for callee saved registers. Targets can now add CalleeSavedRegs defs to their CallingConv.td file. TableGen will use this to create a _SaveList array suitable for returning from getCalleeSavedRegs() as well as a *_RegMask bit mask suitable for returning from getCallPreservedMask(). llvm-svn: 148346	2012-01-17 22:46:58 +00:00
Andrew Trick	8cee8a6cb3	Moving options declarations around. More short term hackery until we have a way to configure passes that work on LiveIntervals. llvm-svn: 148289	2012-01-17 06:54:59 +00:00
Jakob Stoklund Olesen	742b6d295b	Add TRI::getCallPreservedMask() hook. The hook returns a bit-mask of call-preserved registers that will eventually replace the current list of implicit defs on call instructions. This will make it possible to support multiple calling conventions without duplicating call instruction descriptors. The call-preserved mask is slightly different from the list returned by the getCalleeSavedRegs() hook, it includes all aliases that are preserved by calls. The hook takes a CallingConv::ID argument instead of a MachineFunction pointer, so it can provide information about calls to extern functions, and even indirect function calls. TRI::getCalleeSavedRegs() returns information about the function currently being compiled. TRI::getCallPreservedMask() returns information about the functions it is calling. llvm-svn: 148165	2012-01-14 01:45:25 +00:00
Andrew Trick	85c44d1485	Added the MachineSchedulerPass skeleton. llvm-svn: 148105	2012-01-13 06:30:30 +00:00
Andrew Trick	6e61e9f2b6	whitespace llvm-svn: 148104	2012-01-13 06:30:25 +00:00
Evan Cheng	fc3ec91768	Allow targets to select source order pre-RA scheduler. llvm-svn: 148033	2012-01-12 18:27:52 +00:00
Evan Cheng	2fcee784d6	Move Sched::Preference out of TargetMachine.h where it is not referenced. llvm-svn: 148014	2012-01-12 02:35:23 +00:00
Chandler Carruth	2a6b59a693	Add 'llvm_unreachable' to passify GCC's understanding of the constraints of several newly un-defaulted switches. This also helps optimizers (including LLVM's) recognize that every case is covered, and we should assume as much. llvm-svn: 147861	2012-01-10 18:08:01 +00:00
David Blaikie	8d47bb30e3	Remove unnecessary default cases in switches that cover all enum values. llvm-svn: 147855	2012-01-10 16:47:17 +00:00
Devang Patel	921a16318d	Split AsmParser into two components - AsmParser and AsmParserVariant AsmParser holds info specific to target parser. AsmParserVariant holds info specific to asm variants supported by the target. llvm-svn: 147787	2012-01-09 19:13:28 +00:00
David Blaikie	576aba04f1	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Evan Cheng	4c77fb5375	Add a if-conversion optimization that allows 'true' side of a diamond to be unpredicated. That is, turn subeq r0, r1, #1 addne r0, r1, #1 into sub r0, r1, #1 addne r0, r1, #1 For targets where conditional instructions are always executed, this may be beneficial. It may remove pseudo anti-dependency in out-of-order execution CPUs. e.g. op r1, ... str r1, [r10] ; end-of-life of r1 as div result cmp r0, #65 movne r1, #44 ; raw dependency on previous r1 moveq r1, #12 If movne is unpredicated, then op r1, ... str r1, [r10] cmp r0, #65 mov r1, #44 ; r1 written unconditionally moveq r1, #12 Both mov and moveq are no longer depdendent on the first instruction. This gives the out-of-order execution engine more freedom to reorder them. This has passed entire LLVM test suite. But it has not been enabled for any ARM variant pending more performance evaluation. rdar://8951196 llvm-svn: 146914	2011-12-19 22:01:30 +00:00
Eli Friedman	53dbd7d97f	Add "using" to silence warnings. llvm-svn: 146913	2011-12-19 21:53:12 +00:00
Eli Friedman	602ab7a677	Attempt to fix PR11607 by shuffling around which class defines which methods. llvm-svn: 146897	2011-12-19 20:06:03 +00:00
Jakob Stoklund Olesen	0aa2f7755a	Emit a getMatchingSuperRegClass() implementation for every target. Use information computed while inferring new register classes to emit accurate, table-driven implementations of getMatchingSuperRegClass(). Delete the old manual, error-prone implementations in the targets. llvm-svn: 146873	2011-12-19 16:53:34 +00:00
Evan Cheng	8d5b09811a	Model ARM predicated write as read-mod-write. e.g. r0 = mov #0 r0 = moveq #1 Then the second instruction has an implicit data dependency on the first instruction. Sadly I have yet to come up with a small test case that demonstrate the post-ra scheduler taking advantage of this. llvm-svn: 146583	2011-12-14 20:00:08 +00:00
Evan Cheng	3257bff430	Allow target to specify register output dependency. Still default to one. llvm-svn: 146547	2011-12-14 02:28:53 +00:00
Chandler Carruth	e0484f6b37	Initial CodeGen support for CTTZ/CTLZ where a zero input produces an undefined result. This adds new ISD nodes for the new semantics, selecting them when the LLVM intrinsic indicates that the undef behavior is desired. The new nodes expand trivially to the old nodes, so targets don't actually need to do anything to support these new nodes besides indicating that they should be expanded. I've done this for all the operand types that I could figure out for all the targets. Owners of various targets, please review and let me know if any of these are incorrect. Note that the expand behavior is conservatively correct, and exactly matches LLVM's current behavior with these operations. Ideally this patch will not change behavior in any way. For example the regtest suite finds the exact same instruction sequences coming out of the code generator. That's why there are no new tests here -- all of this is being exercised by the existing test suite. Thanks to Duncan Sands for reviewing the various bits of this patch and helping me get the wrinkles ironed out with expanding for each target. Also thanks to Chris for clarifying through all the discussions that this is indeed the approach he was looking for. That said, there are likely still rough spots. Further review much appreciated. llvm-svn: 146466	2011-12-13 01:56:10 +00:00
Nick Lewycky	03325a99c6	Minimize #include's and forward-declares in Target. llvm-svn: 146335	2011-12-10 22:35:47 +00:00
Evan Cheng	5a218b203a	Move isUnpredicatedTerminator() default implementation to TargetInstrInfoImpl to break Target's dependency on CodeGen. llvm-svn: 146247	2011-12-09 06:41:08 +00:00
Owen Anderson	285891eccf	Enhance both TargetLibraryInfo and SelectionDAGBuilder so that the latter can use the former to prevent the formation of libm SDNode's when -fno-builtin is passed. llvm-svn: 146193	2011-12-08 22:15:21 +00:00
Jim Grosbach	14e284f16a	Extend AsmMatcher token literal matching to allow aliasing. For example, ARM allows: vmov.u32 s4, #0 -> vmov.i32, #0 'u32' is a more specific designator for the 32-bit integer type specifier and is legal for any instruction which accepts 'i32' as a datatype suffix. We want to say, def : TokenAlias<".u32", ".i32">; This works by marking the match class of 'From' as a subclass of the match class of 'To'. rdar://10435076 llvm-svn: 145992	2011-12-06 23:43:54 +00:00
Evan Cheng	5061553f9d	First chunk of MachineInstr bundle support. 1. Added opcode BUNDLE 2. Taught MachineInstr class to deal with bundled MIs 3. Changed MachineBasicBlock iterator to skip over bundled MIs; added an iterator to walk all the MIs 4. Taught MachineBasicBlock methods about bundled MIs llvm-svn: 145975	2011-12-06 22:12:01 +00:00
Nadav Rotem	1a91e4381d	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Nick Lewycky	7d0d3c2d58	Move global variables in TargetMachine into new TargetOptions class. As an API change, now you need a TargetOptions object to create a TargetMachine. Clang patch to follow. One small functionality change in PTX. PTX had commented out the machine verifier parts in their copy of printAndVerify. That now calls the version in LLVMTargetMachine. Users of PTX who need verification disabled should rely on not passing the command-line flag to enable it. llvm-svn: 145714	2011-12-02 22:16:29 +00:00
Dylan Noblesmith	eb962ee28f	TargetMachine: document unnamed bool argument Its meaning was slightly mysterious without looking at subclasses. llvm-svn: 145705	2011-12-02 20:53:57 +00:00
Chad Rosier	7610c57a02	Add a few more functions to TargetLibraryInfo. More of rdar://10500969. llvm-svn: 145596	2011-12-01 17:54:37 +00:00
Chad Rosier	8c6d695c1e	Add a few functions to TargetLibraryInfo. llvm-svn: 145508	2011-11-30 19:19:00 +00:00
Chad Rosier	8b15aca551	Alphabetize TargetLibraryInfo enum and fix doxygen comments. No functional change intended. llvm-svn: 145468	2011-11-30 01:51:49 +00:00
Chad Rosier	c5fa9f413a	Add support for sqrt, sqrtl, and sqrtf in TargetLibraryInfo. Disable (fptrunc (sqrt (fpext x))) -> (sqrtf x) transformation if -fno-builtin is specified. rdar://10466410 llvm-svn: 145460	2011-11-29 23:57:10 +00:00
Eli Friedman	d02d82d355	Add support for custom names for library functions in TargetLibraryInfo. Add a custom name for fwrite and fputs on x86-32 OSX. Make SimplifyLibCalls honor the custom names for fwrite and fputs. Fixes <rdar://problem/9815881>. llvm-svn: 144876	2011-11-17 01:27:36 +00:00
Evan Cheng	2b239cbcf6	Sink codegen optimization level into MCCodeGenInfo along side relocation model and code model. This eliminates the need to pass OptLevel flag all over the place and makes it possible for any codegen pass to use this information. llvm-svn: 144788	2011-11-16 08:38:26 +00:00
Jakob Stoklund Olesen	2709f65821	Break false dependencies before partial register updates. Two new TargetInstrInfo hooks lets the target tell ExecutionDepsFix about instructions with partial register updates causing false unwanted dependencies. The ExecutionDepsFix pass will break the false dependencies if the updated register was written in the previoius N instructions. The small loop added to sse-domains.ll runs twice as fast with dependency-breaking instructions inserted. llvm-svn: 144602	2011-11-15 01:15:30 +00:00
Craig Topper	8f22b6d95a	Fix comment for LegalizeTypeAction enum. llvm-svn: 144511	2011-11-13 22:11:24 +00:00
Richard Osborne	87ed868306	Don't introduce custom nodes after legalization in TargetLowering::BuildSDIV() and TargetLowering::BuildUDIV(). Fixes PR11283 llvm-svn: 143964	2011-11-07 17:09:05 +00:00
Eli Friedman	e140806c4c	Add missing includes/decls. llvm-svn: 143722	2011-11-04 18:45:34 +00:00
Lang Hames	e8bb71f80d	Rename NonScalarIntSafe to something more appropriate. llvm-svn: 143080	2011-10-26 23:50:43 +00:00
Dan Gohman	5d5a540002	Delete the Latency scheduling preference. llvm-svn: 142815	2011-10-24 17:56:48 +00:00
Lang Hames	747fa3d477	Haven't yet found a nice way to handle TargetData verification in the AsmParser. This patch adds validation for target data layout strings upon construction of TargetData objects. An attempt to construct a TargetData object from a malformed string will trigger an assertion. llvm-svn: 142605	2011-10-20 19:24:44 +00:00
Duncan Sands	a18fd5ecb4	Comment out or remove unused parameter names so as to avoid a slew of compiler warnings. llvm-svn: 142574	2011-10-20 08:57:54 +00:00
Andrew Trick	1b84db85e7	Use ARM/t2PseudoInst class from ARM/Thumb2 special adds/subs patterns. Clean up the patterns, fix comments, and avoid confusing both tools and coders. Note that the special adds/subs SelectionDAG nodes no longer have the dummy cc_out operand. llvm-svn: 142397	2011-10-18 19:18:52 +00:00
Andrew Trick	5e61a8e533	whitespace llvm-svn: 142394	2011-10-18 18:40:53 +00:00
Lang Hames	dfb145da26	Backing out patch. Will refactor to remove the AsmParser dependency on Target. llvm-svn: 142323	2011-10-18 00:23:49 +00:00
Lang Hames	4370882b22	Re-applying the target data layout verification patch from r142288, plus appropriate CMake dependencies. Thanks to Raphael Espindola for tracking down the CMake issues. llvm-svn: 142306	2011-10-17 23:24:48 +00:00
Nick Lewycky	c551c1c5f9	Add support for a new extension to the .file directive: .file filenumber "directory" "filename" This removes one join+split of the directory+filename in MC internals. Because bitcode files have independent fields for directory and filenames in debug info, this patch may change the .o files written by existing .bc files. llvm-svn: 142300	2011-10-17 23:05:28 +00:00
Rafael Espindola	a40b412c70	142288 broke the build: Linking CXX executable ../../bin/llvm-as ../../lib/libLLVMAsmParser.a(LLParser.cpp.o):/home/espindola/llvm/llvm/lib/AsmParser/LLParser.cpp:function llvm::LLParser::ParseTargetDefinition(): error: undefined reference to 'llvm::TargetData::parseSpecifier(llvm::StringRef, llvm::TargetData*)' clang-3: error: linker command failed with exit code 1 (use -v to see invocation) Revert "Validate target data layout strings." This reverts commit 599d2d4c25d3aee63a21d9c67a88cd43bd971b7e. llvm-svn: 142296	2011-10-17 22:37:51 +00:00
Lang Hames	416d4f3f36	Validate target data layout strings. Invalid strings in asm files will result in parse errors. Invalid string literals passed to TargetData constructors will result in an assertion. llvm-svn: 142288	2011-10-17 22:05:34 +00:00
Hal Finkel	e8091aad04	Add comments to TargetLowering.h indicating that the set*Alignment functions take arguments in log2(bytes) llvm-svn: 142213	2011-10-17 18:23:13 +00:00
Chris Lattner	9795434c0b	improve some of the documentation around target data layout strings. llvm-svn: 141733	2011-10-11 23:01:39 +00:00
Lang Hames	be4997db2f	Add a natural stack alignment field to TargetData, and prevent InstCombine from promoting allocas to preferred alignments that exceed the natural alignment. This avoids some potentially expensive dynamic stack realignments. The natural stack alignment is set in target data strings via the "S<size>" option. Size is in bits and must be a multiple of 8. The natural stack alignment defaults to "unspecified" (represented by a zero value), and the "unspecified" value does not prevent any alignment promotions. Target maintainers that care about avoiding promotions should explicitly add the "S<size>" option to their target data strings. llvm-svn: 141599	2011-10-10 23:42:08 +00:00

1 2 3 4 5 ...

2363 Commits