llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Andrew Trick	75af469e99	Added MispredictPenalty to SchedMachineModel. This replaces an existing subtarget hook on ARM and allows standard CodeGen passes to potentially use the property. llvm-svn: 161471	2012-08-08 02:44:16 +00:00
Andrew Trick	b9c8074dcd	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraties for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex contraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Andrew Trick	baf8a62800	Reapply "Make NumMicroOps a variable in the subtarget's instruction itinerary." Reapplies r159406 with minor cleanup. The regressions appear to have been spurious. llvm-svn: 159541	2012-07-02 18:10:42 +00:00
Andrew Trick	251f64f946	Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." This reverts commit r159406. I noticed a performance regression so I'll back out for now. llvm-svn: 159411	2012-06-29 07:10:41 +00:00
Andrew Trick	52238a0ce5	Make NumMicroOps a variable in the subtarget's instruction itinerary. The TargetInstrInfo::getNumMicroOps API does not change, but soon it will be used by MachineScheduler. Now each subtarget can specify the number of micro-ops per itinerary class. For ARM, this is currently always dynamic (-1), because it is used for load/store multiple which depends on the number of register operands. Zero is now a valid number of micro-ops. This can be used for nop pseudo-instructions or instructions that the hardware can squash during dispatch. llvm-svn: 159406	2012-06-29 03:23:18 +00:00
Andrew Trick	80ddb55a53	ARM itinerary properties. llvm-svn: 157980	2012-06-05 03:44:43 +00:00
Evan Cheng	12bfe1150d	Fix a number of problems with ARM fused multiply add/subtract instructions. 1. The new instruction itinerary entries are not properly described. 2. The asm parser can't handle vfms and vfnms. 3. There were no assembler, disassembler test cases. 4. HasNEON2 has the wrong assembler predicate. rdar://10139676 llvm-svn: 154456	2012-04-11 00:13:00 +00:00
Evan Cheng	6dc21c7358	Sorry, several patches in one. TargetInstrInfo: Change produceSameValue() to take MachineRegisterInfo as an optional argument. When in SSA form, targets can use it to make more aggressive equality analysis. Machine LICM: 1. Eliminate isLoadFromConstantMemory, use MI.isInvariantLoad instead. 2. Fix a bug which prevent CSE of instructions which are not re-materializable. 3. Use improved form of produceSameValue. ARM: 1. Teach ARM produceSameValue to look pass some PIC labels. 2. Look for operands from different loads of different constant pool entries which have same values. 3. Re-implement PIC GA materialization using movw + movt. Combine the pair with a "add pc" or "ldr [pc]" to form pseudo instructions. This makes it possible to re-materialize the instruction, allow machine LICM to hoist the set of instructions out of the loop and make it possible to CSE them. It's a bit hacky, but it significantly improve code quality. 4. Some minor bug fixes as well. With the fixes, using movw + movt to materialize GAs significantly outperform the load from constantpool method. 186.crafty and 255.vortex improved > 20%, 254.gap and 176.gcc ~10%. llvm-svn: 123905	2011-01-20 08:34:58 +00:00
Bob Wilson	bd3d3d2937	Add support for NEON VLD3-dup instructions. The encoding for alignment in VLD4-dup instructions is still a work in progress. llvm-svn: 120356	2010-11-30 00:00:35 +00:00
Bob Wilson	aa197b07e6	Add support for NEON VLD3-dup instructions. llvm-svn: 120312	2010-11-29 19:35:29 +00:00
Bob Wilson	cb675664c4	Fix copy-and-paste errors in VLD2-dup scheduling itineraries. llvm-svn: 120311	2010-11-29 19:35:23 +00:00
Bob Wilson	3bb61d1932	Add support for NEON VLD2-dup instructions. llvm-svn: 120236	2010-11-28 06:51:26 +00:00
Bob Wilson	cbd6281807	Add NEON VLD1-dup instructions (load 1 element to all lanes). llvm-svn: 120194	2010-11-27 06:35:16 +00:00
Bob Wilson	c66e4574f1	Fix incorrect scheduling itineraries for NEON vld1/vst1 instructions. I added these instructions recently but I have no idea where these "1" values in the NextCycles field came from. As far as I can tell now, these instruction stages are clearly intended to overlap. llvm-svn: 120193	2010-11-27 06:35:09 +00:00
Evan Cheng	239d9b439d	Conditional moves are slightly more expensive than moves. llvm-svn: 118985	2010-11-13 05:14:20 +00:00
Evan Cheng	eab7251695	Fix preload instruction isel. Only v7 supports pli, and only v7 with mp extension supports pldw. Add subtarget attribute to denote mp extension support and legalize illegal ones to nothing. llvm-svn: 118160	2010-11-03 06:34:55 +00:00
Evan Cheng	e156473432	Modify scheduling itineraries to correct instruction latencies (not operand latencies) of loads. llvm-svn: 118134	2010-11-03 00:40:22 +00:00
Bob Wilson	248c691f9a	Add NEON VST1-lane instructions. Partial fix for Radar 8599955. llvm-svn: 118069	2010-11-02 21:18:25 +00:00
Bob Wilson	b6bc135df8	Add NEON VLD1-lane instructions. Partial fix for Radar 8599955. llvm-svn: 117964	2010-11-01 22:04:05 +00:00
Evan Cheng	7695213793	Fix fpscr <-> GPR latency info. llvm-svn: 117737	2010-10-29 23:16:55 +00:00
Andrew Trick	4a3b819c1f	putback r116983 and fix simple-fp-encoding.ll tests llvm-svn: 116992	2010-10-21 03:40:16 +00:00
Owen Anderson	7da515c665	Revert r116983, which is breaking all the buildbots. llvm-svn: 116987	2010-10-21 03:11:16 +00:00
Evan Cheng	0b9eaaf45d	Add missing scheduling itineraries for transfers between core registers and VFP registers. llvm-svn: 116983	2010-10-21 01:12:00 +00:00
Evan Cheng	6aac1548ab	More ARM scheduling itinerary fixes. llvm-svn: 116266	2010-10-11 23:41:41 +00:00
Evan Cheng	77ba7b098a	Proper VST scheduling itineraries. llvm-svn: 116251	2010-10-11 22:03:18 +00:00
Evan Cheng	8c17a06411	Add VLD4 scheduling itineraries. llvm-svn: 116143	2010-10-09 04:07:58 +00:00
Evan Cheng	df7f5672ee	Finish vld3 and vld4. llvm-svn: 116140	2010-10-09 01:45:34 +00:00
Evan Cheng	bf6307d869	Complete vld2 instruction itineries. llvm-svn: 116136	2010-10-09 01:26:12 +00:00
Evan Cheng	c0933d5ec1	Multiply instructions are issued on pipeline 0. They do not need to reserve pipeline 1. llvm-svn: 116135	2010-10-09 01:15:04 +00:00
Evan Cheng	15fc769cf2	Correct some load / store instruction itinerary mistakes: 1. Cortex-A8 load / store multiplies can only issue on ALU0. 2. Eliminate A8_Issue, A8_LSPipe will correctly limit the load / store issues. 3. Correctly model all vld1 and vld2 variants. llvm-svn: 116134	2010-10-09 01:03:04 +00:00
Evan Cheng	1ce29574c2	Model operand cycles of vldm / vstm; also fixes scheduling itineraries of vldr / vstr, etc. llvm-svn: 115898	2010-10-07 01:50:48 +00:00
Evan Cheng	6fbb6dea7c	- Add TargetInstrInfo::getOperandLatency() to compute operand latencies. This allow target to correctly compute latency for cases where static scheduling itineraries isn't sufficient. e.g. variable_ops instructions such as ARM::ldm. This also allows target without scheduling itineraries to compute operand latencies. e.g. X86 can return (approximated) latencies for high latency instructions such as division. - Compute operand latencies for those defined by load multiple instructions, e.g. ldm and those used by store multiple instructions, e.g. stm. llvm-svn: 115755	2010-10-06 06:27:31 +00:00
Evan Cheng	0da8dff3c7	Fix scheduling infor for vmovn and vshrn which I broke accidentially. llvm-svn: 115354	2010-10-01 21:48:06 +00:00
Evan Cheng	cf5ed3cd53	Add operand cycles for vldr / vstr. llvm-svn: 115353	2010-10-01 21:40:30 +00:00
Evan Cheng	fc1aee5b3c	NEON scheduling info fix. vmov reg, reg are single cycle instructions. llvm-svn: 115344	2010-10-01 20:50:58 +00:00
Evan Cheng	fa5d40dbff	ARM instruction itinerary fixes: 1. Cortex-a9 8-bit and 16-bit loads / stores AGU cycles are 1 cycle longer than 32-bit ones. 2. Cortex-a9 is out-of-order so model all read cycles as cycle 1. 3. Lots of other random fixes for A8 and A9. llvm-svn: 115121	2010-09-30 01:08:25 +00:00
Evan Cheng	b44d480808	Model Cortex-a9 load to SUB, RSB, ADD, ADC, SBC, RSC, CMN, MVN, or CMP pipeline forwarding path. llvm-svn: 115098	2010-09-29 22:42:35 +00:00
Evan Cheng	7eb08b1ad9	Separate itinerary classes for mvn from mov; for tst / teq from cmp / cmn. llvm-svn: 115010	2010-09-29 00:49:25 +00:00
Evan Cheng	7fffe3cf58	Assign bitwise binary instructions different itinerary classes from ALU instructions such as add / sub. llvm-svn: 115008	2010-09-29 00:27:46 +00:00
Evan Cheng	39c462b4f1	Add support to model pipeline bypass / forwarding. llvm-svn: 115005	2010-09-28 23:50:49 +00:00
Evan Cheng	2279dc1d2a	Remove a unused instruction itinerary class. llvm-svn: 114782	2010-09-25 01:06:02 +00:00
Evan Cheng	64a24ab747	Fix zero and sign extension instructions scheduling itineraries. llvm-svn: 114780	2010-09-25 00:49:35 +00:00
Evan Cheng	124ae30ef8	More pseudo instruction scheduling itinerary fixes. llvm-svn: 114768	2010-09-24 22:41:41 +00:00
Evan Cheng	eb81dc39dc	Fix scheduling itinerary for pseudo mov immediate instructions which expand into two real instructions. llvm-svn: 114766	2010-09-24 22:03:46 +00:00
Evan Cheng	b87520ca74	Fix LDM_RET schedule itinery. llvm-svn: 113435	2010-09-08 22:57:08 +00:00
Jim Grosbach	a0473aa51c	minor housekeeping cleanup: 80-column, trailing whitespace, spelling, etc.. No functional change. llvm-svn: 106988	2010-06-28 04:27:01 +00:00
Anton Korobeynikov	e325c693a5	Make processor FUs unique for given itinerary. This extends the limit of 32 FU per CPU arch to 32 per intinerary allowing precise modelling of quite complex pipelines in the future. llvm-svn: 101754	2010-04-18 20:31:01 +00:00
Anton Korobeynikov	51fba0e4eb	Split A8/A9 itins - they already were too big. llvm-svn: 100672	2010-04-07 18:22:11 +00:00

48 Commits