llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-31 07:52:55 +01:00

Author	SHA1	Message	Date
Chandler Carruth	728acc9bd9	Flip the new block-placement pass to be on by default. This is mostly to test the waters. I'd like to get results from FNT build bots and other bots running on non-x86 platforms. This feature has been pretty heavily tested over the last few months by me, and it fixes several of the execution time regressions caused by the inlining work by preventing inlining decisions from radically impacting block layout. I've seen very large improvements in yacr2 and ackermann benchmarks, along with the expected noise across all of the benchmark suite whenever code layout changes. I've analyzed all of the regressions and fixed them, or found them to be impossible to fix. See my email to llvmdev for more details. I'd like for this to be in 3.1 as it complements the inliner changes, but if any failures are showing up or anyone has concerns, it is just a flag flip and so can be easily turned off. I'm switching it on tonight to try and get at least one run through various folks' performance suites in case SPEC or something else has serious issues with it. I'll watch bots and revert if anything shows up. llvm-svn: 154816	2012-04-16 13:49:17 +00:00
Akira Hatanaka	103a1edc4d	Revert changes that were accidentally committed. llvm-svn: 154563	2012-04-11 23:19:55 +00:00
Akira Hatanaka	991d556243	Fix string that is being checked. llvm-svn: 154547	2012-04-11 23:11:33 +00:00
Akira Hatanaka	48dbb62cb1	Emit neg.s or neg.d only if -enable-no-nans-fp-math is supplied by user, otherwise expand FNEG during legalization. llvm-svn: 154546	2012-04-11 22:59:08 +00:00
Akira Hatanaka	11a442d515	Emit abs.s or abs.d only if -enable-no-nans-fp-math is supplied by user. Invalid operation is signaled if the operand of these instructions is NaN. llvm-svn: 154545	2012-04-11 22:49:04 +00:00
Akira Hatanaka	6636922675	Fix bugs in lowering of FCOPYSIGN nodes. - FCOPYSIGN nodes that have operands of different types were not handled. - Different code was generated depending on the endianness of the target. Additionally, code is added that emits INS and EXT instructions, if they are supported by target (they are R2 instructions). llvm-svn: 154540	2012-04-11 22:13:04 +00:00
Akira Hatanaka	1b46e841a2	Have TargetLowering::getPICJumpTableRelocBase return a node that points to the GOT if jump table uses 64-bit gp-relative relocation. llvm-svn: 154341	2012-04-09 20:32:12 +00:00
Akira Hatanaka	5cce394620	Add lines in global-address.ll to test N32 and N64 code generation. llvm-svn: 154202	2012-04-06 20:23:36 +00:00
Akira Hatanaka	f3ec345016	Reapply test case in 154038, this time with triple to prevent the backend from emitting gp_rel relocation. llvm-svn: 154122	2012-04-05 20:44:35 +00:00
Akira Hatanaka	e5ea70212f	Reapply 154038 without the failing test. llvm-svn: 154062	2012-04-04 22:16:36 +00:00
Owen Anderson	f6f930a990	Revert r154038. It was causing make check failures. llvm-svn: 154054	2012-04-04 21:18:58 +00:00
Akira Hatanaka	4df2267566	Fix LowerGlobalAddress to produce instructions with the correct relocation types for N32 ABI. Add new test case and update existing ones. llvm-svn: 154038	2012-04-04 19:02:38 +00:00
Akira Hatanaka	c8028e2551	Fix LowerConstantPool to produce instructions with the correct relocation types for N32 ABI and update test case. llvm-svn: 154034	2012-04-04 18:26:12 +00:00
Akira Hatanaka	913d78a99c	Fix LowerBlockAddress to produce instructions with the correct relocation types for N32 ABI and update test case. llvm-svn: 154031	2012-04-04 18:22:53 +00:00
Akira Hatanaka	fa2f5577e9	Expand FREM. llvm-svn: 153671	2012-03-29 18:43:11 +00:00
Akira Hatanaka	3d463d748a	Fix test case. llvm-svn: 153555	2012-03-28 00:25:01 +00:00
Eli Bendersky	3ef88c1833	Continue cleanup of LIT, getting rid of the remaining artifacts from dejagnu * Removed test/lib/llvm.exp - it is no longer needed * Deleted the dg.exp reading code from test/lit.cfg. There are no dg.exp files left in the test suite so this code is no longer required. test/lit.cfg is now much shorter and clearer * Removed a lot of duplicate code in lit.local.cfg files that need access to the root configuration, by adding a "root" attribute to the TestingConfig object. This attribute is dynamically computed to provide the same information as was previously provided by the custom getRoot functions. * Documented the config.root attribute in docs/CommandGuide/lit.pod llvm-svn: 153408	2012-03-25 09:02:19 +00:00
Benjamin Kramer	d42906ae81	Remove the no longer existent psp triple from a test. The test fell back to the C backend, making it useless and it started to fail on configurations that don't build the C backend. llvm-svn: 152342	2012-03-08 21:22:27 +00:00
Akira Hatanaka	f4288c9e0e	Test case for r152280, r152285 and r152290. llvm-svn: 152292	2012-03-08 03:32:42 +00:00
Akira Hatanaka	75b06f4a49	Fix bugs which were introduced when support for base+index floating point loads and stores was added. - SelectAddr should return false if Parent is an unaligned f32 load or store. - Only aligned load and store nodes should be matched to select reg+imm floating point instructions. - MIPS does not have support for f64 unaligned load or store instructions. llvm-svn: 151843	2012-03-01 22:12:30 +00:00
Akira Hatanaka	0934449dd8	Add support for floating point base register + offset register addressing mode load and store instructions. llvm-svn: 151611	2012-02-28 02:55:02 +00:00
Akira Hatanaka	8fc9a35d3f	Add definitions of floating point multiply add/sub and negative multiply add/sub instructions. llvm-svn: 151415	2012-02-25 00:21:52 +00:00
Akira Hatanaka	3b3ee53886	Add an option to use a virtual register as the global base register instead of reserving a physical register ($gp or $28) for that purpose. This will completely eliminate loads that restore the value of $gp after every function call, if the register allocator assigns a callee-saved register, or eliminate unnecessary loads if it assigns a temporary register. example: .cpload $25 // set $gp. ... .cprestore 16 // store $gp to stack slot 16($sp). ... jalr $25 // function call. clobbers $gp. lw $gp, 16($sp) // not emitted if callee-saved reg is chosen. ... lw $2, 4($gp) ... jalr $25 // function call. lw $gp, 16($sp) // not emitted if $gp is not live after this instruction. ... llvm-svn: 151402	2012-02-24 22:34:47 +00:00
Eli Bendersky	4afdeeb682	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and not Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Akira Hatanaka	874523adc5	Add a new MachineJumpTableInfo entry type, EK_GPRel64BlockAddress, which is needed to emit a 64-bit gp-relative relocation entry. Make changes necessary for emitting jump tables which have entries with directive .gpdword. This patch does not implement the parts needed for direct object emission or JIT. llvm-svn: 149668	2012-02-03 04:33:00 +00:00
Bill Wendling	7761976036	Remove all references to the old EH. There was always the current EH. -- Ministry of Truth llvm-svn: 149335	2012-01-31 02:09:07 +00:00
Akira Hatanaka	175341c860	Modify MipsFrameLowering::emitPrologue and emitEpilogue. - Use MipsAnalyzeImmediate to expand immediates that do not fit in 16-bit. - Change the types of variables so that they are sufficiently large to handle 64-bit pointers. - Emit instructions to set register $28 in a function prologue after instructions which store callee-saved registers have been emitted. llvm-svn: 148917	2012-01-25 04:12:04 +00:00
Akira Hatanaka	6880302ac2	Lower 64-bit immediates using MipsAnalyzeImmediate that has just been added. Add a test case to show fewer instructions are needed to load an immediate with the new way of loading immediates. llvm-svn: 148908	2012-01-25 03:01:35 +00:00
Akira Hatanaka	12cdcf3bc6	Pattern for f32 to i64 conversion. llvm-svn: 148869	2012-01-24 22:05:25 +00:00
Akira Hatanaka	7b1d08124d	64-bit sign extension in register instructions. llvm-svn: 148862	2012-01-24 21:41:09 +00:00
Akira Hatanaka	fdcba196ca	Have getRegForInlineAsmConstraint return the correct register class when target is Mips64. llvm-svn: 147516	2012-01-04 02:45:01 +00:00
Akira Hatanaka	72c5800ed2	Test case for r147232. llvm-svn: 147233	2011-12-24 03:05:43 +00:00
Akira Hatanaka	4ab17eaca0	Fix bug in zero-store peephole pattern reported in pr11615. The patch and test case were originally written by Mans Rullgard. llvm-svn: 147024	2011-12-21 00:31:10 +00:00
Akira Hatanaka	0af792d12b	Expand 64-bit CTLZ nodes if target architecture does not support it. Add test case for DCLO and DCLZ. llvm-svn: 147022	2011-12-21 00:20:27 +00:00
Akira Hatanaka	fb94688c7a	Test case for r147017. llvm-svn: 147018	2011-12-20 23:58:36 +00:00
Akira Hatanaka	2e4f1786b1	Add function MipsDAGToDAGISel::SelectMULT and factor out code that generates nodes needed for multiplication. Add code for selecting 64-bit MULHS and MULHU nodes. llvm-svn: 147008	2011-12-20 23:10:57 +00:00
Akira Hatanaka	f728a1b2c5	64-bit data directive. llvm-svn: 147005	2011-12-20 22:52:19 +00:00
Akira Hatanaka	ad193d95ae	32-to-64-bit sext_inreg pattern. llvm-svn: 147004	2011-12-20 22:40:40 +00:00
Akira Hatanaka	8728f4ed69	Add code in MipsDAGToDAGISel for selecting constant +0.0. MIPS64 can generate constant +0.0 with a single DMTC1 instruction. llvm-svn: 146999	2011-12-20 22:25:50 +00:00
Akira Hatanaka	7ef923c1f0	Add a test case for r146900. llvm-svn: 146901	2011-12-19 20:24:28 +00:00
Akira Hatanaka	e54da3bfa2	Add patterns for matching immediates whose lower 16-bit is cleared. These patterns emit a single LUi instruction instead of a pair of LUi and ORi. llvm-svn: 146900	2011-12-19 20:21:18 +00:00
Akira Hatanaka	804863071f	Remove definitions of double word shift plus 32 instructions. Assembler or direct-object emitter should emit the appropriate shift instruction depending on the shift amount. llvm-svn: 146893	2011-12-19 19:44:09 +00:00
Akira Hatanaka	b7ebcb2ded	Remove the restriction on the first operand of the add node in SelectAddr. This change reduces the number of instructions generated. For example, (load (add (sub $n0, $n1), (MipsLo got(s)))) results in the following sequence of instructions: 1. sub $n2, $n0, $n1 2. lw got(s)($n2) Previously, three instructions were needed. 1. sub $n2, $n0, $n1 2. addiu $n3, $n2, got(s) 3. lw 0($n3) llvm-svn: 146888	2011-12-19 19:28:37 +00:00
Akira Hatanaka	3fca32d88e	Add support for local dynamic TLS model in LowerGlobalTLSAddress. Direct object emission is not supported yet, but a patch that adds the support should follow soon. llvm-svn: 146572	2011-12-14 18:26:41 +00:00
Akira Hatanaka	a9290d5ab9	Move direct object emitter test to directory test/MC/Mips. Rename it to elf-relsym.ll. llvm-svn: 146470	2011-12-13 03:50:34 +00:00
Akira Hatanaka	28140f744a	Relocation against a symbol, instead of against section. We had some extreme test cases where there were a lot of relocations applied relative to a large rodata section. Gas would create a symbol for each of these whereas we would be relative to the beginning of the rodata section. This change mimics what gas does. Patch by Jack Carter. llvm-svn: 146468	2011-12-13 02:27:40 +00:00
Akira Hatanaka	46dd9e66a6	Test case for r146432 by Jack Carter. llvm-svn: 146433	2011-12-12 22:41:39 +00:00
Chandler Carruth	2bedf185c9	Manually upgrade the test suite to specify the flag to cttz and ctlz. I followed three heuristics for deciding whether to set 'true' or 'false': - Everything target independent got 'true' as that is the expected common output of the GCC builtins. - If the target arch only has one way of implementing this operation, set the flag in the way that exercises the most of codegen. For most architectures this is also the likely path from a GCC builtin, with 'true' being set. It will (eventually) require lowering away that difference, and then lowering to the architecture's operation. - Otherwise, set the flag differently depending on which target operation should be tested. Let me know if anyone has any issue with this pattern or would like specific tests of another form. This should allow the x86 codegen to just iteratively improve as I teach the backend how to differentiate between the two forms, and everything else should remain exactly the same. llvm-svn: 146370	2011-12-12 11:59:10 +00:00
Akira Hatanaka	ce89ae9f84	jalr should use t9 ($25) for indirect calls regardless of the relocation model specified. llvm-svn: 146229	2011-12-09 01:45:12 +00:00
Akira Hatanaka	7db0038ac0	32 to 64-bit zext pattern. llvm-svn: 146096	2011-12-07 23:14:41 +00:00

200 Commits