llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Krzysztof Parzyszek	9830c7641e	[Hexagon] Fix incorrect generation of S4_subi_asl_ri Patch by Jyotsna Verma. llvm-svn: 279267	2016-08-19 16:35:05 +00:00
Krzysztof Parzyszek	7674324b11	[Hexagon] Allow tail-call optimization when mixing C and fast calling conv Patch by Arnold Schwaighofer. llvm-svn: 279251	2016-08-19 15:02:18 +00:00
Krzysztof Parzyszek	6c216ccafc	[Hexagon] Check for empty live interval Patch by Brendon Cahoon. llvm-svn: 279249	2016-08-19 14:29:43 +00:00
Krzysztof Parzyszek	53fcc0a9e4	[Hexagon] Mark PS_jumpret as pseudo-instruction, expand it into J2_jumpr llvm-svn: 279241	2016-08-19 14:04:45 +00:00
Krzysztof Parzyszek	3cad3632a2	[Hexagon] Improvements to handling and generation of FP instructions Improved handling of fma, floating point min/max, additional load/store instructions for floating point types. Patch by Jyotsna Verma. llvm-svn: 279239	2016-08-19 13:34:31 +00:00
Krzysztof Parzyszek	7d14392913	[Hexagon] Create vcombine in HexagonCopyToCombine llvm-svn: 279067	2016-08-18 14:12:34 +00:00
Brendon Cahoon	17a7582d1f	[Pipeliner] Fix an assert due to invalid Phi in the epilog The pipeliner was generating an invalid Phi name for an operand in the epilog block, which caused an assert in the live variable analysis pass. The fix is to the code that generates new Phis in the epilog block. In this case, there is an existing Phi that needs to be reused rather than creating a new Phi instruction. Differential Revision: https://reviews.llvm.org/D23513 llvm-svn: 278805	2016-08-16 14:29:24 +00:00
Ron Lieberman	3fdb0d769a	[Hexagon] Improve test to check for @PCREL, only run llc, not opt -> llc. llvm-svn: 278796	2016-08-16 13:10:09 +00:00
James Molloy	3fdcf4e64c	[LSR] Don't try and create post-inc expressions on non-rotated loops If a loop is not rotated (for example when optimizing for size), the latch is not the backedge. If we promote an expression to post-inc form, we not only increase register pressure and add a COPY for that IV expression but for all IVs! Motivating testcase: void f(float *a, float *b, float *c, int n) { while (n-- > 0) *c++ = *a++ + *b++; } It's imperative that the pointer increments be located in the latch block and not the header block; if not, we cannot use post-increment loads and stores and we have to keep both the post-inc and pre-inc values around until the end of the latch which bloats register usage. llvm-svn: 278658	2016-08-15 07:53:03 +00:00
Ron Lieberman	4789174868	Fix 'unsupported relocation type R_HEX_6_X' for symbol .rodata LowerTargetConstantPool is not properly setting the TargetFlag to indicate desired relocation. Coding error, the offset parameter was omitted, so the TargetFlag was used as the offset, and the TargetFlag defaulted to zero. This only affects -fpic compilation, and only those items created in a Constant Pool, for example a vector of constants. Halide ran into this issue. llvm-svn: 278614	2016-08-13 23:41:11 +00:00
Krzysztof Parzyszek	c1b900dd08	[Hexagon] Allow non-returning calls in hardware loops llvm-svn: 278416	2016-08-11 21:14:25 +00:00
Krzysztof Parzyszek	14a4f4a432	If-conversion incorrectly calculates liveness of redefined registers Differential Revision: https://reviews.llvm.org/D23207 llvm-svn: 278383	2016-08-11 18:42:06 +00:00
Krzysztof Parzyszek	fb4a7f1227	[Hexagon] Skip byval arguments when checking parameter attributes From the point of view of register assignment, byval parameters are ignored: a byval parameter is not going to be assigned to a register, and it will not affect the assignments of subsequent parameters. When matching registers with parameters in the bit tracker, make sure to skip byval parameters before advancing the registers. llvm-svn: 278375	2016-08-11 18:15:16 +00:00
Kyle Butt	beb1a39bc6	Codegen: Tail Merge: Be less aggressive with special cases. This change makes it possible for tail-duplication and tail-merging to be disjoint. By being less aggressive when merging during layout, there are no overlapping cases between tail-duplication and tail-merging, provided the thresholds are disjoint. There is a remaining TODO to benchmark the succ_size() test for non-layout tail merging. llvm-svn: 278265	2016-08-10 18:36:18 +00:00
Krzysztof Parzyszek	458d8ce010	[Hexagon] Simplify the SplitConst32/64 pass llvm-svn: 278256	2016-08-10 18:05:47 +00:00
Krzysztof Parzyszek	bdc1668cd8	[Hexagon] Add extra patterns for single-precision min/max instructions llvm-svn: 278252	2016-08-10 17:56:24 +00:00
Krzysztof Parzyszek	631100a1eb	[Hexagon] Use integer instructions for floating point immediates Floating point instructions use general purpose registers, so the few instructions that can put floating point immediates into registers are, in fact, integer instructions. Use them explicitly instead of having pseudo-instructions specifically for dealing with floating point values. Simplify the constant loading instructions (from sdata) to have only two: one for 32-bit values and one for 64-bit values: CONST32 and CONST64. llvm-svn: 278244	2016-08-10 16:46:36 +00:00
Krzysztof Parzyszek	24135ad35e	[Hexagon] Add pattern for 64-bit mulhs llvm-svn: 278040	2016-08-08 19:24:25 +00:00
Krzysztof Parzyszek	5c74e232b1	[Hexagon] Validate register class when doing bit simplification llvm-svn: 277740	2016-08-04 17:56:19 +00:00
Krzysztof Parzyszek	4a04420cea	[Hexagon] Clear kill flags from modified registers in peephole optimizer llvm-svn: 277727	2016-08-04 14:17:16 +00:00
Krzysztof Parzyszek	0ae2c174f5	[Hexagon] Generate COPY/REG_SEQUENCE more aggressively for vectors llvm-svn: 277626	2016-08-03 18:35:48 +00:00
Krzysztof Parzyszek	1040f51a3b	[Hexagon] Do not check alignment for unsized types in isLegalAddressingMode When the same base address is used to load two different data types, LSR would assume a memory type of "void". This type is not sized and has no alignment information. Checking for it causes a crash. llvm-svn: 277601	2016-08-03 15:06:18 +00:00
Krzysztof Parzyszek	79d02725d4	[Hexagon] Recognize vcombine in copy propagation llvm-svn: 277528	2016-08-02 21:49:20 +00:00
Krzysztof Parzyszek	37ebf5f0df	[Hexagon] Prefer _io over _rr for 64-bit store with constant offset Identify patterns where the address is aligned to an 8-byte boundary, but the base address and the constant offset are both proper multiples of 4. In such cases, extract Base+4 into a separate instruction, and use S2_storerd_io, instead of using S4_storerd_rr. llvm-svn: 277497	2016-08-02 18:50:05 +00:00
Ron Lieberman	84b9473461	[Hexagon] Generate vector printing instructions llvm-svn: 277370	2016-08-01 19:36:39 +00:00
Krzysztof Parzyszek	3495701817	[Hexagon] Check for offset overflow when reserving scavenging slots Scavenging slots were only reserved when pseudo-instruction expansion in frame lowering created new virtual registers. It is possible to still need a scavenging slot even if no virtual registers were created, in cases where the stack is large enough to overflow instruction offsets. llvm-svn: 277355	2016-08-01 17:15:30 +00:00
Michael Kuperstein	2b452a8187	[Hexagon] Fix test that uses -debug-only to require asserts. llvm-svn: 277218	2016-07-29 21:44:33 +00:00
Krzysztof Parzyszek	3adce2360c	[Hexagon] Testcase for not merging stores into a misaligned store The DAG combiner will try to merge consecutive stores into a bigger store, unless the resulting store is not fast. Misaligned vector stores are allowed on Hexagon, but are not fast. Add a testcase to make sure this type of merging does not occur. Patch by Pranav Bhandarkar. llvm-svn: 277182	2016-07-29 17:55:37 +00:00
Krzysztof Parzyszek	3a5bc2df22	Revert r277178, the actual change had already been applied Will submit another patch with the testcase only. llvm-svn: 277180	2016-07-29 17:50:47 +00:00
Krzysztof Parzyszek	f5f51e9c74	[Hexagon] Misaligned loads and stores are not fast The DAG combiner tries to merge stores to adjacent vector wide memory locations by creating stores which are integral multiples of the vector width. Discourage this by informing it that this is slow. This should not affect legalization passes, because all of them ignore the "Fast" argument. Patch by Pranav Bhandarkar. llvm-svn: 277178	2016-07-29 17:45:16 +00:00
Brendon Cahoon	e37295579e	MachinePipeliner pass that implements Swing Modulo Scheduling Software pipelining is an optimization for improving ILP by overlapping loop iterations. Swing Modulo Scheduling (SMS) is an implementation of software pipelining that attempts to reduce register pressure and generate efficient pipelines with a low compile-time cost. This implementation of SMS is a target-independent back-end pass. When enabled, the pass should run just prior to the register allocation pass, while the machine IR is in SSA form. If the pass is successful, then the original loop is replaced by the optimized loop. The optimized loop contains one or more prolog blocks, the pipelined kernel, and one or more epilog blocks. This pass is enabled for Hexagon only. To enable for other targets, a couple of target specific hooks must be implemented, and the pass needs to be called from the target's TargetMachine implementation. Differential Review: http://reviews.llvm.org/D16829 llvm-svn: 277169	2016-07-29 16:44:44 +00:00
Krzysztof Parzyszek	ce9680792b	[Hexagon] Custom lower VECTOR_SHUFFLE and EXTRACT_SUBVECTOR for HVX If the mask of a vector shuffle has alternating odd or even numbers starting with 1 or 0 respectively up to the largest possible index for the given type in the given HVX mode (single or double) we can generate vpacko or vpacke instruction respectively. E.g. %42 = shufflevector <32 x i16> %37, <32 x i16> %41, <32 x i32> <i32 1, i32 3, ..., i32 63> is %42.h = vpacko(%41.w, %37.w) Patch by Pranav Bhandarkar. llvm-svn: 277168	2016-07-29 16:44:27 +00:00
Krzysztof Parzyszek	4ca53a9c57	[Hexagon] Improve balancing of address calculation Rebalances address calculation trees and applies Hexagon-specific optimizations to the trees to improve instruction selection. Patch by Tobias Edler von Koch. llvm-svn: 277151	2016-07-29 15:15:35 +00:00
Krzysztof Parzyszek	fe4b956a14	[Hexagon] Implement MI-level constant propagation llvm-svn: 277028	2016-07-28 20:01:59 +00:00
Krzysztof Parzyszek	7e077bcfe5	[Hexagon] Insert CFI instructions before throwing calls Normally, CFI instructions should be inserted after allocframe, but if allocframe is in the same packet with a call, the CFI instructions should be inserted before that packet. llvm-svn: 277020	2016-07-28 19:13:46 +00:00
Krzysztof Parzyszek	ca2db7e8ff	[Hexagon] Find speculative loop preheader in hardware loop generation Before adding a new preheader block, check if there is a candidate block where the loop setup could be placed speculatively. This will be off by default. llvm-svn: 276919	2016-07-27 21:20:54 +00:00
Krzysztof Parzyszek	8aa44efbb6	[Hexagon] Do not optimize volatile stack spill slots llvm-svn: 276916	2016-07-27 20:50:42 +00:00
Krzysztof Parzyszek	e6e3bb2045	[Hexagon] Post-increment loads/stores enhancements - Generate vector post-increment stores more aggressively. - Predicate post-increment and vector stores in early if-conversion. llvm-svn: 276800	2016-07-26 20:30:30 +00:00
Krzysztof Parzyszek	0984ca425e	[Hexagon] Gracefully handle reg class mismatch in HexagonLoopReschedule llvm-svn: 276793	2016-07-26 19:17:13 +00:00
Krzysztof Parzyszek	97d3aa900c	[Hexagon] Rerun bit tracker on new instructions in RIE Consider this case: vreg1 = A2_zxth vreg0 (1) ... vreg2 = A2_zxth vreg1 (2) Redundant instruction elimination could delete the instruction (1) because the user (2) only cares about the low 16 bits. Then it could delete (2) because the input is already zero-extended. The problem is that the properties allowing each individual instruction to be deleted depend on the existence of the other instruction, so either one can be deleted, but not both. The existing check for this situation in RIE was insufficient. The fix is to update all dependent cells when an instruction is removed (replaced via COPY) in RIE. llvm-svn: 276792	2016-07-26 19:08:45 +00:00
Krzysztof Parzyszek	6d72583592	[Hexagon] Bitwise operations for insert/extract word not simplified Change the bit simplifier to generate REG_SEQUENCE instructions in addition to COPY, which will handle cases of word insert/extract. llvm-svn: 276787	2016-07-26 18:30:11 +00:00
Krzysztof Parzyszek	587906f308	[Hexagon] Add support for proper handling of H and L constraints H -> High part of reg pair. L -> Low part of reg pair. Patch by Sundeep Kushwaha. llvm-svn: 276773	2016-07-26 17:31:02 +00:00
Krzysztof Parzyszek	9b5d973061	[Hexagon] Add target feature to generate long calls llvm-svn: 276638	2016-07-25 14:42:11 +00:00
Krzysztof Parzyszek	b56c88136d	[Hexagon] Use loop data prefetch on Hexagon llvm-svn: 276422	2016-07-22 14:22:43 +00:00
Krzysztof Parzyszek	47ef3fe260	[Hexagon] Handle returning small structures by value This is compliant with the official ABI, but allows experimentation with calling conventions. llvm-svn: 275822	2016-07-18 17:30:41 +00:00
Krzysztof Parzyszek	524c3d031b	[Hexagon] Enable .cur formation in MISched for Hexagon V60 Schedule a load and its use in the same packet in MISched. Previously, isResourceAvailable was returning false for dependences in the same packet, which prevented MISched from packetizing a load and its use in the same packet for v60. Patch by Ikhlas Ajbar. llvm-svn: 275804	2016-07-18 16:05:27 +00:00
Krzysztof Parzyszek	1e2ce9dfbe	[Hexagon] Use timing class info as tie-breaker in machine scheduler Patch by Sirish Pande. llvm-svn: 275794	2016-07-18 15:17:10 +00:00
Krzysztof Parzyszek	cc6e20c7d4	[Hexagon] HexagonMachineScheduler should account for resources The machine scheduler needs to account for available resources more accurately in order to avoid scheduling an instruction that forces a new packet to be created. This occurs in two ways: First, an instruction without an available resource may have a large priority due to other metrics and be scheduled when there are other instructions with available resources. Second, an instruction with a non-zero latency may become available prematurely. In both these cases, we attempt to change the priority in order to allow a better instruction to be scheduled. Patch by Brendon Cahoon. llvm-svn: 275793	2016-07-18 14:52:13 +00:00
Krzysztof Parzyszek	d8fd4012ae	[Hexagon] Fix zero latency instructions with multiple predecessors An instruction may have multiple predecessors that are candidates for using .cur. However, only one of them can use .cur in the packet. When this case occurs, we need to make sure that only one of the dependences gets a 0 latency value. Patch by Brendon Cahoon. llvm-svn: 275790	2016-07-18 14:23:10 +00:00
Krzysztof Parzyszek	6119f0e34d	[Hexagon] Improve patterns with stack-based addressing - Treat bitwise OR with a frame index as an ADD wherever possible, fold it into addressing mode. - Extend patterns for memops to allow memops with frame indexes as address operands. llvm-svn: 275569	2016-07-15 15:35:52 +00:00
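
The shuffle-mask condition described in commit ce9680792b (vpacko for masks 1, 3, 5, ... and vpacke for masks 0, 2, 4, ...) can be illustrated with a small sketch. This is not the Hexagon backend's code: the function name `classify_pack_mask` is hypothetical, and the sketch assumes the "largest possible index" is 2N-1 for a shuffle of two N-element inputs, as in the <32 x i16> example from the commit message.

```python
def classify_pack_mask(mask):
    """Classify a shuffle mask over two N-element inputs (hypothetical helper).

    Returns "vpacko" if the mask is 1, 3, 5, ..., 2N-1,
    "vpacke" if it is 0, 2, 4, ..., 2N-2, and None otherwise.
    N is taken to be the number of mask elements (an assumption).
    """
    n = len(mask)
    if n == 0:
        return None
    indices = list(mask)
    # Odd indices starting at 1, covering both inputs.
    if indices == list(range(1, 2 * n, 2)):
        return "vpacko"
    # Even indices starting at 0, covering both inputs.
    if indices == list(range(0, 2 * n, 2)):
        return "vpacke"
    return None


# Mask from the commit message: <i32 1, i32 3, ..., i32 63> over <32 x i16> inputs.
odd_mask = list(range(1, 64, 2))
print(classify_pack_mask(odd_mask))
```

A mask that skips or repeats an index falls through to None, which in this sketch stands for "leave the shuffle to the generic lowering path".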


332 Commits