llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 13:02:52 +02:00

Author	SHA1	Message	Date
Chandler Carruth	a7b2a4e865	[C++11] Add two range adaptor views to User: operands and operand_values. The first provides a range view over operand Use objects, and the second provides a range view over the Value*s being used by those operands. The naming is "STL-style" rather than "LLVM-style" because we have historically named iterator methods STL-style, and range methods seem to have far more in common with their iterator counterparts than with "normal" APIs. Feel free to bikeshed on this one if you want, I'm happy to change these around if people feel strongly. I've switched code in SROA and LCG to exercise these mostly to ensure they work correctly -- we don't really have an easy way to unittest this and they're trivial. llvm-svn: 202687	2014-03-03 10:42:58 +00:00
Venkatraman Govindaraju	34b3409b1a	[Sparc] Add trap on integer condition codes (Ticc) instructions to Sparc backend. llvm-svn: 202670	2014-03-02 23:39:07 +00:00
Venkatraman Govindaraju	773ff0c22b	[Sparc] Add return/rett instruction to Sparc backend. llvm-svn: 202666	2014-03-02 22:55:53 +00:00
Venkatraman Govindaraju	b64b412386	[Sparc] Add support for decoding jmpl/retl/ret instruction. llvm-svn: 202663	2014-03-02 21:17:44 +00:00
Venkatraman Govindaraju	bb48cf7681	[Sparc] Add fcmpe* instructions to Sparc backend. llvm-svn: 202661	2014-03-02 19:56:19 +00:00
Venkatraman Govindaraju	13d3f28f34	[Sparc] Add VIS instructions to sparc backend. llvm-svn: 202660	2014-03-02 19:31:21 +00:00
Hal Finkel	64680c3ba1	Add a PPC inline asm constraint type for single CR bits Now that the PowerPC backend can track individual CR bits as first-class registers, we should also have a way of allocating them for inline asm statements. Because these registers are only one bit, if an output variable is implicitly cast to a larger integer size, we'll get an any_extend to that larger type (this is part of the existing target-independent logic). As a result, regardless of the size of the output type, only the first bit is meaningful. The constraint identifier "wc" has been chosen for this purpose. Although gcc does not currently support allocating individual CR bits, this identifier choice has been coordinated with the gcc PowerPC team, and will be marked as reserved for this purpose in the gcc constraints.md file. llvm-svn: 202657	2014-03-02 18:23:39 +00:00
Benjamin Kramer	3ac154a395	[C++11] Replace llvm::tie with std::tie. The old implementation is no longer needed in C++11. llvm-svn: 202644	2014-03-02 13:30:33 +00:00
Chandler Carruth	6e284b6c8f	[C++11] Replace LLVM_STATIC_ASSERT with static_assert, we now have access to it on all host toolchains. llvm-svn: 202642	2014-03-02 13:10:45 +00:00
Benjamin Kramer	e4eb1b495f	[C++11] Replace llvm::next and llvm::prior with std::next and std::prev. Remove the old functions. llvm-svn: 202636	2014-03-02 12:27:27 +00:00
Venkatraman Govindaraju	caf11daede	[SparcV9] Adds support for branch on integer register instructions (BPr) and conditional moves on integer register (MOVr/FMOVr). llvm-svn: 202628	2014-03-02 09:46:56 +00:00
Elena Demikhovsky	838b163a58	AVX-512: Fixed extract_vector_elt for v8i1 vector llvm-svn: 202624	2014-03-02 09:19:44 +00:00
Craig Topper	b0056a4ca7	Switch all uses of LLVM_OVERRIDE to just use 'override' directly. llvm-svn: 202621	2014-03-02 09:09:27 +00:00
Craig Topper	c8a0b9e381	Switch all uses of LLVM_FINAL to just use 'final', and remove the macro. llvm-svn: 202618	2014-03-02 08:08:51 +00:00
Venkatraman Govindaraju	e478c48fa6	[Sparc] Add support for parsing branches and conditional move instructions with %fcc1-%fcc3 conditional registers. llvm-svn: 202616	2014-03-02 06:28:15 +00:00
Venkatraman Govindaraju	b02c5bdb65	[Sparc] Make floating point branch instruction formats to accept %fcc0-%fcc1 conditional registers as input. No functionality change. llvm-svn: 202614	2014-03-02 04:43:45 +00:00
Chandler Carruth	db906c8499	[C++11] Switch all uses of the llvm_move macro to use std::move directly, and remove the macro. llvm-svn: 202612	2014-03-02 04:08:41 +00:00
Venkatraman Govindaraju	6aea38dc6d	[Sparc] Add support for parsing fcmp with %fcc registers. llvm-svn: 202610	2014-03-02 03:39:39 +00:00
Alp Toker	e89523ae73	[C++11] Expand and eliminate the LLVM_ENUM_INT_TYPE() macro llvm-svn: 202607	2014-03-02 03:20:38 +00:00
Venkatraman Govindaraju	591cfa469a	[Sparc] Add register class for floating point conditional flags (%fcc0 - %fcc3). llvm-svn: 202604	2014-03-02 02:12:33 +00:00
Venkatraman Govindaraju	1c2ec775cb	[SparcV9] Add support for parsing branch instructions with prediction. llvm-svn: 202602	2014-03-01 22:03:07 +00:00
Hal Finkel	4937443651	Remove extra truncs/exts around i32 bit operations on PPC64 This generalizes the code to eliminate extra truncs/exts around i1 bit operations to also do the same on PPC64 for i32 bit operations. This eliminates a fairly prevalent code wart: int foo(int a) { return a == 5 ? 7 : 8; } On PPC64, because of the extension implied by the ABI, this would generate: cmplwi 0, 3, 5 li 12, 8 li 4, 7 isel 3, 4, 12, 2 rldicl 3, 3, 0, 32 blr where the 'rldicl 3, 3, 0, 32', the extension, is completely unnecessary. At least for the single-BB case (which is all that the DAG combine mechanism can handle), this unnecessary extension is no longer generated. llvm-svn: 202600	2014-03-01 21:36:57 +00:00
Venkatraman Govindaraju	44a4d4b894	[Sparc] Add support for parsing annulled branch instructions. llvm-svn: 202599	2014-03-01 20:08:48 +00:00
Venkatraman Govindaraju	30d1614f94	[Sparc] Add support for parsing sparcv9 instructions addc/subc/addccc/subccc. llvm-svn: 202598	2014-03-01 18:54:52 +00:00
Venkatraman Govindaraju	96169e7898	[Sparc] Add missing ALU instruction patterns. llvm-svn: 202597	2014-03-01 17:51:00 +00:00
Benjamin Kramer	7a2174b3b0	Make helper function static. llvm-svn: 202596	2014-03-01 17:24:40 +00:00
Benjamin Kramer	803ba41365	Now that we have C++11, turn simple functors into lambdas and remove a ton of boilerplate. No intended functionality change. llvm-svn: 202588	2014-03-01 11:47:00 +00:00
Chandler Carruth	435d84ee45	[C++11] Remove the use of LLVM_HAS_RVALUE_REFERENCES from the rest of the core LLVM libraries. llvm-svn: 202582	2014-03-01 09:32:03 +00:00
Venkatraman Govindaraju	9a36f7b46b	[Sparc] Add support to decode unimp instruction. llvm-svn: 202581	2014-03-01 09:28:18 +00:00
Chandler Carruth	df9f7f4aec	[C++11] Remove the R-value reference #if usage from the ADT and Support libraries. It is now always 1 in LLVM builds. llvm-svn: 202580	2014-03-01 09:27:28 +00:00
Venkatraman Govindaraju	043ff79772	[Sparc] Add support to decode negative simm13 operands in the sparc disassembler. llvm-svn: 202578	2014-03-01 09:11:57 +00:00
Venkatraman Govindaraju	fcbe857272	[Sparc] Add support for decoding call instructions in the sparc disassembler. llvm-svn: 202577	2014-03-01 08:30:58 +00:00
Venkatraman Govindaraju	1eb4e172be	[Sparc] Add support to disassemble sparc memory instructions. llvm-svn: 202575	2014-03-01 07:46:33 +00:00
Venkatraman Govindaraju	78e98664c7	Add support for parsing sun-style section flags in ELFAsmParser. llvm-svn: 202573	2014-03-01 06:21:00 +00:00
Venkatraman Govindaraju	9242ded302	[Sparc] Implement writeNopData. Emit actual NOP instruction instead of just filling with zeroes. llvm-svn: 202572	2014-03-01 05:45:09 +00:00
Venkatraman Govindaraju	dd81a65959	[Sparc] Teach SparcAsmParser to emit correct relocations for PIC code. llvm-svn: 202571	2014-03-01 05:07:21 +00:00
Mark Seaborn	83fd697a96	Fix RWMutex to be thread-safe when pthread_rwlock is not available lib/Support/RWMutex.cpp contains an implementation of RWMutex that uses pthread_rwlock, but when pthread_rwlock is not available (such as under NaCl, when using newlib), it silently falls back to using the no-op definition in lib/Support/Unix/RWMutex.inc, which is not thread-safe. Fix this case to be thread-safe by using a normal mutex. Differential Revision: http://llvm-reviews.chandlerc.com/D2892 llvm-svn: 202570	2014-03-01 04:30:32 +00:00
Venkatraman Govindaraju	a60509dc31	[Sparc] 80 column rule. No functionality change. llvm-svn: 202565	2014-03-01 02:28:34 +00:00
Venkatraman Govindaraju	789e2fd1b7	[Sparc] Add support for parsing directives in SparcAsmParser. llvm-svn: 202564	2014-03-01 02:18:04 +00:00
Venkatraman Govindaraju	439a7d90a6	[Sparc] Emit 'restore' instead of 'restore %g0, %g0, %g0'. This improves the readability of the generated code. llvm-svn: 202563	2014-03-01 01:04:26 +00:00
Manman Ren	fe531c446c	SpillPlacement: fix a bug in iterate. Inside iterate, we scan backwards then scan forwards in a loop. When iteration is not zero, the last node was just updated so we can skip it. But when iteration is zero, we can't skip the last node. For the testing case, fixing this will save a spill and move register copies from hot path to cold path. llvm-svn: 202557	2014-02-28 23:05:31 +00:00
Reid Kleckner	4698f3f991	Reflow isProfitableToMakeFastCC llvm-svn: 202555	2014-02-28 22:50:08 +00:00
Lang Hames	06b78004a4	Jumped the gun with r202551 and broke some bots that weren't yet C++11ified. Reverting until the C++11 switch is complete. llvm-svn: 202554	2014-02-28 22:44:44 +00:00
Lang Hames	e6a310e01a	New PBQP solver, and updates to the PBQP graph. The previous PBQP solver was very robust but consumed a lot of memory, performed a lot of redundant computation, and contained some unnecessarily tight coupling that prevented experimentation with novel solution techniques. This new solver is an attempt to address these shortcomings. Important/interesting changes: 1) The domain-independent PBQP solver class, HeuristicSolverImpl, is gone. It is replaced by a register allocation specific solver, PBQP::RegAlloc::Solver (see RegAllocSolver.h). The optimal reduction rules and the backpropagation algorithm have been extracted into stand-alone functions (see ReductionRules.h), which can be used to build domain specific PBQP solvers. This provides many more opportunities for domain-specific knowledge to inform the PBQP solvers' decisions. In theory this should allow us to generate better solutions. In practice, we can at least test out ideas now. As a side benefit, I believe the new solver is more readable than the old one. 2) The solver type is now a template parameter of the PBQP graph. This allows the graph to notify the solver of any modifications made (e.g. by domain independent rules) without the overhead of a virtual call. It also allows the solver to supply policy information to the graph (see below). 3) Significantly reduced memory overhead. Memory management policy is now an explicit property of the PBQP graph (via the CostAllocator typedef on the graph's solver template argument). Because PBQP graphs for register allocation tend to contain many redundant instances of single values (E.g. the value representing an interference constraint between GPRs), the new RASolver class uses a uniquing scheme. This massively reduces memory consumption for large register allocation problems. For example, looking at the largest interference graph in each of the SPEC2006 benchmarks (the largest graph will always set the memory consumption high-water mark for PBQP), the average memory reduction for the PBQP costs was 400x. That's times, not percent. The highest was 1400x. Yikes. So - this is fixed. "PBQP: No longer feasting upon every last byte of your RAM". Minor details: - Fully C++11'd. Never copy-construct another vector/matrix! - Cute tricks with cost metadata: Metadata that is derived solely from cost matrices/vectors is attached directly to the cost instances themselves. That way if you unique the costs you never have to recompute the metadata. 400x less memory means 400x less cost metadata (re)computation. Special thanks to Arnaud de Grandmaison, who has been the source of much encouragement, and of many very useful test cases. This new solver forms the basis for future work, of which there's plenty to do. I will be adding TODO notes shortly. - Lang. llvm-svn: 202551	2014-02-28 22:25:24 +00:00
Eric Christopher	d593a35066	Fix >> to be > > for non-c++11. llvm-svn: 202545	2014-02-28 21:37:28 +00:00
Tom Stellard	f4a7aeb04e	R600: Verify all instructions in the AsmPrinter on debug builds Make a call to R600's implementation of verifyInstruction() to check that instructions are only using legal operands. llvm-svn: 202544	2014-02-28 21:36:41 +00:00
Tom Stellard	6280afdecd	R600/SI: Expand all v16[if]32 operations llvm-svn: 202543	2014-02-28 21:36:37 +00:00
Eric Christopher	91e928b9a6	80-col. llvm-svn: 202541	2014-02-28 21:27:59 +00:00
Eric Christopher	5e2c142a79	Fix a crasher where when we're attempting to replace a type during the finalization for CGDebugInfo in clang we would RAUW a type and it would result in a corrupted MDNode for an imported declaration. Testcase pending as reducing has been difficult. llvm-svn: 202540	2014-02-28 21:27:57 +00:00
Justin Bogner	56a8b49ffd	CommandLine: Exit successfully for -version and -help Tools that use the CommandLine library currently exit with an error when invoked with -version or -help. This is unusual and non-standard, so we'll fix them to exit successfully instead. I don't expect that anyone relies on the current behaviour, so this should be a fairly safe change. llvm-svn: 202530	2014-02-28 19:08:01 +00:00
Zoran Jovanovic	9c1887bef4	Fixed operand of SC microMIPS instruction. llvm-svn: 202526	2014-02-28 18:22:56 +00:00
Zoran Jovanovic	ebb68d0712	Fixed encoding of SYSCALL microMIPS instruction. llvm-svn: 202523	2014-02-28 18:17:08 +00:00
Zoran Jovanovic	d914bd8ae2	Revert revision 202518 because of wrong commit message. llvm-svn: 202521	2014-02-28 18:14:16 +00:00
Zoran Jovanovic	43ca53260b	Fix operand of SC instruction. llvm-svn: 202518	2014-02-28 18:02:17 +00:00
Evgeniy Stepanov	da60e2a9aa	X86Operand is extracted into individual header. X86Operand is extracted into individual header, because it allows to create an arbitrary memory operand and append it to MCInst. It'll be reused in X86 inline assembly instrumentation. Patch by Yuri Gorshenin. llvm-svn: 202496	2014-02-28 12:28:07 +00:00
NAKAMURA Takumi	ee6de3fa2e	Reorder Mips/MCTargetDesc/CMakeLists.txt. llvm-svn: 202483	2014-02-28 10:18:21 +00:00
Sasa Stankovic	1eac2858b7	[mips] Add MipsNaClELFStreamer.cpp to CMakeLists.txt. llvm-svn: 202482	2014-02-28 10:14:12 +00:00
Sasa Stankovic	b0018b8bdb	[mips] Implement NaCl sandboxing of indirect jumps: * Align targets of indirect jumps to instruction bundle boundaries (in MI layer). * Add masking instructions before indirect jumps (in MC layer). Differential Revision: http://llvm-reviews.chandlerc.com/D2847 llvm-svn: 202479	2014-02-28 10:00:38 +00:00
Tobias Grosser	803bee29dd	Add 'remark' diagnostic type in LLVM A 'remark' is information that is not an error or a warning, but rather some additional information provided to the user. In contrast to a 'note' a 'remark' is an independent diagnostic, whereas a 'note' always depends on another diagnostic. A typical use case for remark nodes is information provided to the user, e.g. information provided by the vectorizer about loops that have been vectorized. llvm-svn: 202474	2014-02-28 09:08:45 +00:00
Hal Finkel	1970087008	Swap PPC isel operands to allow for 0-folding The PPC isel instruction can fold 0 into the first operand (thus eliminating the need to materialize a zero-containing register when the 'true' result of the isel is 0). When the isel is fed by a bit register operation that we can invert, do so as part of the bit-register-operation peephole routine. llvm-svn: 202469	2014-02-28 06:11:16 +00:00
Rafael Espindola	e3d6551059	Now that it is possible, use the mangler in IRObjectFile. A really simple patch marks the end of a lot of yak shaving :-) llvm-svn: 202463	2014-02-28 02:17:23 +00:00
Hal Finkel	3bd3f4e287	Trying to unbreak the darwin11 builder The CR bit tracking code broke PPC/Darwin; trying to get it working again... (the darwin11 builder, which defaults to the darwin ABI when running PPC tests, asserted when running test/CodeGen/PowerPC/inverted-bool-compares.ll) llvm-svn: 202459	2014-02-28 01:17:25 +00:00
Hal Finkel	94f3724df6	Try to unbreak the C++11 build Cannot use negative numbers in case statements without running afoul of -Wc++11-narrowing. llvm-svn: 202455	2014-02-28 00:45:27 +00:00
Hal Finkel	883c64377d	Add CR-bit tracking to the PowerPC backend for i1 values This change enables tracking i1 values in the PowerPC backend using the condition register bits. These bits can be treated on PowerPC as separate registers; individual bit operations (and, or, xor, etc.) are supported. Tracking booleans in CR bits has several advantages: - Reduction in register pressure (because we no longer need GPRs to store boolean values). - Logical operations on booleans can be handled more efficiently; we used to have to move all results from comparisons into GPRs, perform promoted logical operations in GPRs, and then move the result back into condition register bits to be used by conditional branches. This can be very inefficient, because the throughput of these CR <-> GPR moves have high latency and low throughput (especially when other associated instructions are accounted for). - On the POWER7 and similar cores, we can increase total throughput by using the CR bits. CR bit operations have a dedicated functional unit. Most of this is more-or-less mechanical: Adjustments were needed in the calling-convention code, support was added for spilling/restoring individual condition-register bits, and conditional branch instruction definitions taking specific CR bits were added (plus patterns and code for generating bit-level operations). This is enabled by default when running at -O2 and higher. For -O0 and -O1, where the ability to debug is more important, this feature is disabled by default. Individual CR bits do not have assigned DWARF register numbers, and storing values in CR bits makes them invisible to the debugger. It is critical, however, that we don't move i1 values that have been promoted to larger values (such as those passed as function arguments) into bit registers only to quickly turn around and move the values back into GPRs (such as happens when values are returned by functions). A pair of target-specific DAG combines are added to remove the trunc/extends in: trunc(binary-ops(binary-ops(zext(x), zext(y)), ...) and: zext(binary-ops(binary-ops(trunc(x), trunc(y)), ...) In short, we only want to use CR bits where some of the i1 values come from comparisons or are used by conditional branches or selects. To put it another way, if we can do the entire i1 computation in GPRs, then we probably should (on the POWER7, the GPR-operation throughput is higher, and for all cores, the CR <-> GPR moves are expensive). POWER7 test-suite performance results (from 10 runs in each configuration): SingleSource/Benchmarks/Misc/mandel-2: 35% speedup MultiSource/Benchmarks/Prolangs-C++/city/city: 21% speedup MultiSource/Benchmarks/MiBench/automotive-susan: 23% speedup SingleSource/Benchmarks/CoyoteBench/huffbench: 13% speedup SingleSource/Benchmarks/Misc-C++/Large/sphereflake: 13% speedup SingleSource/Benchmarks/Misc-C++/mandel-text: 10% speedup SingleSource/Benchmarks/Misc-C++-EH/spirit: 10% slowdown MultiSource/Applications/lemon/lemon: 8% slowdown llvm-svn: 202451	2014-02-28 00:27:01 +00:00
Hal Finkel	41d36f7b16	Fix visitTRUNCATE for legal i1 values This extract-and-trunc vector optimization cannot work for i1 values as currently implemented, and so I'm disabling this for now for i1 values. In the future, this can be fixed properly. Soon I'll commit support for i1 CR bit tracking in the PowerPC backend, and this will be covered by one of the existing regression tests. llvm-svn: 202449	2014-02-28 00:26:45 +00:00
Andrew Trick	a8fdecaea7	Provide a target override for the latest regalloc heuristic. This is a temporary workaround for native arm linux builds: PR18996: Changing regalloc order breaks "lencod" on native arm linux builds. llvm-svn: 202433	2014-02-27 21:37:33 +00:00
Roman Divacky	f36febf578	Lower FNEG just like FABS to fneg[ds] and fmov[ds], thus avoiding expensive libcall. Also, Qp_neg is not implemented on at least FreeBSD. This is also what gcc is doing. llvm-svn: 202422	2014-02-27 19:26:29 +00:00
Eric Christopher	a35169d497	Revert r201751 and solve the const problem a different way - by making the cache mutable. llvm-svn: 202417	2014-02-27 18:36:10 +00:00
Adrian Prantl	d7f77dd966	Debug info: Remove ARMAsmPrinter::EmitDwarfRegOp(). AsmPrinter can now scan the register file for sub- and super-registers. No functionality change intended. (Tests are updated because the comments in the assembler output are different.) llvm-svn: 202416	2014-02-27 17:56:08 +00:00
Richard Osborne	947c19eaa0	[XCore] Support functions returning more than 4 words. If a function returns a large struct by value return the first 4 words in registers and the rest on the stack in a location reserved by the caller. This is needed to support the xC language which supports functions returning an arbitrary number of return values. This is r202397 reapplied with a fix to avoid an uninitialized read of a member. llvm-svn: 202414	2014-02-27 17:47:54 +00:00
Richard Osborne	f1c5c83f06	[XCore] Make LowerCallResult a static function. No functionality change. This is r202396 reapplied with no changes. llvm-svn: 202413	2014-02-27 17:47:48 +00:00
Rafael Espindola	7a4b8493b1	Remove MCPureStreamer. We moved MCJIT to use native object formats a long time ago and R600 now uses ELF, so it was dead. llvm-svn: 202408	2014-02-27 16:17:34 +00:00
Alexander Kornienko	4ec82215fc	Re-apply r200853, which should not crash after Clang plugins were converted to loadable modules in r201256. llvm-svn: 202404	2014-02-27 14:47:37 +00:00
Richard Osborne	f8fb4e8a7f	Revert r202396, r202397. These are causing test failures, revert for now. llvm-svn: 202398	2014-02-27 14:24:13 +00:00
Richard Osborne	cb6866dfec	[XCore] Support functions returning more than 4 words. Summary: If a function returns a large struct by value return the first 4 words in registers and the rest on the stack in a location reserved by the caller. This is needed to support the xC language which supports functions returning an arbitrary number of return values. Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2889 llvm-svn: 202397	2014-02-27 14:00:40 +00:00
Richard Osborne	35b73c788e	[XCore] Make LowerCallResult a static function. No functionality change. llvm-svn: 202396	2014-02-27 14:00:34 +00:00
Richard Osborne	5ac74685fd	[XCore] Target optimized library function __memcpy_4() Summary: If the src, dst and size of a memcpy are known to be 4 byte aligned we can call __memcpy_4() instead of memcpy(). Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2871 llvm-svn: 202395	2014-02-27 13:39:07 +00:00
Richard Osborne	75c16f2bf4	[XCore] Add dag combines for instructions that ignore some input bits. These instructions ignore the high bits of one of their input operands - try and use this to simplify the code. llvm-svn: 202394	2014-02-27 13:20:11 +00:00
Richard Osborne	f815df9c6e	[XCore] Provide information about known zero bits of resource instructions. llvm-svn: 202393	2014-02-27 13:20:06 +00:00
Kostya Serebryany	1d5ff228f0	[asan] fix a pair of silly typos llvm-svn: 202391	2014-02-27 13:13:59 +00:00
Kostya Serebryany	71b9995519	[asan] disable asan-detect-invalid-pointer-pair (was enabled by mistake) llvm-svn: 202390	2014-02-27 12:56:20 +00:00
Kostya Serebryany	089f21bdde	[asan] experimental implementation of invalid-pointer-pair detector (finds when two unrelated pointers are compared or subtracted). This implementation has both false positives and false negatives and is not tuned for performance. A bug report for a proper implementation will follow. llvm-svn: 202389	2014-02-27 12:45:36 +00:00
Eric Christopher	90c684ef81	Don't emit anything into the debug_ranges section if we aren't emitting any ranges - this includes CU ranges where we were previously emitting an end list marker even if we didn't have a list. Testcase includes a test for line table only code emission as the problem was noticed while writing this test. llvm-svn: 202357	2014-02-27 07:44:45 +00:00
Craig Topper	6d9ad3a694	[X86] Fix Uses/Defs lists for INS, OUTS, SCAS, CMPS, LODS llvm-svn: 202348	2014-02-27 05:08:25 +00:00
Craig Topper	e406fab3df	[X86] Add RAX/EAX/AX Uses/Defs to XCHG RAX/EAX/AX instructions. llvm-svn: 202347	2014-02-27 04:27:00 +00:00
Craig Topper	4fcab63947	[X86] Add RAX/EAX/AX/AL Uses/Defs to the absolute memory location move instructions. Patch by Florian Lukas with some additional instructions fixed by me. Fixes PR18975. llvm-svn: 202345	2014-02-27 04:07:57 +00:00
Craig Topper	85c1025ceb	Fix odd indentation. llvm-svn: 202342	2014-02-27 03:11:13 +00:00
Ben Langmuir	6633cebb29	Revert "Use StringRef in raw_fd_ostream constructor" This reverts commit r202225, which may cause a performance regression. llvm-svn: 202338	2014-02-27 02:09:10 +00:00
Michel Danzer	8edacce1de	R600/SI: Optimize SI_KILL for constant operands If the SI_KILL operand is constant, we can either clear the exec mask if the operand is negative, or do nothing otherwise. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202337	2014-02-27 01:47:09 +00:00
Michel Danzer	0ddce64f7c	R600/SI: Allow SI_KILL for geometry shaders Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202336	2014-02-27 01:47:02 +00:00
Eric Christopher	2a887e3673	If we're only emitting line tables for a particular CU then don't add any ranges to the list of ranges for the CU as we don't want to emit them anyway. This ensures that we will still emit ranges if we have a compile unit compiled with only line tables and one compiled with full debug info requested (we'll emit for the one with full debug info). Update testcase metadata accordingly to continue emitting ranges. llvm-svn: 202333	2014-02-27 01:25:00 +00:00
Eric Christopher	b69957cb27	Add a debug info code generation level to the compile unit metadata and update everything accordingly. This can be used to conditionalize the amount of output in the backend based on the amount of debug requested/metadata emission scheme by a front end (e.g. clang). Paired with a commit to clang. llvm-svn: 202332	2014-02-27 01:24:56 +00:00
Adrian Prantl	f3438e470c	Fix a type error that crept into r202313. llvm-svn: 202317	2014-02-26 23:46:39 +00:00
Eric Christopher	1b1ce405c8	Remove unnecessary llvm:: qualification. llvm-svn: 202316	2014-02-26 23:27:16 +00:00
Adrian Prantl	4255f02083	Debug info: Refactor AsmPrinter::EmitDwarfRegOp to make the control flow more obvious. llvm-svn: 202313	2014-02-26 23:03:37 +00:00
Matt Arsenault	4ecfd35fdd	R600: Remove unnecessary build_vector pattern. It is already fully handled in AMDGPUISelDAGToDAG. llvm-svn: 202312	2014-02-26 23:00:58 +00:00
Andrew Trick	ba61c4e6cf	Add a limit to the heuristic that register allocates instructions in local order. This handles pathological cases in which we see 2x increase in spill code for large blocks (~50k instructions). I don't have a unit test for this behavior. Fixes rdar://16072279. llvm-svn: 202304	2014-02-26 22:07:26 +00:00
Quentin Colombet	e639a79f72	Lower unsigned vsetcc to psubus in certain cases The current approach to lower a vsetult is to flip the sign bit of the operands, swap the operands and then use a (signed) pcmpgt. psubus (unsigned saturating subtract) can be used to emulate a vsetult more efficiently: + case ISD::SETULT: { + // If the comparison is against a constant we can turn this into a + // setule. With psubus, setule does not require a swap. This is + // beneficial because the constant in the register is no longer + // destructed as the destination so it can be hoisted out of a loop. I also enable lowering via psubus in a few other cases where it's clearly beneficial: setule and setuge if minu/maxu cannot be used. rdar://problem/14338765 Patch by Adam Nemet <anemet@apple.com>. llvm-svn: 202301	2014-02-26 21:39:12 +00:00
Aaron Ballman	9e5315239d	Silencing an MSVC signed comparison warning. llvm-svn: 202295	2014-02-26 20:22:20 +00:00
Hal Finkel	e0ccbea861	Fix the aggressive anti-dep breaker's subregister definition handling The aggressive anti-dependency breaker scans instructions, bottom-up, within the scheduling region in order to find opportunities where register renaming can be used to break anti-dependencies. Unfortunately, the aggressive anti-dep breaker was treating a register definition as defining all of that register's aliases (including super registers). This behavior is incorrect when the super register is live and there are other definitions of subregisters of the super register. For example, given the following sequence: %CR2EQ<def> = CROR %CR3UN, %CR3UN<kill> %CR2GT<def> = IMPLICIT_DEF %X4<def> = MFOCRF8 %CR2 the analysis of the first subregister definition would work as expected: Anti: %CR2GT<def> = IMPLICIT_DEF Def Groups: CR2GT=g194->g0(via CR2) Antidep reg: CR2GT (zero group) Use Groups: but the analysis of the second one would not: Anti: %CR2EQ<def> = CROR %CR3UN, %CR3UN<kill> Def Groups: CR2EQ=g195 Antidep reg: CR2EQ Rename Candidates for Group g195: ... because, when processing the %CR2GT<def>, we'd mark all super registers of %CR2GT (%CR2 in this case) as defined. As a result, when processing %CR2EQ<def>, %CR2 no longer appears to be live, and %CR2EQ<def>'s group is not %unioned with the %CR2 group. I don't have an in-tree test case for this yet (and even if I did, I don't have a small one). llvm-svn: 202294	2014-02-26 20:20:30 +00:00

1 2 3 4 5 ...

67439 Commits