llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Ahmed Bougacha	f08562247d	[TableGen][DAGISel] Dedup predicates with same code to run. NFCI. I locally hit the 255 limit, but a lot of these are redundant: each predicate coming from a different record was allocated a new number, even when we already emitted the same code for another predicate. Instead, re-use numbers and emit the predicate code only once. This reduces the total text size of *DAGISel.cpp.o by ~1%. llvm-svn: 246208	2015-08-27 20:43:34 +00:00
Tyler Nowicki	44e6771caf	Fix test introduced in r246187 that failed on some systems. llvm-svn: 246207	2015-08-27 20:43:29 +00:00
Lang Hames	33230c9911	Oops - Re-add the Kaleidoscope regression tests themselves (accidentally left out of r246201). llvm-svn: 246203	2015-08-27 20:33:22 +00:00
Lang Hames	9cc11d6337	Recommit r246175 - Add Kaleidoscope regression tests, with a fix to make sure the kaleidoscope 'library' functions aren't dead-stripped in release builds. llvm-svn: 246201	2015-08-27 20:31:44 +00:00
Erik Schnetter	a3e8c48df2	Enable constant propagation for more math functions Constant propagation for single precision math functions (such as tanf) is already working, but was not enabled. This patch enables these for many single-precision functions, and adds respective test cases. Newly handled functions: acosf asinf atanf atan2f ceilf coshf expf exp2f fabsf floorf fmodf logf log10f powf sinhf tanf tanhf llvm-svn: 246194	2015-08-27 19:56:57 +00:00
Erik Schnetter	3def1ebc0a	Revert 246186; still breaks on some systems llvm-svn: 246191	2015-08-27 19:34:14 +00:00
Tyler Nowicki	49268c1eff	Improve vectorization diagnostic messages and extend vectorize(enable) pragma. This patch changes the analysis diagnostics produced when loops with floating-point recurrences or memory operations are identified. The new messages say "cannot prove it is safe to reorder * operations; allow reordering by specifying #pragma clang loop vectorize(enable)". Depending on the type of diagnostic the message will include additional options such as ffast-math or __restrict__. This patch also allows the vectorize(enable) pragma to override the low pointer memory check threshold. When the hint is given a higher threshold is used. See the clang patch for the options produced for each diagnostic. llvm-svn: 246187	2015-08-27 18:56:49 +00:00
Erik Schnetter	4fbf515263	Enable constant propagation for more math functions Constant propagation for single precision math functions (such as tanf) is already working, but was not enabled. This patch enables these for many single-precision functions, and adds respective test cases. Newly handled functions: acosf asinf atanf atan2f ceilf coshf expf exp2f fabsf floorf fmodf logf log10f powf sinhf tanf tanhf llvm-svn: 246186	2015-08-27 18:56:23 +00:00
Lang Hames	fad915524f	Revert r246175 to get builder green again. llvm-svn: 246185	2015-08-27 18:54:41 +00:00
Ahmed Bougacha	a13a2b31c7	[TableGen] Remove dead code. NFC. The only user of this was removed in r129670. llvm-svn: 246176	2015-08-27 18:14:21 +00:00
Lang Hames	6a2c50847c	Add Kaleidoscope regression tests. These will be run if LLVM_BUILD_EXAMPLES is enabled. llvm-svn: 246175	2015-08-27 18:13:34 +00:00
Matt Arsenault	1fb24b1ab8	AMDGPU/SI: Add test for folding constants into operands Patch by Axel Davy llvm-svn: 246167	2015-08-27 17:41:27 +00:00
Erik Schnetter	d46c97a451	Revert r246158 since it breaks LLVM.Transforms/ConstProp.calls.ll llvm-svn: 246166	2015-08-27 17:24:01 +00:00
Jonathan Roelofs	3734bdda85	Fix a case of `CHECK[^:]*$`. http://reviews.llvm.org/D11917 llvm-svn: 246163	2015-08-27 17:03:14 +00:00
Erik Schnetter	01b21a7411	Enable constant propagation for more math functions Constant propagation for single precision math functions (such as tanf) is already working, but was not enabled. This patch enables these for many single-precision functions, and adds respective test cases. Newly handled functions: acosf asinf atanf atan2f ceilf coshf expf exp2f fabsf floorf fmodf logf log10f powf sinhf tanf tanhf llvm-svn: 246158	2015-08-27 16:36:37 +00:00
NAKAMURA Takumi	6c960875f9	[CMake] OBJLIB-ize -tblgen. This improves dependency chain of; (LLVMSupport && LLVMTableGen) && (.cpp in -tblgen) && (linking -tblgen) with; (LLVMSupport && LLVMTableGen && .cpp) && (linking -tblgen) llvm-svn: 246156	2015-08-27 16:10:47 +00:00
Chad Rosier	a4f654dac4	[LoopVectorize] Move test from r246149 into a target-specific folder to appease bots. llvm-svn: 246154	2015-08-27 15:24:47 +00:00
NAKAMURA Takumi	209218a51c	[CMake] Let ExceptionDemo buildable with ENABLE_EH. llvm-svn: 246152	2015-08-27 15:13:14 +00:00
Davide Italiano	aaf02e450d	[llvm-readobj] Add support for dumping MachO min version load command. Example output: File: <stdin> Format: Mach-O arm Arch: arm AddressSize: 32bit MinVersion { Cmd: LC_VERSION_MIN_IPHONEOS Size: 16 Version: 99.8.7 SDK: n/a } Differential Revision: http://reviews.llvm.org/D12373 llvm-svn: 246151	2015-08-27 15:11:32 +00:00
Chad Rosier	6e3e56c088	[LoopVectorize] Add Support for Small Size Reductions. Unlike scalar operations, we can perform vector operations on element types that are smaller than the native integer types. We type-promote scalar operations if they are smaller than a native type (e.g., i8 arithmetic is promoted to i32 arithmetic on Arm targets). This patch detects and removes type-promotions within the reduction detection framework, enabling the vectorization of small size reductions. In the legality phase, we look through the ANDs and extensions that InstCombine creates during promotion, keeping track of the smaller type. In the profitability phase, we use the smaller type and ignore the ANDs and extensions in the cost model. Finally, in the code generation phase, we truncate the result of the reduction to allow InstCombine to rewrite the entire expression in the smaller type. This fixes PR21369. http://reviews.llvm.org/D12202 Patch by Matt Simpson <mssimpso@codeaurora.org>! llvm-svn: 246149	2015-08-27 14:12:17 +00:00
James Molloy	d7310f7b46	[LoopVectorize] Extract InductionInfo into a helper class... ... and move it into LoopUtils where it can be used by other passes, just like ReductionDescriptor. The API is very similar to ReductionDescriptor - that is, not very nice at all. Sorting these both out will come in a followup. NFC llvm-svn: 246145	2015-08-27 09:53:00 +00:00
Alex Rosenberg	e7cdadc960	Whoops, remove trailing whitespace. llvm-svn: 246141	2015-08-27 05:37:12 +00:00
Pete Cooper	7ecbe8117c	isKnownNonNull needs to consider globals in non-zero address spaces. Globals in address spaces other than one may have 0 as a valid address, so we should not assume that they can be null. Reviewed by Philip Reames. llvm-svn: 246137	2015-08-27 03:16:29 +00:00
Philip Reames	d2d22a2dd9	Allow value forwarding past release fences in EarlyCSE A release fence acts as a publication barrier for stores within the current thread to become visible to other threads which might observe the release fence. It does not require the current thread to observe stores performed on other threads. As a result, we can allow store-load and load-store forwarding across a release fence. We do need to make sure that stores before the fence can't be eliminated even if there's another store to the same location after the fence. In theory, we could reorder the second store above the fence and then eliminate the former, but we can't do this if the stores are on opposite sides of the fence. Note: While more aggressive then what's there, this patch is still implementing a really conservative ordering. In particular, I'm not trying to exploit undefined behavior via races, or the fact that the LangRef says only 'atomic' accesses are ordered w.r.t. fences. Differential Revision: http://reviews.llvm.org/D11434 llvm-svn: 246134	2015-08-27 01:32:33 +00:00
Philip Reames	258ea49e29	[RewriteStatepointsForGC] Reduce the number of new instructions for base pointers When computing base pointers, we introduce new instructions to propagate the base of existing instructions which might not be bases. However, the algorithm doesn't make any effort to recognize when the new instruction to be inserted is the same as an existing one already in the IR. Since this is happening immediately before rewriting, we don't really have a chance to fix it after the pass runs without teaching loop passes about statepoints. I'm really not thrilled with this patch. I've rewritten it 4 different ways now, but this is the best I've come up with. The case where the new instruction is just the original base defining value could be merged into the existing algorithm with some complexity. The problem is that we might have something like an extractelement from a phi of two vectors. It may be trivially obvious that the base of the 0th element is an existing instruction, but I can't see how to make the algorithm itself figure that out. Thus, I resort to the call to SimplifyInstruction instead. Note that we can only adjust the instructions we've inserted ourselves. The live sets are still being tracked in side structures at this point in the code. We can't easily muck with instructions which might be in them. Long term, I'm really thinking we need to materialize the live pointer sets explicitly in the IR somehow rather than using side structures to track them. Differential Revision: http://reviews.llvm.org/D12004 llvm-svn: 246133	2015-08-27 01:02:28 +00:00
Tyler Nowicki	35ff72ec4d	Improved printing of analysis diagnostics in the loop vectorizer. This patch ensures that every analysis diagnostic produced by the vectorizer will be printed if the loop has a vectorization hint on it. The condition has also been improved to prevent printing when a disabling hint is specified. llvm-svn: 246132	2015-08-27 01:02:04 +00:00
Cong Hou	ccb5362a50	Fixed a bug that edge weights are not assigned correctly when lowering switch statement. This is a one-line-change patch that moves the update to UnhandledWeights to the correct position: it should be updated for all clusters instead of just range clusters. Differential Revision: http://reviews.llvm.org/D12391 llvm-svn: 246129	2015-08-27 00:37:40 +00:00
NAKAMURA Takumi	79e549d8b8	Kaleidoscope: Prune unused libdeps. llvm-svn: 246126	2015-08-27 00:04:24 +00:00
Philip Reames	1e50c760b7	[SimplifyCFG] Prune code from a provably unreachable switch default As Sanjoy pointed out over in http://reviews.llvm.org/D11819, a switch on an icmp should always be able to become a branch instruction. This patch generalizes that notion slightly to prove that the default case of a switch is unreachable if the cases completely cover all possible bit patterns in the condition. Once that's done, the switch to branch conversion kicks in just fine. Note: Duplicate case values are disallowed by the LangRef and verifier. Differential Revision: http://reviews.llvm.org/D11995 llvm-svn: 246125	2015-08-26 23:56:46 +00:00
Hal Finkel	719ae331a6	[PowerPC] Remove unnecessary braces in PPCVSXFMAMutate Address Eric's post-commit review of r245741. NFC. llvm-svn: 246121	2015-08-26 23:41:53 +00:00
Bjarke Hammersholt Roune	28d5088cb1	[NVPTX] Let NVPTX backend detect integer min and max patterns. Summary: Let NVPTX backend detect integer min and max patterns during isel and emit intrinsics that enable hardware support. Reviewers: jholewinski, meheff, jingyue Subscribers: arsenm, llvm-commits, meheff, jingyue, eliben, jholewinski Differential Revision: http://reviews.llvm.org/D12377 llvm-svn: 246107	2015-08-26 23:22:02 +00:00
Cong Hou	1eb3aab19c	[ARM] Use BranchProbability::scale() to scale an integer with a probability in ARMBaseInstrInfo.cpp, Previously in isProfitableToIfCvt() in ARMBaseInstrInfo.cpp, the multiplication between an integer and a branch probability is done manually in an unsafe way that may lead to overflow. This patch corrects those cases by using BranchProbability's member function scale() to avoid overflow (which stores the intermediate result in int64). Differential Revision: http://reviews.llvm.org/D12295 llvm-svn: 246106	2015-08-26 23:17:52 +00:00
Cong Hou	9a38b9833b	Assign weights to edges to jump table / bit test header when lowering switch statement. Currently, when lowering switch statement and a new basic block is built for jump table / bit test header, the edge to this new block is not assigned with a correct weight. This patch collects the edge weight from all its successors and assign this sum of weights to the edge (and also the other fall-through edge). Test cases are adjusted accordingly. Differential Revision: http://reviews.llvm.org/D12166#fae6eca7 llvm-svn: 246104	2015-08-26 23:15:32 +00:00
Philip Reames	080a182fda	[docs][Statepoints] More on base pointers Expand the information on base pointers to include an example, the assumptions a collector is allowed to make, legal optimizations over gc.relocates, and the assumptions made by RewriteStatepointsForGC. This is the result of a recent conversation with folks from LLIC and the confusions that came to light therein. llvm-svn: 246103	2015-08-26 23:13:35 +00:00
JF Bastien	2f36851fb3	WebAssembly: NFC comment update llvm-svn: 246101	2015-08-26 23:03:07 +00:00
Duncan P. N. Exon Smith	ab788bc6cf	DI: Make Subprogram definitions 'distinct' Change `DIBuilder` always to produce 'distinct' nodes when creating `DISubprogram` definitions. I measured a ~5% memory improvement in the link step (of ld64) when using `-flto -g`. `DISubprogram`s are used in two ways in the debug info graph. Some are definitions, point at actual functions, and can't really be shared between compile units. With full debug info, these point down at their variables, forming uniquing cycles. These uniquing cycles are expensive to link between modules, since all unique nodes that reference them transitively need to be duplicated (see commit message for r244181 for more details). Others are declarations, primarily used for member functions in the type hierarchy. Definitions never show up there; instead, a definition points at its corresponding declaration node. I started by making all subprograms 'distinct'. However, that was too big a hammer: memory usage increased ~5% (net increase vs. this patch of ~10%) because the 'distinct' declarations undermine LTO type uniquing. This is a targeted fix for the definitions (where uniquing is an observable problem). A couple of notes: - There's an accompanying commit to update IRGen testcases in clang. - ^ That's what I'm using to test this commit. - In a follow-up, I'll change the verifier to require 'distinct' on definitions and add an upgrade to `BitcodeReader`. llvm-svn: 246098	2015-08-26 22:50:16 +00:00
JF Bastien	b2ba1c1ecc	WebAssembly: handle private/internal globals. Things of note: - Other linkage types aren't handled yet. We'll figure it out with dynamic linking. - Special LLVM globals are either ignored, or error out for now. - TLS isn't supported yet (WebAssembly will have threads later). - There currently isn't a syntax for alignment, I left it in a comment so it's easy to hook up. - Undef is convereted to whatever the type's appropriate null value is. - assert versus report_fatal_error: follow what other AsmPrinters do, and assert only on what should have been caught elsewhere. llvm-svn: 246092	2015-08-26 22:09:54 +00:00
Reid Kleckner	bbad92826d	[ms-inline-asm] Relax assertion around funky identifiers slightly A corresponding clang change will make it so that clang can consume part of an assembler token. The assembler treats '.' as an identifier character while clang does not, so it's view of the token stream is a little different. llvm-svn: 246089	2015-08-26 21:57:25 +00:00
Kostya Serebryany	0e83baec1a	[libFuzzer] fix minor inefficiency, PR24584 llvm-svn: 246087	2015-08-26 21:55:19 +00:00
Mehdi Amini	8a71899819	Fix LLVM C API for DataLayout We removed access to the DataLayout on the TargetMachine and deprecated the C API function LLVMGetTargetMachineData() in r243114. However the way I tried to be backward compatible was broken: I changed the wrapper of the TargetMachine to be a structure that includes the DataLayout as well. However the TargetMachine is also wrapped by the ExecutionEngine, in the more classic way. A client using the TargetMachine wrapped by the ExecutionEngine and trying to get the DataLayout would break. It seems tricky to solve the problem completely in the C API implementation. This patch tries to address this backward compatibility in a more lighter way in the C++ API. The C API is restored in its original state and the removed C++ API is reintroduced, but privately. The C API is friended to the TargetMachine and should be the only consumer for this API. Reviewers: ributzka Differential Revision: http://reviews.llvm.org/D12263 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 246082	2015-08-26 21:16:29 +00:00
Lang Hames	116085be9e	[Kaleidoscope] Fix a typo in Chapter 5. llvm-svn: 246081	2015-08-26 20:57:03 +00:00
Matt Arsenault	ec9bd6fd7e	AMDGPU: Delete dead code There is no context where s_mov_b64 is emitted and could potentially be moved to the VALU. It is currently only emitted for materializing immediates, which can't be dependent on vector sources. The immediate splitting is already done when selecting constants. I'm not sure what contexts if any the register splitting would have been used before. Also clean up using s_mov_b64 in place of v_mov_b64_pseudo, although this isn't required and just skips the extra step of eliminating the copy from the SReg_64. llvm-svn: 246080	2015-08-26 20:48:08 +00:00
Matt Arsenault	951daff52a	AMDGPU: Don't reprocess instructions when splitting i64 bcnt llvm-svn: 246079	2015-08-26 20:48:04 +00:00
Matt Arsenault	a7c6405427	AMDGPU: Fix not moving users of s_bfe_i64 to VALU This wouldn't propagate to users of the original BFE and would hit a verifier error. llvm-svn: 246078	2015-08-26 20:47:58 +00:00
Matt Arsenault	c3af5ff7c8	AMDGPU: Don't create intermediate SALU instructions When splitting 64-bit operations, create the correct VALU instructions immediately. This was splitting things like s_or_b64 into the two s_or_b32s and then pushing the new instructions onto the worklist. There's no reason we need to do this intermediate step. llvm-svn: 246077	2015-08-26 20:47:50 +00:00
Matthias Braun	296a9f8855	SelectionDAGBuilder: Fix SPDescriptor not resetting GuardReg This was causing problems when some functions use a GuardReg and some don't as can happen when mixing SelectionDAG and FastISel generated functions. llvm-svn: 246075	2015-08-26 20:46:52 +00:00
Matthias Braun	d99861af5f	FastISel: Avoid adding a successor block twice for degenerate IR. This fixes http://llvm.org/PR24581 Differential Revision: http://reviews.llvm.org/D12350 llvm-svn: 246074	2015-08-26 20:46:49 +00:00
Andrew Kaylor	107f881276	Expose hasLiveCondCodeDef as a member function of the X86InstrInfo class. NFC This takes the existing static function hasLiveCondCodeDef and makes it a member function of the X86InstrInfo class. This is a useful utility function that an upcoming change would like to use. NFC. Patch by: Kevin B. Smith Differential Revision: http://reviews.llvm.org/D12371 llvm-svn: 246073	2015-08-26 20:36:52 +00:00
Diego Novillo	1b494a160f	Fix memory leak in sample profile pass. The problem here were the function analyses invoked by the function pass manager from the new IPO pass. I looked at other IPO passes needing dominance information and the only one that requires it (partial inliner) does not use the standard dependency mechanism. This patch mimics what the partial inliner does to compute dominance, post-dominance and loop info. One thing I like about this approach is that I can delay the computation of all this until I actually need it. This should bring the ASAN buildbot back to green. If there's a better way to fix this, I'll do it in a follow-up patch. llvm-svn: 246066	2015-08-26 20:00:27 +00:00
Mehdi Amini	d4c368cdf7	Revert "Fix LLVM C API for DataLayout" This reverts commit r246052. Third attempt, still unpleasant for some bots. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 246057	2015-08-26 19:24:59 +00:00

... 3 4 5 6 7 ...

121248 Commits