llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 08:23:21 +01:00

Author	SHA1	Message	Date
Duncan Sands	4757061c47	Factorize common code out of the InstructionSimplify shift logic. Add in threading of shifts over selects and phis while there. This fires here and there in the testsuite, to not much effect. For example when compiling spirit it fires 5 times, during early-cse, resulting in 6 more cse simplifications, and 3 more terminators being folded by jump threading, but the final bitcode doesn't change in any interesting way: other optimizations would have caught the opportunity anyway, only later. llvm-svn: 123441	2011-01-14 14:44:12 +00:00
Duncan Sands	01be7e406d	Rename this test. llvm-svn: 123440	2011-01-14 14:16:33 +00:00
Chris Lattner	c0bd89331e	switch the second scalarrepl pass to use SSAUpdater. We run two scalarrepl passes: one early in the cleanup code and one late interlaced with the inliner. The second one is important because inlining and other scalar optzns can unpin allocas, allowing them to be split up and promoted. While important for performance, this is also relatively rare, and we would previously force a (non-lazy) computation of DomFrontiers, which happened even if nothing became unpinned. With this patch, the first pass of scalarrepl still promotes the vast bulk of allocas in programs, but hte second pass has changed to use SSAUpdater, which is more "sparse" and lazy. This speeds up opt -O3 time on kimwitu++ (a c++ app) by about 1%. The numbers are interesting: the first pass promotes ~17500 allocas. The second pass promotes about 1600. For non-C++ codes, the compile time win should be greater, because the second pass of scalarrepl does less. llvm-svn: 123437	2011-01-14 08:21:08 +00:00
Chris Lattner	8e171470d3	split SROA into two passes: one that uses DomFrontiers (-scalarrepl) and one that uses SSAUpdater (-scalarrepl-ssa) llvm-svn: 123436	2011-01-14 08:13:00 +00:00
Jay Foad	fa61721cf2	Remove casts between Value and Constant, which won't work if a static_cast from Constant* to Value* has to adjust the "this" pointer. This is groundwork for PR889. llvm-svn: 123435	2011-01-14 08:07:43 +00:00
Chris Lattner	b5c39352d8	Implement full support for promoting allocas to registers using SSAUpdater instead of DomTree/DomFrontier. This may be interesting for reducing compile time. This is currently disabled, but seems to work just fine. When this is enabled, we eliminate two runs of dominator frontier, one in the "early per-function" optimizations and one in the "interlaced with inliner" function passes. llvm-svn: 123434	2011-01-14 07:50:47 +00:00
Chris Lattner	de9ec03027	relax testcase a bit. llvm-svn: 123433	2011-01-14 07:46:33 +00:00
Jakob Stoklund Olesen	9f5e00f957	Try for the third time to teach getFirstTerminator() about debug values. This time let's rephrase to trick gcc-4.3 into not miscompiling. llvm-svn: 123432	2011-01-14 06:33:45 +00:00
Chris Lattner	eba719204c	revert my fastisel patch again which apparently still gives the llvm-gcc-i386-linux-selfhost buildbot heartburn... llvm-svn: 123431	2011-01-14 06:14:33 +00:00
Chris Lattner	ee950eeb24	reapply r123414 now that the botz are calmed down and the fix is already in. llvm-svn: 123427	2011-01-14 04:24:28 +00:00
Chris Lattner	b1ba935526	indentation llvm-svn: 123426	2011-01-14 04:23:53 +00:00
Evan Cheng	0cdd5547f1	Completed :lower16: / :upper16: support for movw / movt pairs on Darwin. - Fixed :upper16: fix up routine. It should be shifting down the top 16 bits first. - Added support for Thumb2 :lower16: and :upper16: fix up. - Added :upper16: and :lower16: relocation support to mach-o object writer. llvm-svn: 123424	2011-01-14 02:38:49 +00:00
Jakob Stoklund Olesen	99ad62ed9e	Revert r123419. It still breaks llvm-gcc-i386-linux-selfhost. llvm-svn: 123423	2011-01-14 02:12:54 +00:00
Chris Lattner	349735530b	r123414 broke llvm-gcc bootstrap apparently, revert llvm-svn: 123422	2011-01-14 02:07:32 +00:00
Chris Lattner	a0074ca5fc	Set the insertion point correctly for instructions generated by load folding: they should go before the new instruction not after it. llvm-svn: 123420	2011-01-14 01:33:40 +00:00
Jakob Stoklund Olesen	3d8deb13ee	Try again to teach getFirstTerminator() about debug values. Fix some callers to better deal with debug values. llvm-svn: 123419	2011-01-14 01:17:53 +00:00
Owen Anderson	6de2a4d67c	Rather than doing early instcombine, try doing early CSE instead. This should still handle most important simplifications, as well as resolving phase ordering issues where instcombine would inhibit important CSE'ing opportunities, for instance on BitBench/drop3. llvm-svn: 123418	2011-01-14 00:41:11 +00:00
Duncan Sands	44c273d907	Move some shift transforms out of instcombine and into InstructionSimplify. While there, I noticed that the transform "undef >>a X -> undef" was wrong. For example if X is 2 then the top two bits must be equal, so the result can not be anything. I fixed this in the constant folder as well. Also, I made the transform for "X << undef" stronger: it now folds to undef always, even though X might be zero. This is in accordance with the LangRef, but I must admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef" following the LangRef and the constant folder, likewise fairly aggressive. llvm-svn: 123417	2011-01-14 00:37:45 +00:00
Owen Anderson	e9841116c0	Don't bother conditionalizing the use of SROA in -O1 mode. We're already running it unconditionally later in the pipeline. llvm-svn: 123416	2011-01-14 00:36:40 +00:00
Chris Lattner	5baec05809	fix PR8961 - a fast isel miscompilation where we'd insert a new instruction after sext's generated for addressing that got folded. Previously we compiled test5 into: _test5: ## @test5 ## BB#0: movq -8(%rsp), %rax ## 8-byte Reload movq (%rdi,%rax), %rdi addq %rdx, %rdi movslq %esi, %rax movq %rax, -8(%rsp) ## 8-byte Spill movq %rdi, %rax ret which is insane and wrong. Now we produce: _test5: ## @test5 ## BB#0: movslq %esi, %rax movq (%rdi,%rax), %rax addq %rdx, %rax ret llvm-svn: 123414	2011-01-14 00:01:01 +00:00
Jakob Stoklund Olesen	b5e12bb37c	Better terminator avoidance. This approach also works when the terminator doesn't have a slot index. (Which can happen??) llvm-svn: 123413	2011-01-13 23:35:53 +00:00
Evan Cheng	579f2b17bf	Add comment about Thumb2 fixup comments being completely bogus. llvm-svn: 123411	2011-01-13 23:27:39 +00:00
Tobias Grosser	189efecfed	Add single entry / single exit accessors. Add methods for accessing the (single) entry / exit edge of a region. If no such edge exists, null is returned. Both accessors return the start block of the corresponding edge. The edge can finally be formed by utilizing Region::getEntry() or Region::getExit(); Contributed by: Andreas Simbuerger <simbuerg@fim.uni-passau.de> llvm-svn: 123410	2011-01-13 23:18:04 +00:00
Owen Anderson	58bcb5d7f2	Recognize alternative register names like ip -> r12. Fixes <rdar://problem/8857982>. llvm-svn: 123409	2011-01-13 22:50:36 +00:00
Jakob Stoklund Olesen	918de3a3b8	Fix a few more places that should use MBB::getLastNonDebugInstr(). llvm-svn: 123408	2011-01-13 22:47:43 +00:00
Owen Anderson	4f5dac3541	As far as I can tell, unified syntax uses c0-c15 instead of cr0-cr15 for mcr and friends. llvm-svn: 123407	2011-01-13 22:38:16 +00:00
Chris Lattner	d2d217dc46	typo llvm-svn: 123406	2011-01-13 22:11:56 +00:00
Chris Lattner	6745cd150c	memcpy + metadata = bliss :) llvm-svn: 123405	2011-01-13 22:08:15 +00:00
Owen Anderson	18dfab2332	Add support to the ARM MC infrastructure to support mcr and friends. This requires supporting the symbolic immediate names used for these instructions, fixing their pretty-printers, and adding proper encoding information for them. With this, we can properly pretty-print and encode assembly like: mrc p15, #0, r3, c13, c0, #3 Fixes <rdar://problem/8857858>. llvm-svn: 123404	2011-01-13 21:46:02 +00:00
Evan Cheng	cf9949ddbd	Relax an assertion. On archs like ARM, an immediate field may be scattered. So it's possible for some bits of every 8 bits to be encoded already, and the rest still needs to be fixed up. llvm-svn: 123403	2011-01-13 21:45:26 +00:00
Jakob Stoklund Olesen	d63287ff98	Temporary workaround for an i386 crash in LiveDebugVariables. llvm-svn: 123400	2011-01-13 21:28:55 +00:00
Jakob Stoklund Olesen	0f2b9d9dc4	Teach frame lowering to ignore debug values after the terminators. llvm-svn: 123399	2011-01-13 21:28:52 +00:00
Bob Wilson	569cd41943	Tidy comments, indentation, and 80-column violations. llvm-svn: 123397	2011-01-13 21:10:12 +00:00
Bob Wilson	1238f872da	Fix whitespace. llvm-svn: 123396	2011-01-13 20:59:44 +00:00
Kevin Enderby	eee2f3489b	Fix ARMAsmParser::ParseOperand() to allow it to parse . as a branch target and directional local labels like 1f and 2b. llvm-svn: 123393	2011-01-13 20:32:36 +00:00
Devang Patel	b2899fce10	Little help to debug the bugpoint itself. Patch by Bob Wilson. llvm-svn: 123390	2011-01-13 19:48:54 +00:00
Devang Patel	8e59113036	Speculatively revert r123384 to make llvm-gcc-i386-linux-selfhost buildbot happy. llvm-svn: 123389	2011-01-13 19:27:50 +00:00
Oscar Fuentes	9618cded65	Add some platform tests. Patch by arrowdodger! llvm-svn: 123388	2011-01-13 19:17:28 +00:00
Jim Grosbach	767dfbf685	When updating a tSpill/tRestore instruction to be a tSTRr/tLDRr, correctly set up the source operands. The original instr has an immediate operand that should be replaced with the frame reg operand rather than just adding the reg operand. Previously, the instruction ended up with too many operands causing an assert() when adding the default predicate. rdar://8825456 llvm-svn: 123387	2011-01-13 19:16:48 +00:00
Jakob Stoklund Olesen	6aa35206e7	Teach MachineBasicBlock::getFirstTerminator to ignore debug values. It will still return an iterator that points to the first terminator or end(), but there may be DBG_VALUE instructions following the first terminator. llvm-svn: 123384	2011-01-13 18:41:05 +00:00
Bob Wilson	fbab825516	Check for empty structs, and for consistency, zero-element arrays. llvm-svn: 123383	2011-01-13 18:26:59 +00:00
Bob Wilson	3b0197489e	Extend SROA to handle arrays accessed as homogeneous structs and vice versa. This is a minor extension of SROA to handle a special case that is important for some ARM NEON operations. Some of the NEON intrinsics return multiple values, which are handled as struct types containing multiple elements of the same vector type. The corresponding return types declared in the arm_neon.h header have equivalent arrays. We need SROA to recognize that it can split up those arrays and structs into separate vectors, even though they are not always accessed with the same type. SROA already handles loads and stores of an entire alloca by using insertvalue/extractvalue to access the individual pieces, and that code works the same regardless of whether the type is a struct or an array. So, all that needs to be done is to check for compatible arrays and homogeneous structs. llvm-svn: 123381	2011-01-13 17:45:11 +00:00
Bob Wilson	9f8d730f9b	Make SROA more aggressive with allocas containing padding. SROA only split up structs and arrays one level at a time, so padding can only cause trouble if it is located in between the struct or array elements. llvm-svn: 123380	2011-01-13 17:45:08 +00:00
Oscar Fuentes	8d5e1d912b	Disable RTTI when building unit tests. This avoids errors at link time. llvm-svn: 123377	2011-01-13 15:31:45 +00:00
Oscar Fuentes	f975a7423b	Platform tests for argz_* functions. Patch by arrowdodger! llvm-svn: 123376	2011-01-13 15:06:32 +00:00
Duncan Sands	69fbfa2b0e	Remove some wrong code which fortunately was never executed (as explained in the comment I added): an extern weak global may have a null address. llvm-svn: 123373	2011-01-13 10:43:08 +00:00
Duncan Sands	36b007d63b	The most common simplification missed by instsimplify in unoptimized bitcode is "X != 0 -> X" when X is a boolean. This occurs a lot because of the way llvm-gcc converts gcc's conditional expressions. Add this, and a few other similar transforms for completeness. llvm-svn: 123372	2011-01-13 08:56:29 +00:00
Evan Cheng	cc474b4864	Model :upper16: and :lower16: as ARM specific MCTargetExpr. This is a step in the right direction. It eliminated some hacks and will unblock codegen work. But it's far from being done. It doesn't reject illegal expressions, e.g. (FOO - :lower16:BAR). It also doesn't work in Thumb2 mode at all. llvm-svn: 123369	2011-01-13 07:58:56 +00:00
Eric Christopher	3821f63f4b	Experiment with changing the default 32-bit linux stack alignment to 16 bytes for PR8969. Update all testcases accordingly. llvm-svn: 123367	2011-01-13 06:47:10 +00:00
Rafael Espindola	0272c002ae	Keep unnamed_addr when linking. llvm-svn: 123364	2011-01-13 05:12:34 +00:00

1 2 3 4 5 ...

69139 Commits