llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Matt Arsenault	c77e92f437	AMDGPU: Add intrinsics for sin/cos These provide direct access to the hardware instruction without the unit version required like llvm.sin/llvm.cos lowering requires. llvm-svn: 260782	2016-02-13 01:19:56 +00:00
Matt Arsenault	4ff4c396c1	AMDGPU: Rename intrinsic to better match instruction name Also fixes missing f32 test. llvm-svn: 260780	2016-02-13 01:03:00 +00:00
Tom Stellard	a308dba9ed	AMDGPU/SI: Add instruction defs for VOP1 DPP instructions Reviewers: nhaustov, cfang, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17159 llvm-svn: 260774	2016-02-13 00:51:31 +00:00
Matt Arsenault	4cdd9956f3	AMDGPU: Fix broken condition causing warning llvm-svn: 260773	2016-02-13 00:36:10 +00:00
Tom Stellard	6bf7a73a66	AMDGPU/SI: Organize intrinsics by subtarget Reviewers: arsenm Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17210 llvm-svn: 260771	2016-02-13 00:29:57 +00:00
Pirama Arumuga Nainar	d09ef3dd88	Don't combine fp_round (fp_round x) if f80 to f16 is generated Summary: This patch skips DAG combine of fp_round (fp_round x) if it results in an fp_round from f80 to f16. fp_round from f80 to f16 always generates an expensive (and as yet, unimplemented) libcall to __truncxfhf2. This prevents selection of native f16 conversion instructions from f32 or f64. Moreover, the first (value-preserving) fp_round from f80 to either f32 or f64 may become a NOP in platforms like x86. Reviewers: ab Subscribers: srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D17221 llvm-svn: 260769	2016-02-13 00:08:05 +00:00
Alexey Samsonov	cd76db6136	Fix Windows buildbot breakage. llvm-svn: 260766	2016-02-12 23:51:06 +00:00
Tom Stellard	9943755afb	AMDGPU/SI: Detect uniform branches and emit s_cbranch instructions Reviewers: arsenm Subscribers: mareko, MatzeB, qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16603 llvm-svn: 260765	2016-02-12 23:45:29 +00:00
Yunzhong Gao	9e56bc9706	Disable the vzeroupper insertion pass on PS4. Differential Revision: http://reviews.llvm.org/D16837 llvm-svn: 260764	2016-02-12 23:37:57 +00:00
Justin Bogner	dec93a4fca	cmake: Simplify the iOS.cmake toolchain - Remove a comment that was clearly copy pasted from Android.cmake and isn't relevant. - Remove the toolchain's sensitivity to the environment. It's less error prone to just allow users to set CMAKE_OSX_SYSROOT if they want to use a custom SDK. - Stop explicitly setting -mios-version-min to the default value. It just adds needless complexity. This makes building the native tablegen work for me even when SDKROOT is set in the environment (or passed in as -DCMAKE_OSX_SYSROOT). llvm-svn: 260763	2016-02-12 23:36:05 +00:00
Derek Schuff	eed189f4ef	[WebAssembly] Report more meaningful error messages for some unsupported ops. Computed gotos and RETURNADDR may never be supported; we can do FRAMEADDR in the future. llvm-svn: 260759	2016-02-12 22:56:03 +00:00
Krzysztof Parzyszek	702277f07f	[Hexagon] Optimize stack slot spills Replace spills to memory with spills to registers, if possible. This applies mostly to predicate registers (both scalar and vector), since they are very limited in number. A spill of a predicate register may happen even if there is a general-purpose register available. In cases like this the stack spill/reload may be eliminated completely. This optimization will consider all stack objects, regardless of where they came from and try to match the live range of the stack slot with a dead range of a register from an appropriate register class. llvm-svn: 260758	2016-02-12 22:53:35 +00:00
David Majnemer	85466f02fc	[llvm-pdbdump] Start to decode some streams We can decode a little bit of the first stream now. llvm-svn: 260754	2016-02-12 22:27:44 +00:00
Krzysztof Parzyszek	882483351a	[Hexagon] Mark HVX registers as volatile llvm-svn: 260753	2016-02-12 22:26:44 +00:00
Sanjay Patel	9507ea18eb	fix test to use FileCheck llvm-svn: 260751	2016-02-12 22:07:54 +00:00
Derek Schuff	6f279569a2	[WebAssembly] Update test expectations after r260737 llvm-svn: 260750	2016-02-12 22:05:08 +00:00
Krzysztof Parzyszek	4004463702	[Hexagon] Recognize more cases in copyPhysReg and stack slot load/store llvm-svn: 260748	2016-02-12 21:56:41 +00:00
Reid Kleckner	7c03262156	[codeview] Describe local variables in registers llvm-svn: 260746	2016-02-12 21:48:30 +00:00
Rong Xu	669262d490	[PGO] Add another interface for annotateValueSite Add another interface to function annotateValueSite() which directly uses the VauleData array. Differential Revision: http://reviews.llvm.org/D17108 llvm-svn: 260741	2016-02-12 21:36:17 +00:00
Dan Gohman	25721173a7	[WebAssembly] Fix byval for empty types. llvm-svn: 260740	2016-02-12 21:30:18 +00:00
Chad Rosier	9f8cc1098d	[AArch64] Enable post-RA MI scheduler for Kryo. This should have landed in r260686. llvm-svn: 260739	2016-02-12 21:27:33 +00:00
Dan Gohman	3aceffcf65	[WebAssembly] Fix insertion of a BLOCK in a loop header that also ends a BLOCK. llvm-svn: 260737	2016-02-12 21:19:25 +00:00
Andrew Kaylor	d60c52101d	[WinEH] Prevent EH state numbering from skipping nested cleanup pads that never return Differential Revision: http://reviews.llvm.org/D17208 llvm-svn: 260733	2016-02-12 21:10:16 +00:00
Chad Rosier	926854dd2a	[LIR] Allow merging of memsets in negatively strided loops. Last part of PR25166. llvm-svn: 260732	2016-02-12 21:03:23 +00:00
Justin Lebar	c576113aad	Fix typo in comment. llvm-svn: 260731	2016-02-12 21:01:37 +00:00
Justin Lebar	b60c46619b	[SimplifyCFG] Don't fold conditional branches that contain calls to convergent functions. Summary: Performing this optimization duplicates the call to the convergent function and adds new control-flow dependencies, which is a no-no. Reviewers: jingyue Subscribers: broune, hfinkel, tra, resistor, joker.eph, arsenm, llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D17128 llvm-svn: 260730	2016-02-12 21:01:36 +00:00
Justin Lebar	9fd0bc5568	[LoopRotate] Don't perform loop rotation if the loop header calls a convergent function. Summary: Calls to convergent functions can be duplicated, but only if the duplicates are not control-flow dependent on any additional values. Loop rotation doesn't meet the bar. Reviewers: jingyue Subscribers: mzolotukhin, llvm-commits, arsenm, joker.eph, resistor, tra, hfinkel, broune Differential Revision: http://reviews.llvm.org/D17127 llvm-svn: 260729	2016-02-12 21:01:33 +00:00
Justin Lebar	a884bf8763	Add convergent property to CodeMetrics. Summary: No functional changes. Reviewers: jingyue, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17126 llvm-svn: 260728	2016-02-12 21:01:31 +00:00
Justin Lebar	aa89e5f886	Initialize CodeMetrics' member variables inline with definitions. Summary: No functional changes. Reviewers: jingyue Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17125 llvm-svn: 260727	2016-02-12 20:59:20 +00:00
Krzysztof Parzyszek	31aecfe356	[Hexagon] Recognize more instructions in isLoadFromStackSlot/isStoreToStackSlot llvm-svn: 260725	2016-02-12 20:54:15 +00:00
Quentin Colombet	0c794fb773	Get rid of some GLOBAL_ISEL ifdefs that should be harmless for code size. More to come, but those were easy. llvm-svn: 260723	2016-02-12 20:41:24 +00:00
David Majnemer	42e3fde99d	Remove unused variable llvm-svn: 260722	2016-02-12 20:33:51 +00:00
Benjamin Kramer	f375282672	Remove LLVMGetTargetMachineData leftovers. llvm-svn: 260720	2016-02-12 20:26:46 +00:00
Argyrios Kyrtzidis	94fe44f7a0	[ADT] Revert the llvm/ADT/OptionSet.h header and unit test. llvm-svn: 260714	2016-02-12 19:47:35 +00:00
Philip Reames	53e99b3ac9	[GVN] Common code for local and non-local load availability [NFCI] The attached patch removes all of the block local code for performing X-load forwarding by reusing the code used in the non-local case. The motivation here is to remove duplication and in the process increase our test coverage of some fairly tricky code. I have some upcoming changes I'll be proposing in this area and wanted to have the code cleaned up a bit first. Note: The review for this mostly happened in email which didn't make it to phabricator on the 258882 commit thread. Differential Revision: http://reviews.llvm.org/D16608 llvm-svn: 260711	2016-02-12 19:24:57 +00:00
Chad Rosier	d1b5bf7e94	[LIR] Partially revert r252926(NFC), which introduced a very subtle change. In short, before r252926 we were comparing an unsigned (StoreSize) against an a APInt (Stride), which is fine and well. After we were zero extending the Stride and then converting to an unsigned, which is not the same thing. Obviously, Stides can also be negative. This commit just restores the original behavior. AFAICT, it's not possible to write a test case to expose the issue because the code already has checks to make sure the StoreSize can't overflow an unsigned (which prevents the Stride from overflowing an unsigned as well). llvm-svn: 260706	2016-02-12 19:05:27 +00:00
Philip Reames	070a34b8a9	[LVI] Exploit nsw/nuw when computing constant ranges As the title says. Modelled after similar code in SCEV. This is useful when analysing induction variables in loops which have been canonicalized by other passes. I wrote the tests as non-loops specifically to avoid the generality introduced in http://reviews.llvm.org/D17174. While that can handle many induction variables without needing to exploit nsw, there's no reason not to use it if we've already proven it. Differential Revision: http://reviews.llvm.org/D17177 llvm-svn: 260705	2016-02-12 19:05:16 +00:00
Hans Wennborg	9d006a860e	[CMake] don't build libLTO when LLVM_ENABLE_PIC is OFF When cmake is run with -DLLVM_ENABLE_PIC=OFF, build fails while linking shared library libLTO.so, because its dependencies are built with -fno-PIC. More details here: https://llvm.org/bugs/show_bug.cgi?id=26484. This diff reverts r252652 (git 9fd4377ddb83aee3c049dc8757e7771edbb8ee71), which removed check NOT LLVM_ENABLE_PIC before disabling build for libLTO.so. Patch by Igor Sugak! Differential Revision: http://reviews.llvm.org/D17049 llvm-svn: 260703	2016-02-12 19:02:39 +00:00
Mehdi Amini	217c73784f	GlobalISel is always built since r260566, reflect it in LLVMBuild.txt Other component could not depends on an optional library in llvm-config From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260701	2016-02-12 18:43:14 +00:00
Mehdi Amini	dcb378d5fc	llvm-config: replace assertions with a helpful error message From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260700	2016-02-12 18:43:10 +00:00
Krzysztof Parzyszek	01c8f9e8e7	[Hexagon] Add utility functions to detect sign- and zero-extending loads llvm-svn: 260698	2016-02-12 18:37:23 +00:00
Krzysztof Parzyszek	8094113d3a	[Hexagon] Replace expansion of spill pseudo-instructions in frame lowering Rewrite the code to handle all pseudo-instructions in a single pass. This temporarily reverts spill slot optimization that used general- purpose registers to hold values of spilled predicate registers. llvm-svn: 260696	2016-02-12 18:19:53 +00:00
David Majnemer	5886331bc4	[InstCombine] Don't aggressively replace xor with icmp For some cases, InstCombine replaces the sequence of xor/sub instruction followed by cmp instruction into a single cmp instruction. However, this replacement may result suboptimal result especially when the xor/sub has more than one use, as discussed in bug 26465 (https://llvm.org/bugs/show_bug.cgi?id=26465). This patch make the replacement happen only when xor/sub has only one use. Differential Revision: http://reviews.llvm.org/D16915 Patch by Taewook Oh! llvm-svn: 260695	2016-02-12 18:12:38 +00:00
Tom Stellard	10d903c4f3	[AMDGPU] Assembler: Swap operands of flat_store instructions to match AMD assembler Historically, AMD internal sp3 assembler has flat_store* addr, data format. To match existing code and to enable reuse, change LLVM definitions to match. Also update MC and CodeGen tests. Differential Revision: http://reviews.llvm.org/D16927 Patch by: Nikolay Haustov llvm-svn: 260694	2016-02-12 17:57:54 +00:00
Changpeng Fang	7cf99f3396	AMDGPU/SI: Annotate Loops with Constant Condition in SIAnnotateControlFlow pass. Summary: It is possible that the loop condition can be a boolean constant (infinite loop, for example). So we sould handle constant condition in annotating a loop. This patch adds this functionality to support annotating constant condition. Reviewers: tstellarAMD, arsenm Subscribers: llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D15093 llvm-svn: 260692	2016-02-12 17:11:04 +00:00
Krzysztof Parzyszek	c69fcdf24c	[Hexagon] Remove HexagonExpandPredSpillCode pass This code is dead. The expansion is now done in HexagonFrameLowering. llvm-svn: 260691	2016-02-12 17:09:58 +00:00
Krzysztof Parzyszek	782b19da54	[Hexagon] Eliminate pseudo instructions for circ/brev loads and stores We can generate the actual instructions from the intrinsics without the need for pseudo-instructions. Also, since the intrinsics have a side- effect in a form of a store, attempt to optimize away loads from the store location. llvm-svn: 260690	2016-02-12 17:01:51 +00:00
Geoff Berry	2d034feb0d	[AArch64] Reduce number of callee-save save/restores. Summary: Before this change, callee-save registers would be rounded up to even pairs of GPRs and FPRs. This change eliminates these extra padding load/stores, though it does keep the stack allocation the same size unless both the GPR and FPR sets have an odd size, in which case one full pair stack slot (16 bytes) is saved. This optimization cannot currently be done for MachO targets since they rely on a fast-path .debug_frame equivalent that can only encode callee-save registers as pairs. Reviewers: t.p.northover, rengolin, mcrosier, jmolloy Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D17000 llvm-svn: 260689	2016-02-12 16:31:41 +00:00
Krzysztof Parzyszek	e3cbbaf0f0	[Hexagon] Handle out-of-range offsets in eliminateFrameIndex Create a virtual register that will hold the actual address and use it with the offset of 0 in the place of the original FI. llvm-svn: 260688	2016-02-12 16:27:23 +00:00
Chad Rosier	81da1b9bcf	[AArch64] Add support for Qualcomm Kryo CPU. Machine model description by Dave Estes <cestes@codeaurora.org>. llvm-svn: 260686	2016-02-12 15:51:51 +00:00

1 2 3 4 5 ...

127455 Commits