llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Eli Friedman ca89c6b055 [SelectionDAG] Improve the legalisation lowering of UMULO.	2018-08-16 18:39:39 +00:00

There is no way in the universe that doing a full-width division in software will be faster than doing overflowing multiplication in software in the first place, especially given that this same full-width multiplication needs to be done anyway.

This patch replaces the previous implementation with a direct lowering into an overflowing multiplication algorithm based on half-width operations.

Correctness of the algorithm was verified by exhaustively checking its output for overflowing multiplication of 16-bit integers against an obviously correct widening multiplication. Barring any oversights introduced by porting the algorithm to DAG, confidence in the correctness of this algorithm is extremely high.

The following table shows the change in both t = runtime and s = space. The change is expressed as a multiplier of the original, so anything under 1 is "better" and anything above 1 is worse.

+-------+-----------+-----------+-------------+-------------+
| Arch  | u64*u64 t | u64*u64 s | u128*u128 t | u128*u128 s |
+-------+-----------+-----------+-------------+-------------+
| X64   |     -     |     -     |    ~0.5     |    ~0.64    |
| i686  |   ~0.5    |  ~0.6666  |    ~0.05    |    ~0.9     |
| armv7 |     -     |   ~0.75   |      -      |    ~1.4     |
+-------+-----------+-----------+-------------+-------------+

Performance numbers have been collected by running overflowing multiplication in a loop under `perf` on two x86_64-based machines (one Intel Haswell, the other AMD Ryzen). Size numbers have been collected by looking at the size of a function containing an overflowing multiply in a loop.

All in all, it can be seen that both performance and size have improved, except in the case of armv7, where code size has regressed for the 128-bit multiply. The u128*u128 overflowing multiply on 32-bit platforms seems to benefit from this change a lot, taking only 5% of the time of the original algorithm to calculate the same thing.

The final benefit of this change is that LLVM is now capable of lowering the overflowing unsigned multiply for integers of any bit-width, as long as the target is capable of lowering regular multiplication for the same bit-width. Previously, 128-bit overflowing multiply was the widest possible.

Patch by Simonas Kazlauskas!

Differential Revision: https://reviews.llvm.org/D50310

llvm-svn: 339922
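
The idea of an overflowing multiply built from half-width operations can be illustrated outside of LLVM. The following is only a minimal sketch in plain C++, not the SelectionDAG code from the patch: it multiplies two 16-bit values using 8-bit halves and compares the result with a widening 32-bit multiplication, in the spirit of the verification described in the commit message. The names `MulResult`, `umulo16_halves`, `umulo16_reference` and `matches_reference` are hypothetical and exist only for this example.

```cpp
#include <cassert>
#include <cstdint>

// Result of an overflowing multiply: the truncated product and an overflow flag.
struct MulResult {
    std::uint16_t value;  // low 16 bits of the product
    bool overflow;        // true if the full product does not fit in 16 bits
};

// Hypothetical helper: 16-bit overflowing multiply built from 8-bit halves.
MulResult umulo16_halves(std::uint16_t lhs, std::uint16_t rhs) {
    const std::uint32_t ll = lhs & 0xFFu, lh = lhs >> 8;
    const std::uint32_t rl = rhs & 0xFFu, rh = rhs >> 8;

    // If both high halves are nonzero, the product is at least 2^16.
    bool overflow = (lh != 0) && (rh != 0);

    // Each cross product is shifted left by 8 bits in the final sum,
    // so it must itself fit in 8 bits or the result overflows.
    const std::uint32_t cross1 = lh * rl;
    const std::uint32_t cross2 = rh * ll;
    overflow |= cross1 > 0xFFu;
    overflow |= cross2 > 0xFFu;

    // Widening product of the low halves (8 x 8 -> 16 bits).
    const std::uint32_t low = ll * rl;

    // Add the cross terms into the high half; a carry out of bit 15 is overflow.
    const std::uint32_t high_part = ((cross1 + cross2) & 0xFFu) << 8;
    const std::uint32_t sum = low + high_part;
    overflow |= sum > 0xFFFFu;

    return {static_cast<std::uint16_t>(sum & 0xFFFFu), overflow};
}

// Reference implementation: widen to 32 bits and inspect the upper half.
MulResult umulo16_reference(std::uint16_t lhs, std::uint16_t rhs) {
    const std::uint32_t wide = std::uint32_t{lhs} * std::uint32_t{rhs};
    return {static_cast<std::uint16_t>(wide & 0xFFFFu), wide > 0xFFFFu};
}

// Returns true if both implementations agree for the given operands.
bool matches_reference(std::uint16_t a, std::uint16_t b) {
    const MulResult got = umulo16_halves(a, b);
    const MulResult want = umulo16_reference(a, b);
    return got.value == want.value && got.overflow == want.overflow;
}
```

Calling `matches_reference(a, b)` for every pair of 16-bit operands reproduces the kind of exhaustive check mentioned above. The sketch performs no division anywhere, which is the point the commit message makes about the previous lowering.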
bindings	[LLVM-C] [OCaml] Remove LLVMAddBBVectorizePass	2018-05-28 16:58:10 +00:00
cmake	[cmake] Prevent LLVMgold.so from being unloaded on Linux	2018-08-16 15:12:12 +00:00
docs	Update the coding standards and developer policy documentation surrounding whitespace.	2018-08-10 17:26:07 +00:00
examples	[ORC] Update JITCompileCallbackManager to support multi-threaded code.	2018-05-30 01:57:45 +00:00
include	[codeview] Use push_macro to avoid conflicts instead of a prefix	2018-08-16 17:34:31 +00:00
lib	[SelectionDAG] Improve the legalisation lowering of UMULO.	2018-08-16 18:39:39 +00:00
projects	[cmake] Support moving debuginfo-tests to llvm/projects	2017-12-12 17:06:08 +00:00
resources
runtimes	Revert "[CMake] Pass Clang defaults to runtimes builds"	2018-07-13 20:01:55 +00:00
test	[SelectionDAG] Improve the legalisation lowering of UMULO.	2018-08-16 18:39:39 +00:00
tools	[llvm-strip] Add support for -p/--preserve-dates	2018-08-16 18:29:40 +00:00
unittests	[codeview] Use push_macro to avoid conflicts instead of a prefix	2018-08-16 17:34:31 +00:00
utils	[TableGen] TypeSetByHwMode::operator== optimization	2018-08-16 16:16:28 +00:00
.arcconfig	[llvm] Set up .arcconfig to point to Diffusion L repository	2018-01-12 15:37:41 +00:00
.clang-format
.clang-tidy
.gitattributes	[DebugInfo] Add DILabel metadata and intrinsic llvm.dbg.label.	2018-05-09 02:40:45 +00:00
.gitignore
CMakeLists.txt	Remove vestiges of configure buildsystem	2018-08-14 21:25:49 +00:00
CODE_OWNERS.TXT	Add owner for llvm-objcopy	2018-08-09 22:05:19 +00:00
configure
CREDITS.TXT	Update my information in the CREDITS file.	2018-06-15 20:02:11 +00:00
LICENSE.TXT	Update copyright year to 2018.	2018-06-18 12:22:17 +00:00
llvm.spec.in
LLVMBuild.txt
README.txt	Test commit: remove a blank line	2018-06-08 21:21:55 +00:00
RELEASE_TESTERS.TXT	Remove myself from the release testers list. (NFC)	2018-06-20 21:25:50 +00:00

README.txt

The LLVM Compiler Infrastructure
================================

This directory and its subdirectories contain source code for LLVM,
a toolkit for the construction of highly optimized compilers,
optimizers, and runtime environments.

LLVM is open source software. You may freely distribute it under the terms of
the license agreement found in LICENSE.TXT.

Please see the documentation provided in docs/ for further
assistance with LLVM, and in particular docs/GettingStarted.rst for getting
started with LLVM and docs/README.txt for an overview of LLVM's
documentation setup.

If you are writing a package for LLVM, see docs/Packaging.rst for our
suggestions.