llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Go to file

Ahmed Bougacha eb32174104 [X86] Don't custom-lower vNi32 uint_to_fp when unsafe-fp-math.

The custom code produces incorrect results if later reassociated.

Since r221657, on x86, vNi32 uitofp is lowered using an optimized
sequence:

  movdqa LCPI0_0(%rip), %xmm1 ## xmm1 = [65535, ...]
  pand %xmm0, %xmm1
  por LCPI0_1(%rip), %xmm1 ## [0x4b000000, ...]
  psrld $16, %xmm0
  por LCPI0_2(%rip), %xmm0 ## [0x53000000, ...]
  addps LCPI0_3(%rip), %xmm0 ## [float -5.497642e+11, ...]
  addps %xmm1, %xmm0

Since r240361, the machine combiner opportunistically reassociates
2-instruction sequences (with -ffast-math). In the new code sequence,
the ADDPS' are eligible. In isolation, for simple examples (without
reassociable users), this makes no performance difference (the goal
being to enable reassociation of longer chains).

In the trivial example (just one uitofp), the reassociation doesn't
happen, because (I think) it would require the emission of a separate
movaps for a constantpool load (instead of folding it into addps).

However, when we have multiple uitofp sequences, and the constantpool
loads are CSE'd earlier, the machine combiner can do the reassociation.

When the ADDPS' are reassociated, the resulting sequence isn't correct
anymore, as we'd be adding large (2**39) constants with comparatively
smaller values (~2**23). Given that two of the three inputs are powers
of 2 larger than 2**16, and that ulp(2**39) == 2**(39-24) == 2**15,
the reassociated chain will produce 0 for any input in [0, 2**14[.
In my testing, it also produces wrong results for 99.5% of [0, 2**32[.

Avoid this by disabling the new lowering when -ffast-math. It does
mean that we'll get slower code than without it, but at least we
won't get egregiously incorrect code.

One might argue that, considering -ffast-math is all but meaningless,
uitofp producing wrong results isn't a compiler bug. But it really is.

Fixes PR24512.

...though this is really more of a workaround.
Ideally, we'd have some sort of Machine FMF, but that's a problem
that's not worth tackling until we do more with machine IR.

llvm-svn: 248965

2015-10-01 00:11:07 +00:00

autoconf

Don't use bashism/kshism of test ==. From Kamil Rytarowski.

2015-09-12 16:30:32 +00:00

bindings

[bindings] Update Go bindings to DIBuilder

2015-09-06 02:22:15 +00:00

cmake

Enable -Wdeprecated in the cmake build now that LLVM (& Clang, Polly, and LLD) are -Wdeprecated clean

2015-09-30 23:36:12 +00:00

docs

Introduce !align metadata for load instruction

2015-09-28 17:41:08 +00:00

examples

Fix Clang-tidy modernize-use-nullptr warnings in examples and include directories; other minor cleanups.

2015-09-29 18:02:48 +00:00

include

[WinEH] Emit int3 after noreturn calls on Win64

2015-09-30 23:09:23 +00:00

lib

[X86] Don't custom-lower vNi32 uint_to_fp when unsafe-fp-math.

2015-10-01 00:11:07 +00:00

projects

build: make libunwind a proper project

2015-04-25 01:47:39 +00:00

resources

In MSVC builds embed a VERSIONINFO resource in our exe and DLL files.

2015-06-12 15:58:29 +00:00

test

[X86] Don't custom-lower vNi32 uint_to_fp when unsafe-fp-math.

2015-10-01 00:11:07 +00:00

tools

InstrProf: Support for value profiling in the indexed profile format

2015-09-29 22:13:58 +00:00

unittests

Add support for sub-byte aligned writes to lib/Support/Endian.h

2015-09-30 13:20:37 +00:00

utils

HHVM calling conventions.

2015-09-29 22:09:16 +00:00

.arcconfig

Updated phabricator server.

2014-04-07 03:57:04 +00:00

.clang-format

Test commit.

2014-03-02 13:08:46 +00:00

.clang-tidy

Enable display of compiler diagnostics in clang-tidy by default.

2014-10-29 17:29:38 +00:00

.gitignore

Minor updates to gitignore so that symlinks are ignored in the projects dir.

2015-07-07 20:24:58 +00:00

CMakeLists.txt

[CMake] [Darwin] Need to set lto_library on CMAKE_MODULE_LINKER_FLAGS as well

2015-09-11 18:39:19 +00:00

CODE_OWNERS.TXT

CODE_OWNERS.TXT is supposed to be sorted by surname

2015-09-07 00:41:40 +00:00

configure

Don't use bashism/kshism of test ==. From Kamil Rytarowski.

2015-09-12 16:30:32 +00:00

CREDITS.TXT

[WebAssembly] Initial WebAssembly backend

2015-06-29 23:51:55 +00:00

LICENSE.TXT

Update for a new year.

2015-03-12 01:25:29 +00:00

llvm.spec.in

[Sparc] Implement i64 load/store support for 32-bit sparc.

2015-08-10 19:11:39 +00:00

LLVMBuild.txt

Remove the very substantial, largely unmaintained legacy PGO

2013-10-02 15:42:23 +00:00

Makefile

[configure/make] Propagate names of build host tools when making BuildTools

2014-03-25 21:45:41 +00:00

Makefile.common

Makefile.common: Update a description, s/Source/SOURCES/ , according to MakefileGuide.html#control-variables .

2012-12-07 01:43:23 +00:00

Makefile.config.in

We're actually -Wmissing-field-initializers clean thanks to the cmake

2015-08-07 16:44:47 +00:00

Makefile.rules

We're actually -Wmissing-field-initializers clean thanks to the cmake

2015-08-07 16:44:47 +00:00

README.txt

Revert test commit at revision 233535.

2015-03-30 12:39:03 +00:00

README.txt

Low Level Virtual Machine (LLVM)
================================

This directory and its subdirectories contain source code for LLVM,
a toolkit for the construction of highly optimized compilers,
optimizers, and runtime environments.

LLVM is open source software. You may freely distribute it under the terms of
the license agreement found in LICENSE.txt.

Please see the documentation provided in docs/ for further
assistance with LLVM, and in particular docs/GettingStarted.rst for getting
started with LLVM and docs/README.txt for an overview of LLVM's
documentation setup.

If you're writing a package for LLVM, see docs/Packaging.rst for our
suggestions.

Languages

C++ 96.9%

C 1%

Python 1%

CMake 0.6%

OCaml 0.2%

Other 0.1%