| Copyright | (c) Levent Erkok |
|---|---|
| License | BSD3 |
| Maintainer | erkokl@gmail.com |
| Stability | experimental |
| Safe Haskell | None |
| Language | Haskell2010 |
Documentation.SBV.Examples.Misc.Floating
Description
Several examples involving IEEE-754 floating point numbers, i.e., single
precision Float (SFloat) and double precision Double (SDouble) types.
Note that arithmetic with floating point is full of surprises; due to precision
issues associativity of arithmetic operations typically do not hold. Also,
the presence of NaN is always something to look out for.
Synopsis
- assocPlus :: SFloat -> SFloat -> SFloat -> SBool
- assocPlusRegular :: IO ThmResult
- nonZeroAddition :: IO ThmResult
- multInverse :: IO ThmResult
- roundingAdd :: IO SatResult
FP addition is not associative
assocPlus :: SFloat -> SFloat -> SFloat -> SBool Source #
Prove that floating point addition is not associative. For illustration purposes,
we will require one of the inputs to be a NaN. We have:
>>>prove $ assocPlus (0/0)Falsifiable. Counter-example: s0 = 0.0 :: Float s1 = 0.0 :: Float
Indeed:
>>>let i = 0/0 :: Float>>>i + (0.0 + 0.0)NaN>>>((i + 0.0) + 0.0)NaN
But keep in mind that NaN does not equal itself in the floating point world! We have:
>>>let nan = 0/0 :: Float in nan == nanFalse
assocPlusRegular :: IO ThmResult Source #
Prove that addition is not associative, even if we ignore NaN/Infinity values.
To do this, we use the predicate fpIsPoint, which is true of a floating point
number (SFloat or SDouble) if it is neither NaN nor Infinity. (That is, it's a
representable point in the real-number line.)
We have:
>>>assocPlusRegularFalsifiable. Counter-example: x = 1.3067223e-25 :: Float y = -1.7763568e-15 :: Float z = 1.7762754e-15 :: Float
Indeed, we have:
>>>let x = 1.3067223e-25 :: Float>>>let y = -1.7763568e-15 :: Float>>>let z = 1.7762754e-15 :: Float>>>x + (y + z)-8.142091e-20>>>(x + y) + z-8.142104e-20
Note the difference in the results!
FP addition by non-zero can result in no change
nonZeroAddition :: IO ThmResult Source #
Demonstrate that a+b = a does not necessarily mean b is 0 in the floating point world,
even when we disallow the obvious solution when a and b are Infinity.
We have:
>>>nonZeroAdditionFalsifiable. Counter-example: a = 7.486071e12 :: Float b = 188.8646 :: Float
Indeed, we have:
>>>let a = 7.486071e12 :: Float>>>let b = 188.8646 :: Float>>>a + b == aTrue>>>b == 0False
FP multiplicative inverses may not exist
multInverse :: IO ThmResult Source #
This example illustrates that a * (1/a) does not necessarily equal 1. Again,
we protect against division by 0 and NaN/Infinity.
We have:
>>>multInverseFalsifiable. Counter-example: a = 8.590978e-39 :: Float
Indeed, we have:
>>>let a = 8.590978e-39 :: Float>>>a * (1/a)0.99999994
Effect of rounding modes
roundingAdd :: IO SatResult Source #
One interesting aspect of floating-point is that the chosen rounding-mode
can effect the results of a computation if the exact result cannot be precisely
represented. SBV exports the functions fpAdd, fpSub, fpMul, fpDiv, fpFMA
and fpSqrt which allows users to specify the IEEE supported RoundingMode for
the operation. This example illustrates how SBV can be used to find rounding-modes
where, for instance, addition can produce different results. We have:
>>>roundingAddSatisfiable. Model: rm = RoundTowardPositive :: RoundingMode x = -2.240786e-38 :: Float y = -1.10355e-39 :: Float
(Note that depending on your version of Z3, you might get a different result.)
Unfortunately we can't directly validate this result at the Haskell level, as Haskell only supports
RoundNearestTiesToEven. We have:
>>>-2.240786e-38 + (-1.10355e-39) :: Float-2.3511412e-38
While we cannot directly see the result when the mode is RoundTowardPositive in Haskell, we can use
SBV to provide us with that result thusly:
>>>sat $ \z -> z .== fpAdd sRoundTowardPositive (-2.240786e-38) (-1.10355e-39 :: SFloat)Satisfiable. Model: s0 = -2.351141e-38 :: Float
We can see why these two results are indeed different: The RoundTowardPositive
(which rounds towards positive infinity from zero) produces a larger result. Indeed, if we treat these numbers
as Double values, we get:
> -2.240786e-38 + (-1.10355e-39) :: Double
- 2.351141e-38
we see that the "more precise" result is larger than what the Float value is, justifying the
larger value with RoundTowardPositive. A more detailed study is beyond our current scope, so we'll
merely note that floating point representation and semantics is indeed a thorny
subject, and point to http://ece.uwaterloo.ca/~dwharder/NumericalAnalysis/02Numerics/Double/paper.pdf as
an excellent guide.