[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Multiple precisions of floating-point arithmetic

This page is part of the web mail archives of SRFI 77 from before July 7th, 2015. The new archives for SRFI 77 contain all messages, not just those from before July 7th, 2015.

To: srfi-77@xxxxxxxxxxxxxxxxx
Subject: Multiple precisions of floating-point arithmetic
From: Bradley Lucier <lucier@xxxxxxxxxxxxxxx>
Date: Sun, 26 Feb 2006 12:17:34 -0600
Cc: Bradley Lucier <lucier@xxxxxxxxxxxxxxx>
Delivered-to: srfi-77@xxxxxxxxxxxxxxxxx

Some floating-point applications need greater-than-64-bit-precisionarithmetic; two are mentioned below.

Perhaps this SRFI should tackle the problem of providing floating-point arithmetics of various precisions. If we think this might beneeded, then the specially-named--operator approach for floating-point arithmetic as suggested in this SRFI (and which I like, by theway), does not seem to scale well.

Common Lisp has an approach which is perhaps cumbersome to useproperly and may be error prone, but it does allow for theimplementation and use of differing precisions of floating-pointarithmetic where they are useful.

Or perhaps one could use the naming convention "name" (default doubleprecision operation), "name"f (single-precision, 32-bit, operator),and "name"l (long double, whether 80 bit extended precision, 128-bitquad precision, or 128-bit pair-of-64-bit-doubles precision) foroperations as is done in C if one wants to use the special-nameapproach.


Brad

Examples of effective use of 128-bit floating-point arithmetic:

The following problem was pointed out by Philip W Sharp at theUniversity of Auckland in a talk on the long-time simulation of thesolar system.

As computers get faster, round-off error accumulates more quickly,and, indeed, scientists are reaching the end of usefulness of 64-bitIEEE floating-point arithmetic for long-time simulations of thebehavior of the solar system. There's a paper here that discussesthis issue:


http://anziamj.austms.org.au/V46/CTAC2004/Gra2/home.html

Basically, if you want to simulate the solar system for longer timesyou'll need an underlying arithmetic with more accuracy.

Beyond using extended-precision arithmetic for accurate evaluation ofthe elementary functions, this was the first "real" application thatI had heard of that needed more than 64-bit arithmetic.

Then Colin Percival published his paper "Rapid multiplication modulothe sum and difference of highly composite numbers",

www.ams.org/mcom/2003-72-241/S0025-5718-02-01419-9/S0025-5718-02-01419-9.pdf

which gives new bounds for the error in FFTs implemented in floating-point arithmetic. This allows you to use FFTs to implement bignumarithmetic with inputs of size 256 * (1024)^2 bits in 64-bit IEEEarithmetic with proven accuracy. (Most codes for FFT bignumarithmetic use number-theoretic FFTs on finite fields.) This is notas big as some applications would like, but with 128-bit arithmetic(either so-called quad-precision with a 15 bit exponent and 113-bitmantissa or IBM-type long-double implemented as a pair of doubles (sowith the same dynamic range as 64-bit IEEE arithmetic but with about106 bits of precision)), one could very easily implement fast,provably accurate bignum multiplication for sizes as big as one mightever need (and I don't think I'll live long enough to see thatstatement made false).

I think that, given the effort and expense put into designing fastfloating-point arithmetic units, bignum arithmetic built on floating-point FFTs will, in the end, be faster than the number theoretic FFTsnow popular among the "really big bignum" folks.

Follow-Ups:
- Re: Multiple precisions of floating-point arithmetic
  - From: bear

Prev by Date: Re: Integer residue-classes [was: Questions about srfi-77 Generic Arithmetic]
Next by Date: Re: Multiple precisions of floating-point arithmetic
Previous by thread: miscellaneous request (last one)
Next by thread: Re: Multiple precisions of floating-point arithmetic
Index(es):
- Date
- Thread