README (revision 4882a59341e53eb6f0b4789bf948001014eff981) - OpenGrok cross reference for /OK3568_Linux_fs/kernel/arch/x86/math-emu/README

*4882a593Smuzhiyun +---------------------------------------------------------------------------+
*4882a593Smuzhiyun |  wm-FPU-emu   an FPU emulator for 80386 and 80486SX microprocessors.      |
*4882a593Smuzhiyun |                                                                           |
*4882a593Smuzhiyun | Copyright (C) 1992,1993,1994,1995,1996,1997,1999                          |
*4882a593Smuzhiyun |                       W. Metzenthen, 22 Parker St, Ormond, Vic 3163,      |
*4882a593Smuzhiyun |                       Australia.  E-mail billm@melbpc.org.au              |
*4882a593Smuzhiyun |                                                                           |
*4882a593Smuzhiyun |    This program is free software; you can redistribute it and/or modify   |
*4882a593Smuzhiyun |    it under the terms of the GNU General Public License version 2 as      |
*4882a593Smuzhiyun |    published by the Free Software Foundation.                             |
*4882a593Smuzhiyun |                                                                           |
*4882a593Smuzhiyun |    This program is distributed in the hope that it will be useful,        |
*4882a593Smuzhiyun |    but WITHOUT ANY WARRANTY; without even the implied warranty of         |
*4882a593Smuzhiyun |    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the          |
*4882a593Smuzhiyun |    GNU General Public License for more details.                           |
*4882a593Smuzhiyun |                                                                           |
*4882a593Smuzhiyun |    You should have received a copy of the GNU General Public License      |
*4882a593Smuzhiyun |    along with this program; if not, write to the Free Software            |
*4882a593Smuzhiyun |    Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA.              |
*4882a593Smuzhiyun |                                                                           |
*4882a593Smuzhiyun +---------------------------------------------------------------------------+
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593Smuzhiyunwm-FPU-emu is an FPU emulator for Linux. It is derived from wm-emu387
*4882a593Smuzhiyunwhich was my 80387 emulator for early versions of djgpp (gcc under
*4882a593Smuzhiyunmsdos); wm-emu387 was in turn based upon emu387 which was written by
*4882a593SmuzhiyunDJ Delorie for djgpp.  The interface to the Linux kernel is based upon
*4882a593Smuzhiyunthe original Linux math emulator by Linus Torvalds.
*4882a593Smuzhiyun
*4882a593SmuzhiyunMy target FPU for wm-FPU-emu is that described in the Intel486
*4882a593SmuzhiyunProgrammer's Reference Manual (1992 edition). Unfortunately, numerous
*4882a593Smuzhiyunfacets of the functioning of the FPU are not well covered in the
*4882a593SmuzhiyunReference Manual. The information in the manual has been supplemented
*4882a593Smuzhiyunwith measurements on real 80486's. Unfortunately, it is simply not
*4882a593Smuzhiyunpossible to be sure that all of the peculiarities of the 80486 have
*4882a593Smuzhiyunbeen discovered, so there is always likely to be obscure differences
*4882a593Smuzhiyunin the detailed behaviour of the emulator and a real 80486.
*4882a593Smuzhiyun
*4882a593Smuzhiyunwm-FPU-emu does not implement all of the behaviour of the 80486 FPU,
*4882a593Smuzhiyunbut is very close.  See "Limitations" later in this file for a list of
*4882a593Smuzhiyunsome differences.
*4882a593Smuzhiyun
*4882a593SmuzhiyunPlease report bugs, etc to me at:
*4882a593Smuzhiyun       billm@melbpc.org.au
*4882a593Smuzhiyunor     b.metzenthen@medoto.unimelb.edu.au
*4882a593Smuzhiyun
*4882a593SmuzhiyunFor more information on the emulator and on floating point topics, see
*4882a593Smuzhiyunmy web pages, currently at  http://www.suburbia.net/~billm/
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593Smuzhiyun--Bill Metzenthen
*4882a593Smuzhiyun  December 1999
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593Smuzhiyun----------------------- Internals of wm-FPU-emu -----------------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunNumeric algorithms:
*4882a593Smuzhiyun(1) Add, subtract, and multiply. Nothing remarkable in these.
*4882a593Smuzhiyun(2) Divide has been tuned to get reasonable performance. The algorithm
*4882a593Smuzhiyun    is not the obvious one which most people seem to use, but is designed
*4882a593Smuzhiyun    to take advantage of the characteristics of the 80386. I expect that
*4882a593Smuzhiyun    it has been invented many times before I discovered it, but I have not
*4882a593Smuzhiyun    seen it. It is based upon one of those ideas which one carries around
*4882a593Smuzhiyun    for years without ever bothering to check it out.
*4882a593Smuzhiyun(3) The sqrt function has been tuned to get good performance. It is based
*4882a593Smuzhiyun    upon Newton's classic method. Performance was improved by capitalizing
*4882a593Smuzhiyun    upon the properties of Newton's method, and the code is once again
*4882a593Smuzhiyun    structured taking account of the 80386 characteristics.
*4882a593Smuzhiyun(4) The trig, log, and exp functions are based in each case upon quasi-
*4882a593Smuzhiyun    "optimal" polynomial approximations. My definition of "optimal" was
*4882a593Smuzhiyun    based upon getting good accuracy with reasonable speed.
*4882a593Smuzhiyun(5) The argument reducing code for the trig function effectively uses
*4882a593Smuzhiyun    a value of pi which is accurate to more than 128 bits. As a consequence,
*4882a593Smuzhiyun    the reduced argument is accurate to more than 64 bits for arguments up
*4882a593Smuzhiyun    to a few pi, and accurate to more than 64 bits for most arguments,
*4882a593Smuzhiyun    even for arguments approaching 2^63. This is far superior to an
*4882a593Smuzhiyun    80486, which uses a value of pi which is accurate to 66 bits.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe code of the emulator is complicated slightly by the need to
*4882a593Smuzhiyunaccount for a limited form of re-entrancy. Normally, the emulator will
*4882a593Smuzhiyunemulate each FPU instruction to completion without interruption.
*4882a593SmuzhiyunHowever, it may happen that when the emulator is accessing the user
*4882a593Smuzhiyunmemory space, swapping may be needed. In this case the emulator may be
*4882a593Smuzhiyuntemporarily suspended while disk i/o takes place. During this time
*4882a593Smuzhiyunanother process may use the emulator, thereby perhaps changing static
*4882a593Smuzhiyunvariables. The code which accesses user memory is confined to five
*4882a593Smuzhiyunfiles:
*4882a593Smuzhiyun    fpu_entry.c
*4882a593Smuzhiyun    reg_ld_str.c
*4882a593Smuzhiyun    load_store.c
*4882a593Smuzhiyun    get_address.c
*4882a593Smuzhiyun    errors.c
*4882a593SmuzhiyunAs from version 1.12 of the emulator, no static variables are used
*4882a593Smuzhiyun(apart from those in the kernel's per-process tables). The emulator is
*4882a593Smuzhiyuntherefore now fully re-entrant, rather than having just the restricted
*4882a593Smuzhiyunform of re-entrancy which is required by the Linux kernel.
*4882a593Smuzhiyun
*4882a593Smuzhiyun----------------------- Limitations of wm-FPU-emu -----------------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunThere are a number of differences between the current wm-FPU-emu
*4882a593Smuzhiyun(version 2.01) and the 80486 FPU (apart from bugs).  The differences
*4882a593Smuzhiyunare fewer than those which applied to the 1.xx series of the emulator.
*4882a593SmuzhiyunSome of the more important differences are listed below:
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe Roundup flag does not have much meaning for the transcendental
*4882a593Smuzhiyunfunctions and its 80486 value with these functions is likely to differ
*4882a593Smuzhiyunfrom its emulator value.
*4882a593Smuzhiyun
*4882a593SmuzhiyunIn a few rare cases the Underflow flag obtained with the emulator will
*4882a593Smuzhiyunbe different from that obtained with an 80486. This occurs when the
*4882a593Smuzhiyunfollowing conditions apply simultaneously:
*4882a593Smuzhiyun(a) the operands have a higher precision than the current setting of the
*4882a593Smuzhiyun    precision control (PC) flags.
*4882a593Smuzhiyun(b) the underflow exception is masked.
*4882a593Smuzhiyun(c) the magnitude of the exact result (before rounding) is less than 2^-16382.
*4882a593Smuzhiyun(d) the magnitude of the final result (after rounding) is exactly 2^-16382.
*4882a593Smuzhiyun(e) the magnitude of the exact result would be exactly 2^-16382 if the
*4882a593Smuzhiyun    operands were rounded to the current precision before the arithmetic
*4882a593Smuzhiyun    operation was performed.
*4882a593SmuzhiyunIf all of these apply, the emulator will set the Underflow flag but a real
*4882a593Smuzhiyun80486 will not.
*4882a593Smuzhiyun
*4882a593SmuzhiyunNOTE: Certain formats of Extended Real are UNSUPPORTED. They are
*4882a593Smuzhiyununsupported by the 80486. They are the Pseudo-NaNs, Pseudoinfinities,
*4882a593Smuzhiyunand Unnormals. None of these will be generated by an 80486 or by the
*4882a593Smuzhiyunemulator. Do not use them. The emulator treats them differently in
*4882a593Smuzhiyundetail from the way an 80486 does.
*4882a593Smuzhiyun
*4882a593SmuzhiyunSelf modifying code can cause the emulator to fail. An example of such
*4882a593Smuzhiyuncode is:
*4882a593Smuzhiyun          movl %esp,[%ebx]
*4882a593Smuzhiyun	  fld1
*4882a593SmuzhiyunThe FPU instruction may be (usually will be) loaded into the pre-fetch
*4882a593Smuzhiyunqueue of the CPU before the mov instruction is executed. If the
*4882a593Smuzhiyundestination of the 'movl' overlaps the FPU instruction then the bytes
*4882a593Smuzhiyunin the prefetch queue and memory will be inconsistent when the FPU
*4882a593Smuzhiyuninstruction is executed. The emulator will be invoked but will not be
*4882a593Smuzhiyunable to find the instruction which caused the device-not-present
*4882a593Smuzhiyunexception. For this case, the emulator cannot emulate the behaviour of
*4882a593Smuzhiyunan 80486DX.
*4882a593Smuzhiyun
*4882a593SmuzhiyunHandling of the address size override prefix byte (0x67) has not been
*4882a593Smuzhiyunextensively tested yet. A major problem exists because using it in
*4882a593Smuzhiyunvm86 mode can cause a general protection fault. Address offsets
*4882a593Smuzhiyungreater than 0xffff appear to be illegal in vm86 mode but are quite
*4882a593Smuzhiyunacceptable (and work) in real mode. A small test program developed to
*4882a593Smuzhiyuncheck the addressing, and which runs successfully in real mode,
*4882a593Smuzhiyuncrashes dosemu under Linux and also brings Windows down with a general
*4882a593Smuzhiyunprotection fault message when run under the MS-DOS prompt of Windows
*4882a593Smuzhiyun3.1. (The program simply reads data from a valid address).
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe emulator supports 16-bit protected mode, with one difference from
*4882a593Smuzhiyunan 80486DX.  A 80486DX will allow some floating point instructions to
*4882a593Smuzhiyunwrite a few bytes below the lowest address of the stack.  The emulator
*4882a593Smuzhiyunwill not allow this in 16-bit protected mode: no instructions are
*4882a593Smuzhiyunallowed to write outside the bounds set by the protection.
*4882a593Smuzhiyun
*4882a593Smuzhiyun----------------------- Performance of wm-FPU-emu -----------------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunSpeed.
*4882a593Smuzhiyun-----
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe speed of floating point computation with the emulator will depend
*4882a593Smuzhiyunupon instruction mix. Relative performance is best for the instructions
*4882a593Smuzhiyunwhich require most computation. The simple instructions are adversely
*4882a593Smuzhiyunaffected by the FPU instruction trap overhead.
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunTiming: Some simple timing tests have been made on the emulator functions.
*4882a593SmuzhiyunThe times include load/store instructions. All times are in microseconds
*4882a593Smuzhiyunmeasured on a 33MHz 386 with 64k cache. The Turbo C tests were under
*4882a593Smuzhiyunms-dos, the next two columns are for emulators running with the djgpp
*4882a593Smuzhiyunms-dos extender. The final column is for wm-FPU-emu in Linux 0.97,
*4882a593Smuzhiyunusing libm4.0 (hard).
*4882a593Smuzhiyun
*4882a593Smuzhiyunfunction      Turbo C        djgpp 1.06        WM-emu387     wm-FPU-emu
*4882a593Smuzhiyun
*4882a593Smuzhiyun   +          60.5           154.8              76.5          139.4
*4882a593Smuzhiyun   -          61.1-65.5      157.3-160.8        76.2-79.5     142.9-144.7
*4882a593Smuzhiyun   *          71.0           190.8              79.6          146.6
*4882a593Smuzhiyun   /          61.2-75.0      261.4-266.9        75.3-91.6     142.2-158.1
*4882a593Smuzhiyun
*4882a593Smuzhiyun sin()        310.8          4692.0            319.0          398.5
*4882a593Smuzhiyun cos()        284.4          4855.2            308.0          388.7
*4882a593Smuzhiyun tan()        495.0          8807.1            394.9          504.7
*4882a593Smuzhiyun atan()       328.9          4866.4            601.1          419.5-491.9
*4882a593Smuzhiyun
*4882a593Smuzhiyun sqrt()       128.7          crashed           145.2          227.0
*4882a593Smuzhiyun log()        413.1-419.1    5103.4-5354.21    254.7-282.2    409.4-437.1
*4882a593Smuzhiyun exp()        479.1          6619.2            469.1          850.8
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe performance under Linux is improved by the use of look-ahead code.
*4882a593SmuzhiyunThe following results show the improvement which is obtained under
*4882a593SmuzhiyunLinux due to the look-ahead code. Also given are the times for the
*4882a593Smuzhiyunoriginal Linux emulator with the 4.1 'soft' lib.
*4882a593Smuzhiyun
*4882a593Smuzhiyun [ Linus' note: I changed look-ahead to be the default under linux, as
*4882a593Smuzhiyun   there was no reason not to use it after I had edited it to be
*4882a593Smuzhiyun   disabled during tracing ]
*4882a593Smuzhiyun
*4882a593Smuzhiyun            wm-FPU-emu w     original w
*4882a593Smuzhiyun            look-ahead       'soft' lib
*4882a593Smuzhiyun   +         106.4             190.2
*4882a593Smuzhiyun   -         108.6-111.6      192.4-216.2
*4882a593Smuzhiyun   *         113.4             193.1
*4882a593Smuzhiyun   /         108.8-124.4      700.1-706.2
*4882a593Smuzhiyun
*4882a593Smuzhiyun sin()       390.5            2642.0
*4882a593Smuzhiyun cos()       381.5            2767.4
*4882a593Smuzhiyun tan()       496.5            3153.3
*4882a593Smuzhiyun atan()      367.2-435.5     2439.4-3396.8
*4882a593Smuzhiyun
*4882a593Smuzhiyun sqrt()      195.1            4732.5
*4882a593Smuzhiyun log()       358.0-387.5     3359.2-3390.3
*4882a593Smuzhiyun exp()       619.3            4046.4
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese figures are now somewhat out-of-date. The emulator has become
*4882a593Smuzhiyunprogressively slower for most functions as more of the 80486 features
*4882a593Smuzhiyunhave been implemented.
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593Smuzhiyun----------------------- Accuracy of wm-FPU-emu -----------------------
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe accuracy of the emulator is in almost all cases equal to or better
*4882a593Smuzhiyunthan that of an Intel 80486 FPU.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe results of the basic arithmetic functions (+,-,*,/), and fsqrt
*4882a593Smuzhiyunmatch those of an 80486 FPU. They are the best possible; the error for
*4882a593Smuzhiyunthese never exceeds 1/2 an lsb. The fprem and fprem1 instructions
*4882a593Smuzhiyunreturn exact results; they have no error.
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe following table compares the emulator accuracy for the sqrt(),
*4882a593Smuzhiyuntrig and log functions against the Turbo C "emulator". For this table,
*4882a593Smuzhiyuneach function was tested at about 400 points. Ideal worst-case results
*4882a593Smuzhiyunwould be 64 bits. The reduced Turbo C accuracy of cos() and tan() for
*4882a593Smuzhiyunarguments greater than pi/4 can be thought of as being related to the
*4882a593Smuzhiyunprecision of the argument x; e.g. an argument of pi/2-(1e-10) which is
*4882a593Smuzhiyunaccurate to 64 bits can result in a relative accuracy in cos() of
*4882a593Smuzhiyunabout 64 + log2(cos(x)) = 31 bits.
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunFunction      Tested x range            Worst result                Turbo C
*4882a593Smuzhiyun                                        (relative bits)
*4882a593Smuzhiyun
*4882a593Smuzhiyunsqrt(x)       1 .. 2                    64.1                         63.2
*4882a593Smuzhiyunatan(x)       1e-10 .. 200              64.2                         62.8
*4882a593Smuzhiyuncos(x)        0 .. pi/2-(1e-10)         64.4 (x <= pi/4)             62.4
*4882a593Smuzhiyun                                        64.1 (x = pi/2-(1e-10))      31.9
*4882a593Smuzhiyunsin(x)        1e-10 .. pi/2             64.0                         62.8
*4882a593Smuzhiyuntan(x)        1e-10 .. pi/2-(1e-10)     64.0 (x <= pi/4)             62.1
*4882a593Smuzhiyun                                        64.1 (x = pi/2-(1e-10))      31.9
*4882a593Smuzhiyunexp(x)        0 .. 1                    63.1 **                      62.9
*4882a593Smuzhiyunlog(x)        1+1e-6 .. 2               63.8 **                      62.1
*4882a593Smuzhiyun
*4882a593Smuzhiyun** The accuracy for exp() and log() is low because the FPU (emulator)
*4882a593Smuzhiyundoes not compute them directly; two operations are required.
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe emulator passes the "paranoia" tests (compiled with gcc 2.3.3 or
*4882a593Smuzhiyunlater) for 'float' variables (24 bit precision numbers) when precision
*4882a593Smuzhiyuncontrol is set to 24, 53 or 64 bits, and for 'double' variables (53
*4882a593Smuzhiyunbit precision numbers) when precision control is set to 53 bits (a
*4882a593Smuzhiyunproperly performing FPU cannot pass the 'paranoia' tests for 'double'
*4882a593Smuzhiyunvariables when precision control is set to 64 bits).
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe code for reducing the argument for the trig functions (fsin, fcos,
*4882a593Smuzhiyunfptan and fsincos) has been improved and now effectively uses a value
*4882a593Smuzhiyunfor pi which is accurate to more than 128 bits precision. As a
*4882a593Smuzhiyunconsequence, the accuracy of these functions for large arguments has
*4882a593Smuzhiyunbeen dramatically improved (and is now very much better than an 80486
*4882a593SmuzhiyunFPU). There is also now no degradation of accuracy for fcos and fptan
*4882a593Smuzhiyunfor operands close to pi/2. Measured results are (note that the
*4882a593Smuzhiyundefinition of accuracy has changed slightly from that used for the
*4882a593Smuzhiyunabove table):
*4882a593Smuzhiyun
*4882a593SmuzhiyunFunction      Tested x range          Worst result
*4882a593Smuzhiyun                                     (absolute bits)
*4882a593Smuzhiyun
*4882a593Smuzhiyuncos(x)        0 .. 9.22e+18              62.0
*4882a593Smuzhiyunsin(x)        1e-16 .. 9.22e+18          62.1
*4882a593Smuzhiyuntan(x)        1e-16 .. 9.22e+18          61.8
*4882a593Smuzhiyun
*4882a593SmuzhiyunIt is possible with some effort to find very large arguments which
*4882a593Smuzhiyungive much degraded precision. For example, the integer number
*4882a593Smuzhiyun           8227740058411162616.0
*4882a593Smuzhiyunis within about 10e-7 of a multiple of pi. To find the tan (for
*4882a593Smuzhiyunexample) of this number to 64 bits precision it would be necessary to
*4882a593Smuzhiyunhave a value of pi which had about 150 bits precision. The FPU
*4882a593Smuzhiyunemulator computes the result to about 42.6 bits precision (the correct
*4882a593Smuzhiyunresult is about -9.739715e-8). On the other hand, an 80486 FPU returns
*4882a593Smuzhiyun0.01059, which in relative terms is hopelessly inaccurate.
*4882a593Smuzhiyun
*4882a593SmuzhiyunFor arguments close to critical angles (which occur at multiples of
*4882a593Smuzhiyunpi/2) the emulator is more accurate than an 80486 FPU. For very large
*4882a593Smuzhiyunarguments, the emulator is far more accurate.
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunPrior to version 1.20 of the emulator, the accuracy of the results for
*4882a593Smuzhiyunthe transcendental functions (in their principal range) was not as
*4882a593Smuzhiyungood as the results from an 80486 FPU. From version 1.20, the accuracy
*4882a593Smuzhiyunhas been considerably improved and these functions now give measured
*4882a593Smuzhiyunworst-case results which are better than the worst-case results given
*4882a593Smuzhiyunby an 80486 FPU.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe following table gives the measured results for the emulator. The
*4882a593Smuzhiyunnumber of randomly selected arguments in each case is about half a
*4882a593Smuzhiyunmillion.  The group of three columns gives the frequency of the given
*4882a593Smuzhiyunaccuracy in number of times per million, thus the second of these
*4882a593Smuzhiyuncolumns shows that an accuracy of between 63.80 and 63.89 bits was
*4882a593Smuzhiyunfound at a rate of 133 times per one million measurements for fsin.
*4882a593SmuzhiyunThe results show that the fsin, fcos and fptan instructions return
*4882a593Smuzhiyunresults which are in error (i.e. less accurate than the best possible
*4882a593Smuzhiyunresult (which is 64 bits)) for about one per cent of all arguments
*4882a593Smuzhiyunbetween -pi/2 and +pi/2.  The other instructions have a lower
*4882a593Smuzhiyunfrequency of results which are in error.  The last two columns give
*4882a593Smuzhiyunthe worst accuracy which was found (in bits) and the approximate value
*4882a593Smuzhiyunof the argument which produced it.
*4882a593Smuzhiyun
*4882a593Smuzhiyun                                frequency (per M)
*4882a593Smuzhiyun                               -------------------   ---------------
*4882a593Smuzhiyuninstr   arg range    # tests   63.7   63.8    63.9   worst   at arg
*4882a593Smuzhiyun                               bits   bits    bits    bits
*4882a593Smuzhiyun-----  ------------  -------   ----   ----   -----   -----  --------
*4882a593Smuzhiyunfsin     (0,pi/2)     547756      0    133   10673   63.89  0.451317
*4882a593Smuzhiyunfcos     (0,pi/2)     547563      0    126   10532   63.85  0.700801
*4882a593Smuzhiyunfptan    (0,pi/2)     536274     11    267   10059   63.74  0.784876
*4882a593Smuzhiyunfpatan  4 quadrants   517087      0      8    1855   63.88  0.435121 (4q)
*4882a593Smuzhiyunfyl2x     (0,20)      541861      0      0    1323   63.94  1.40923  (x)
*4882a593Smuzhiyunfyl2xp1 (-.293,.414)  520256      0      0    5678   63.93  0.408542 (x)
*4882a593Smuzhiyunf2xm1     (-1,1)      538847      4    481    6488   63.79  0.167709
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593SmuzhiyunTests performed on an 80486 FPU showed results of lower accuracy. The
*4882a593Smuzhiyunfollowing table gives the results which were obtained with an AMD
*4882a593Smuzhiyun486DX2/66 (other tests indicate that an Intel 486DX produces
*4882a593Smuzhiyunidentical results).  The tests were basically the same as those used
*4882a593Smuzhiyunto measure the emulator (the values, being random, were in general not
*4882a593Smuzhiyunthe same).  The total number of tests for each instruction are given
*4882a593Smuzhiyunat the end of the table, in case each about 100k tests were performed.
*4882a593SmuzhiyunAnother line of figures at the end of the table shows that most of the
*4882a593Smuzhiyuninstructions return results which are in error for more than 10
*4882a593Smuzhiyunpercent of the arguments tested.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe numbers in the body of the table give the approx number of times a
*4882a593Smuzhiyunresult of the given accuracy in bits (given in the left-most column)
*4882a593Smuzhiyunwas obtained per one million arguments. For three of the instructions,
*4882a593Smuzhiyuntwo columns of results are given: * The second column for f2xm1 gives
*4882a593Smuzhiyunthe number cases where the results of the first column were for a
*4882a593Smuzhiyunpositive argument, this shows that this instruction gives better
*4882a593Smuzhiyunresults for positive arguments than it does for negative.  * In the
*4882a593Smuzhiyuncases of fcos and fptan, the first column gives the results when all
*4882a593Smuzhiyuncases where arguments greater than 1.5 were removed from the results
*4882a593Smuzhiyungiven in the second column. Unlike the emulator, an 80486 FPU returns
*4882a593Smuzhiyunresults of relatively poor accuracy for these instructions when the
*4882a593Smuzhiyunargument approaches pi/2. The table does not show those cases when the
*4882a593Smuzhiyunaccuracy of the results were less than 62 bits, which occurs quite
*4882a593Smuzhiyunoften for fsin and fptan when the argument approaches pi/2. This poor
*4882a593Smuzhiyunaccuracy is discussed above in relation to the Turbo C "emulator", and
*4882a593Smuzhiyunthe accuracy of the value of pi.
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593Smuzhiyunbits   f2xm1  f2xm1 fpatan   fcos   fcos  fyl2x fyl2xp1  fsin  fptan  fptan
*4882a593Smuzhiyun62.0       0      0      0      0    437      0      0      0      0    925
*4882a593Smuzhiyun62.1       0      0     10      0    894      0      0      0      0   1023
*4882a593Smuzhiyun62.2      14      0      0      0   1033      0      0      0      0    945
*4882a593Smuzhiyun62.3      57      0      0      0   1202      0      0      0      0   1023
*4882a593Smuzhiyun62.4     385      0      0     10   1292      0     23      0      0   1178
*4882a593Smuzhiyun62.5    1140      0      0    119   1649      0     39      0      0   1149
*4882a593Smuzhiyun62.6    2037      0      0    189   1620      0     16      0      0   1169
*4882a593Smuzhiyun62.7    5086     14      0    646   2315     10    101     35     39   1402
*4882a593Smuzhiyun62.8    8818     86      0    984   3050     59    287    131    224   2036
*4882a593Smuzhiyun62.9   11340   1355      0   2126   4153     79    605    357    321   1948
*4882a593Smuzhiyun63.0   15557   4750      0   3319   5376    246   1281    862    808   2688
*4882a593Smuzhiyun63.1   20016   8288      0   4620   6628    511   2569   1723   1510   3302
*4882a593Smuzhiyun63.2   24945  11127     10   6588   8098   1120   4470   2968   2990   4724
*4882a593Smuzhiyun63.3   25686  12382     69   8774  10682   1906   6775   4482   5474   7236
*4882a593Smuzhiyun63.4   29219  14722     79  11109  12311   3094   9414   7259   8912  10587
*4882a593Smuzhiyun63.5   30458  14936    393  13802  15014   5874  12666   9609  13762  15262
*4882a593Smuzhiyun63.6   32439  16448   1277  17945  19028  10226  15537  14657  19158  20346
*4882a593Smuzhiyun63.7   35031  16805   4067  23003  23947  18910  20116  21333  25001  26209
*4882a593Smuzhiyun63.8   33251  15820   7673  24781  25675  24617  25354  24440  29433  30329
*4882a593Smuzhiyun63.9   33293  16833  18529  28318  29233  31267  31470  27748  29676  30601
*4882a593Smuzhiyun
*4882a593SmuzhiyunPer cent with error:
*4882a593Smuzhiyun        30.9           3.2          18.5    9.8   13.1   11.6          17.4
*4882a593SmuzhiyunTotal arguments tested:
*4882a593Smuzhiyun       70194  70099 101784 100641 100641 101799 128853 114893 102675 102675
*4882a593Smuzhiyun
*4882a593Smuzhiyun
*4882a593Smuzhiyun------------------------- Contributors -------------------------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunA number of people have contributed to the development of the
*4882a593Smuzhiyunemulator, often by just reporting bugs, sometimes with suggested
*4882a593Smuzhiyunfixes, and a few kind people have provided me with access in one way
*4882a593Smuzhiyunor another to an 80486 machine. Contributors include (to those people
*4882a593Smuzhiyunwho I may have forgotten, please forgive me):
*4882a593Smuzhiyun
*4882a593SmuzhiyunLinus Torvalds
*4882a593SmuzhiyunTommy.Thorn@daimi.aau.dk
*4882a593SmuzhiyunAndrew.Tridgell@anu.edu.au
*4882a593SmuzhiyunNick Holloway, alfie@dcs.warwick.ac.uk
*4882a593SmuzhiyunHermano Moura, moura@dcs.gla.ac.uk
*4882a593SmuzhiyunJon Jagger, J.Jagger@scp.ac.uk
*4882a593SmuzhiyunLennart Benschop
*4882a593SmuzhiyunBrian Gallew, geek+@CMU.EDU
*4882a593SmuzhiyunThomas Staniszewski, ts3v+@andrew.cmu.edu
*4882a593SmuzhiyunMartin Howell, mph@plasma.apana.org.au
*4882a593SmuzhiyunM Saggaf, alsaggaf@athena.mit.edu
*4882a593SmuzhiyunPeter Barker, PETER@socpsy.sci.fau.edu
*4882a593Smuzhiyuntom@vlsivie.tuwien.ac.at
*4882a593SmuzhiyunDan Russel, russed@rpi.edu
*4882a593SmuzhiyunDaniel Carosone, danielce@ee.mu.oz.au
*4882a593Smuzhiyuncae@jpmorgan.com
*4882a593SmuzhiyunHamish Coleman, t933093@minyos.xx.rmit.oz.au
*4882a593SmuzhiyunBruce Evans, bde@kralizec.zeta.org.au
*4882a593SmuzhiyunTimo Korvola, Timo.Korvola@hut.fi
*4882a593SmuzhiyunRick Lyons, rick@razorback.brisnet.org.au
*4882a593SmuzhiyunRick, jrs@world.std.com
*4882a593Smuzhiyun
*4882a593Smuzhiyun...and numerous others who responded to my request for help with
*4882a593Smuzhiyuna real 80486.
*4882a593Smuzhiyun