arm/nwfpe/netwinder-fpe.rst

*4882a593Smuzhiyun=============
*4882a593SmuzhiyunCurrent State
*4882a593Smuzhiyun=============
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe following describes the current state of the NetWinder's floating point
*4882a593Smuzhiyunemulator.
*4882a593Smuzhiyun
*4882a593SmuzhiyunIn the following nomenclature is used to describe the floating point
*4882a593Smuzhiyuninstructions.  It follows the conventions in the ARM manual.
*4882a593Smuzhiyun
*4882a593Smuzhiyun::
*4882a593Smuzhiyun
*4882a593Smuzhiyun  <S|D|E> = <single|double|extended>, no default
*4882a593Smuzhiyun  {P|M|Z} = {round to +infinity,round to -infinity,round to zero},
*4882a593Smuzhiyun            default = round to nearest
*4882a593Smuzhiyun
*4882a593SmuzhiyunNote: items enclosed in {} are optional.
*4882a593Smuzhiyun
*4882a593SmuzhiyunFloating Point Coprocessor Data Transfer Instructions (CPDT)
*4882a593Smuzhiyun------------------------------------------------------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunLDF/STF - load and store floating
*4882a593Smuzhiyun
*4882a593Smuzhiyun<LDF|STF>{cond}<S|D|E> Fd, Rn
*4882a593Smuzhiyun<LDF|STF>{cond}<S|D|E> Fd, [Rn, #<expression>]{!}
*4882a593Smuzhiyun<LDF|STF>{cond}<S|D|E> Fd, [Rn], #<expression>
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese instructions are fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunLFM/SFM - load and store multiple floating
*4882a593Smuzhiyun
*4882a593SmuzhiyunForm 1 syntax:
*4882a593Smuzhiyun<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn]
*4882a593Smuzhiyun<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn, #<expression>]{!}
*4882a593Smuzhiyun<LFM|SFM>{cond}<S|D|E> Fd, <count>, [Rn], #<expression>
*4882a593Smuzhiyun
*4882a593SmuzhiyunForm 2 syntax:
*4882a593Smuzhiyun<LFM|SFM>{cond}<FD,EA> Fd, <count>, [Rn]{!}
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese instructions are fully implemented.  They store/load three words
*4882a593Smuzhiyunfor each floating point register into the memory location given in the
*4882a593Smuzhiyuninstruction.  The format in memory is unlikely to be compatible with
*4882a593Smuzhiyunother implementations, in particular the actual hardware.  Specific
*4882a593Smuzhiyunmention of this is made in the ARM manuals.
*4882a593Smuzhiyun
*4882a593SmuzhiyunFloating Point Coprocessor Register Transfer Instructions (CPRT)
*4882a593Smuzhiyun----------------------------------------------------------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunConversions, read/write status/control register instructions
*4882a593Smuzhiyun
*4882a593SmuzhiyunFLT{cond}<S,D,E>{P,M,Z} Fn, Rd          Convert integer to floating point
*4882a593SmuzhiyunFIX{cond}{P,M,Z} Rd, Fn                 Convert floating point to integer
*4882a593SmuzhiyunWFS{cond} Rd                            Write floating point status register
*4882a593SmuzhiyunRFS{cond} Rd                            Read floating point status register
*4882a593SmuzhiyunWFC{cond} Rd                            Write floating point control register
*4882a593SmuzhiyunRFC{cond} Rd                            Read floating point control register
*4882a593Smuzhiyun
*4882a593SmuzhiyunFLT/FIX are fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunRFS/WFS are fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunRFC/WFC are fully implemented.  RFC/WFC are supervisor only instructions, and
*4882a593Smuzhiyunpresently check the CPU mode, and do an invalid instruction trap if not called
*4882a593Smuzhiyunfrom supervisor mode.
*4882a593Smuzhiyun
*4882a593SmuzhiyunCompare instructions
*4882a593Smuzhiyun
*4882a593SmuzhiyunCMF{cond} Fn, Fm        Compare floating
*4882a593SmuzhiyunCMFE{cond} Fn, Fm       Compare floating with exception
*4882a593SmuzhiyunCNF{cond} Fn, Fm        Compare negated floating
*4882a593SmuzhiyunCNFE{cond} Fn, Fm       Compare negated floating with exception
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunFloating Point Coprocessor Data Instructions (CPDT)
*4882a593Smuzhiyun---------------------------------------------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunDyadic operations:
*4882a593Smuzhiyun
*4882a593SmuzhiyunADF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - add
*4882a593SmuzhiyunSUF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - subtract
*4882a593SmuzhiyunRSF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse subtract
*4882a593SmuzhiyunMUF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - multiply
*4882a593SmuzhiyunDVF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - divide
*4882a593SmuzhiyunRDV{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse divide
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunFML{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast multiply
*4882a593SmuzhiyunFDV{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast divide
*4882a593SmuzhiyunFRD{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - fast reverse divide
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are fully implemented as well.  They use the same algorithm as the
*4882a593Smuzhiyunnon-fast versions.  Hence, in this implementation their performance is
*4882a593Smuzhiyunequivalent to the MUF/DVF/RDV instructions.  This is acceptable according
*4882a593Smuzhiyunto the ARM manual.  The manual notes these are defined only for single
*4882a593Smuzhiyunoperands, on the actual FPA11 hardware they do not work for double or
*4882a593Smuzhiyunextended precision operands.  The emulator currently does not check
*4882a593Smuzhiyunthe requested permissions conditions, and performs the requested operation.
*4882a593Smuzhiyun
*4882a593SmuzhiyunRMF{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - IEEE remainder
*4882a593Smuzhiyun
*4882a593SmuzhiyunThis is fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunMonadic operations:
*4882a593Smuzhiyun
*4882a593SmuzhiyunMVF{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - move
*4882a593SmuzhiyunMNF{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - move negated
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunABS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - absolute value
*4882a593SmuzhiyunSQT{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - square root
*4882a593SmuzhiyunRND{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - round
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are fully implemented.
*4882a593Smuzhiyun
*4882a593SmuzhiyunURD{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - unnormalized round
*4882a593SmuzhiyunNRM{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - normalize
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are implemented.  URD is implemented using the same code as the RND
*4882a593Smuzhiyuninstruction.  Since URD cannot return a unnormalized number, NRM becomes
*4882a593Smuzhiyuna NOP.
*4882a593Smuzhiyun
*4882a593SmuzhiyunLibrary calls:
*4882a593Smuzhiyun
*4882a593SmuzhiyunPOW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - power
*4882a593SmuzhiyunRPW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse power
*4882a593SmuzhiyunPOL{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - polar angle (arctan2)
*4882a593Smuzhiyun
*4882a593SmuzhiyunLOG{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base 10
*4882a593SmuzhiyunLGN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base e
*4882a593SmuzhiyunEXP{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - exponent
*4882a593SmuzhiyunSIN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - sine
*4882a593SmuzhiyunCOS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - cosine
*4882a593SmuzhiyunTAN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - tangent
*4882a593SmuzhiyunASN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arcsine
*4882a593SmuzhiyunACS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arccosine
*4882a593SmuzhiyunATN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arctangent
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are not implemented.  They are not currently issued by the compiler,
*4882a593Smuzhiyunand are handled by routines in libc.  These are not implemented by the FPA11
*4882a593Smuzhiyunhardware, but are handled by the floating point support code.  They should
*4882a593Smuzhiyunbe implemented in future versions.
*4882a593Smuzhiyun
*4882a593SmuzhiyunSignalling:
*4882a593Smuzhiyun
*4882a593SmuzhiyunSignals are implemented.  However current ELF kernels produced by Rebel.com
*4882a593Smuzhiyunhave a bug in them that prevents the module from generating a SIGFPE.  This
*4882a593Smuzhiyunis caused by a failure to alias fp_current to the kernel variable
*4882a593Smuzhiyuncurrent_set[0] correctly.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe kernel provided with this distribution (vmlinux-nwfpe-0.93) contains
*4882a593Smuzhiyuna fix for this problem and also incorporates the current version of the
*4882a593Smuzhiyunemulator directly.  It is possible to run with no floating point module
*4882a593Smuzhiyunloaded with this kernel.  It is provided as a demonstration of the
*4882a593Smuzhiyuntechnology and for those who want to do floating point work that depends
*4882a593Smuzhiyunon signals.  It is not strictly necessary to use the module.
*4882a593Smuzhiyun
*4882a593SmuzhiyunA module (either the one provided by Russell King, or the one in this
*4882a593Smuzhiyundistribution) can be loaded to replace the functionality of the emulator
*4882a593Smuzhiyunbuilt into the kernel.