arm/nwfpe/todo.rst

*4882a593SmuzhiyunTODO LIST
*4882a593Smuzhiyun=========
*4882a593Smuzhiyun
*4882a593Smuzhiyun::
*4882a593Smuzhiyun
*4882a593Smuzhiyun  POW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - power
*4882a593Smuzhiyun  RPW{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - reverse power
*4882a593Smuzhiyun  POL{cond}<S|D|E>{P,M,Z} Fd, Fn, <Fm,#value> - polar angle (arctan2)
*4882a593Smuzhiyun
*4882a593Smuzhiyun  LOG{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base 10
*4882a593Smuzhiyun  LGN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - logarithm to base e
*4882a593Smuzhiyun  EXP{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - exponent
*4882a593Smuzhiyun  SIN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - sine
*4882a593Smuzhiyun  COS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - cosine
*4882a593Smuzhiyun  TAN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - tangent
*4882a593Smuzhiyun  ASN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arcsine
*4882a593Smuzhiyun  ACS{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arccosine
*4882a593Smuzhiyun  ATN{cond}<S|D|E>{P,M,Z} Fd, <Fm,#value> - arctangent
*4882a593Smuzhiyun
*4882a593SmuzhiyunThese are not implemented.  They are not currently issued by the compiler,
*4882a593Smuzhiyunand are handled by routines in libc.  These are not implemented by the FPA11
*4882a593Smuzhiyunhardware, but are handled by the floating point support code.  They should
*4882a593Smuzhiyunbe implemented in future versions.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThere are a couple of ways to approach the implementation of these.  One
*4882a593Smuzhiyunmethod would be to use accurate table methods for these routines.  I have
*4882a593Smuzhiyuna couple of papers by S. Gal from IBM's research labs in Haifa, Israel that
*4882a593Smuzhiyunseem to promise extreme accuracy (in the order of 99.8%) and reasonable speed.
*4882a593SmuzhiyunThese methods are used in GLIBC for some of the transcendental functions.
*4882a593Smuzhiyun
*4882a593SmuzhiyunAnother approach, which I know little about is CORDIC.  This stands for
*4882a593SmuzhiyunCoordinate Rotation Digital Computer, and is a method of computing
*4882a593Smuzhiyuntranscendental functions using mostly shifts and adds and a few
*4882a593Smuzhiyunmultiplications and divisions.  The ARM excels at shifts and adds,
*4882a593Smuzhiyunso such a method could be promising, but requires more research to
*4882a593Smuzhiyundetermine if it is feasible.
*4882a593Smuzhiyun
*4882a593SmuzhiyunRounding Methods
*4882a593Smuzhiyun----------------
*4882a593Smuzhiyun
*4882a593SmuzhiyunThe IEEE standard defines 4 rounding modes.  Round to nearest is the
*4882a593Smuzhiyundefault, but rounding to + or - infinity or round to zero are also allowed.
*4882a593SmuzhiyunMany architectures allow the rounding mode to be specified by modifying bits
*4882a593Smuzhiyunin a control register.  Not so with the ARM FPA11 architecture.  To change
*4882a593Smuzhiyunthe rounding mode one must specify it with each instruction.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThis has made porting some benchmarks difficult.  It is possible to
*4882a593Smuzhiyunintroduce such a capability into the emulator.  The FPCR contains
*4882a593Smuzhiyunbits describing the rounding mode.  The emulator could be altered to
*4882a593Smuzhiyunexamine a flag, which if set forced it to ignore the rounding mode in
*4882a593Smuzhiyunthe instruction, and use the mode specified in the bits in the FPCR.
*4882a593Smuzhiyun
*4882a593SmuzhiyunThis would require a method of getting/setting the flag, and the bits
*4882a593Smuzhiyunin the FPCR.  This requires a kernel call in ArmLinux, as WFC/RFC are
*4882a593Smuzhiyunsupervisor only instructions.  If anyone has any ideas or comments I
*4882a593Smuzhiyunwould like to hear them.
*4882a593Smuzhiyun
*4882a593SmuzhiyunNOTE:
*4882a593Smuzhiyun pulled out from some docs on ARM floating point, specifically
*4882a593Smuzhiyun for the Acorn FPE, but not limited to it:
*4882a593Smuzhiyun
*4882a593Smuzhiyun The floating point control register (FPCR) may only be present in some
*4882a593Smuzhiyun implementations: it is there to control the hardware in an implementation-
*4882a593Smuzhiyun specific manner, for example to disable the floating point system.  The user
*4882a593Smuzhiyun mode of the ARM is not permitted to use this register (since the right is
*4882a593Smuzhiyun reserved to alter it between implementations) and the WFC and RFC
*4882a593Smuzhiyun instructions will trap if tried in user mode.
*4882a593Smuzhiyun
*4882a593Smuzhiyun Hence, the answer is yes, you could do this, but then you will run a high
*4882a593Smuzhiyun risk of becoming isolated if and when hardware FP emulation comes out
*4882a593Smuzhiyun
*4882a593Smuzhiyun		-- Russell.