14 KiB
FastTrig
Arduino library with interpolated lookup for sin() and cos(). Trades speed for accuracy.
Description
Warning: The library trades speed for accuracy so use at own risk
So please, verify the performance and accuracy to see if they meet the requirements of your project.
The library provides one lookup table that is used for isin(degrees) and icos(degrees) and itan(degrees). This lookup table is optimized for interpolation so the values for whole degrees are not optimal. Furthermore the itan() on AVR has almost no performance gain over the regular tan() so on AVR one is advised to use tan(). On ESP32 the itan(degrees) does have a serious performance gain so use it if you need speed.
These functions are to be used as replacements for sin(radians), cos(radians) and tan(radians). Important to know is that they are NOT direct replaceable as the parameter differs a factor (PI/180.0) or its inverse.
Similar to cos(x) == sin(x + PI)
it is also true that icos(x) == isin(x + 90)
,
so icos() can use the very same lookup table at the cost of a single addition.
In fact it uses icos(x) == isin(x - 270)
as that performs better, due to the folding.
The i in the names stands for int and interpolated as the core is
using integer math and lookup table of 91 uint16_t = 182 bytes.
By folding and mirroring the whole 360 degrees and beyond can be handled.
When isin(float x) is called and x == int(x)
then the library will not interpolate.
This will improve performance even more.
When x is not a whole number the library will linear interpolate
between isin(int(x)) and isin(int(x+1)).
Of course this introduces an error but the error is small and performance is still
quite fast (which was the goal).
Interface
#include "FastTrig.h"
Lookup tables
The lookup tables are optimized (sketch provided) to minimize the error when using the interpolation, this implies that the points in the table might not be optimal when you use only whole degrees. A sketch that generates lookup tables is in the examples folder. This generator sketch can also generate tables with different resolution e.g. 24, 14, 12 or even 6, 5 or 4 bit lookup tables. So depending on the application these tables can be ideal, but verify they meet your requirements.
The lookup tables used by isin() can be used directly in your program, the names are:
- sinTable16[] index 0..90, values need to be (float) divided by 65535.0
- sinTable8[] index 0..90, values need to be (float) divided by 255.0
The sinTable8 is not really for doing accurate math, however it is great to use in a LEDstrip or motor movements when less accuracy is needed.
Although the tables can be written to, it is advised not to do so.
OK, the optimize example does a write to improve the table to minimize errors
atan, atan2
Since version 0.2.1 two functions are added:
- float atanFast(float f) input range -1 .. 1 is faster. Returns -PI/2 .. PI/2
- float atan2Fast(float y, float x) all input possible except (0, 0). This (0,0) singularity returns NAN. Returns -PI .. PI for all other values.
These functions do not use a lookup table but are faster than the default atan() and atan2() functions. Use fastTrig_atan_performance.ino to check the gain on your board. Price is that the values are less accurate, but the difference is < 0.001.
isin256, icos256, isincos256
Version 0.3.0 added these experimental functions:
- int isin256(uint32_t v) accepts only positive angles in degrees. Returns the sin(v)*256 to keep all math integer (shift 8 later to correct value)
- int icos256(uint32_t v) accepts only positive angles in degrees. Returns the cos(v)*256 to keep all math integer (shift 8 later to correct value)
- int isincos256(uint32_t v, int *si, int *co) accepts only positive angles in degrees. returns both the sin(v)*256 and the cos(v)*256 of the same angle. Faster than both individual calls together.
isincos
Version 0.3.0 added this experimental function:
- float isincos(float f, float *si, float *co) accepts any angle in degrees. returns both the sin(v) and the cos(v) of the same angle. Faster than both individual calls, see example. There is a minor difference between the value of the float co compared to icos(). This need some investigation ( truncating ?)
hypotFast
Strictly hypot() is no gonio function but it is often used for calculating length in polar coordinates.
angle = atan2(x,y);
length = hypot(x, y);
- float hypotFast(float x, float y) faster approximation of the hypot(x, y) Experimental!
Performance isin icos itan
time in us - calls 0 - 360 step 1 degree and calls 720 - 1080 (lib version 0.1.5) (clock speeds in MHz)
function | UNO 16 | ESP32 240 | UNO (720-1080) | ESP (720-1080) |
---|---|---|---|---|
sin | 120.43 | 10.90 | 124.19 | 10.91 |
isin | 44.24 | 1.09 | 85.00 | 1.11 |
cos | 120.27 | 10.81 | 123.98 | 10.83 |
icos | 51.40 | 1.16 | 91.42 | 1.18 |
tan | 147.59 | 18.07 | 151.39 | 18.07 |
itan | 126.73 | 1.31 | 129.93 | 1.29 |
*Note: itan() 0.1.3 was ( 131.23, 3.05 ) so it improved quite a bit on ESP32. *
Performance gain is most evident for the ESP32 processor, and much less on AVR. The effect of the modulo (360 degrees) can be seen explicitly in AVR. Furthermore the itan() on AVR is not faster when there is also interpolation (not in table)
The 0.1.4 version of itan() is faster for ESP32 than 0.1.3 version but the improvement on AVR is minimal. So this will stay on the TODO list.
Furthermore a lot of gain is lost when the angle is not within 0..360 and needs to be normalized ( expensive modulo on AVR ). It is worth noting that the original sin() cos() and tan() only have a small overhead for values outside the 0..360 range.
Please, verify the performance and accuracy to see if they meet the requirements of your project.
Accuracy isin icos itan
errors - based upon example sketch - lib version 0.1.5
ESP32 calls 0.0 - 360.0 step 0.1 degree
function | max abs error | avg abs error | max rel error | avg rel error |
---|---|---|---|---|
isin | 0.00010264 | 0.00002059 | 0.02955145 | 0.00035180 |
icos | 0.00010264 | 0.00002031 | 0.02955145 | 0.00034868 |
itan | 0.69696045 | 0.00640957 | 0.00144703 | 0.00010100 |
UNO calls 0.0 - 360.0 step 0.1 degree
function | max abs error | avg abs error | max rel error | avg rel error |
---|---|---|---|---|
isin | 0.00010270 | 0.00002059 | 0.02955145 | 0.00035171 |
icos | 0.00010264 | 0.00002032 | 0.02949960 | 0.00034869 |
itan | 0.72760009 | 0.00641527 | 0.00144703 | 0.00037889 |
*Note: 0.1.3 for AVR was bad: 17.41900634 , 0.02249339 , 0.02953807 for itan() *
Strange that the itan() on UNO and ESP32 differs (OK same order of magnitude). Different implementation of goniometry / float maths?
Please, verify the performance to see if it meets your requirements.
Performance iasin iacos iatan
(added 0.1.5)
time in us - calls -1 ..+1 step 0.001 degree
function | UNO 16 - | ESP32 240 |
---|---|---|
asin | 149.76 | 16.71 |
iasin | 107.70 | 2.58 |
acos | 169.50 | 15.44 |
iacos | 114.65 | 2.67 |
atan | 155.75 | 11.68 |
iatan | NI | NI |
- the interpolated reverse lookup is around 30% faster on UNO an 80+% on ESP32
- iatan() is Not Implemented.
Please, verify the accuracy to see if it meets your requirements.
Accuracy iasin iacos iatan
(added 0.1.5)
ESP32 calls -1 ..+1 step 0.001 degree
function | max abs error | avg abs error | max rel error | avg rel error |
---|---|---|---|---|
iasin | 0.22498322 | 0.00195790 | 0.00456106 | 0.00005727 |
iacos | 0.22498587 | 0.00195794 | 0.64284271 | 0.00021902 |
iatan | NI | NI | NI | NI |
- largest error at 0.999981 - second largest error 0.052841 at -0.999000
- iatan() is Not Implemented
UNO calls -1 ..+1 step 0.001 degree
function | max abs error | avg abs error | max rel error | avg rel error |
---|---|---|---|---|
iasin | 0.22499084 | 0.00195719 | 0.00456125 | 0.00005725 |
iacos | 0.22498588 | 0.00195740 | 0.64284276 | 0.00021901 |
iatan | NI | NI | NI | NI |
- largest error at 0.999981 - second largest error 0.052841 at -0.999000
- max relative error is high as it occurred near zero.
- iatan() is Not Implemented
Please, verify the accuracy to see if it meets your requirements.
Performance atanFast, atan2Fast
Indicative times in microseconds (first measurements)
function | atan | atanF | atan2 | atan2F | factor | notes |
---|---|---|---|---|---|---|
UNO | 188 | 96 | 196 | 124 | ~1.6 | range -1 .. 1 |
UNO | 220 | 124 | 212 | 128 | ~1.6 | |
ESP32 | 50 | 15 | 44 | 13 | ~3.3 |
The range of the second UNO is beyond the -1..1 range
Additional measurements are welcome. (use performance sketch)
To be elaborated.
Accuracy atanFast, atan2Fast
The atan2Fast() uses atanFast() so the accuracy for both is the same. The test sketch indicates a maximum error is smaller than 0.001.
To be elaborated.
Performance isincos()
isincos() calculates sin(f) and cos(f) in one call.
loop 1000 calls in microseconds.
function | UNO 16 | ESP32 240 |
---|---|---|
sin | 122872 | 10926 |
isin | 70704 | 1086 |
cos | 122636 | 10853 |
icos | 66588 | 1151 |
isin + icos | 148368 | 2248 |
isincos | 103788 | 1909 |
Note the isincos() is faster than the original sin() or cos() while being pretty accurate.
Accuracy isincos()
As the basic algorithm is very similar to isin() the accuracy is the same.
Performance hypotFast
hypotFast() approximates the hypot(x,y) function with a faster formula. Price is accuracy.
loop 1000 calls in microseconds.
function | UNO 16 | ESP32 240 |
---|---|---|
sqrt | 58.856 | 4.472 |
hypot | 56.588 | 7.111 |
hypotFast | 33.768 | 1.683 |
Note that sqrt(x^2, y^2) will overflow faster than hypot(x,y) or hypotFast(x,y).
Accuracy hypotFast
First measurements indicate that the maximum error is about 2.64% so on average about 1.32% (both + and -). See test sketch.
Please verify accuracy for the ranges used in your project!
Performance isin256 icos256 isincos256
isin256(), icos256() and isincos256() calculates the sin*256 etc. These functions all return an integer value. There is no floating point math in there so it performs a bit better. At some moment you must correct this factor of 256 with a division or a shift 8.
loop 1000 calls in microseconds. Based upon fastTrig_isincos256.ino
Note to test and compare, the values were multiplied by 100 and shifted by 8.
function | UNO 16 | ESP32 240 |
---|---|---|
sin | 131260 | 11364 |
isin | 79044 | 1119 |
isin256 | 28284 | 255 |
cos | 131008 | 11298 |
icos | 74928 | 1190 |
icos256 | 31704 | 289 |
isin256 + icos256 | 58352 | 478 |
isincos256 | 32300 | 339 |
Note the Ixxx256() series functions are Fast. The price is accuracy but might still be OK for many projects.
Accuracy isin256 icos256 isincos256
The Ixxx256() only accept whole degrees. Therefore the values come directly from the lookup tables. no interpolation. First measurements indicate that the error is less than 2%.
To be quantified
Operation
See examples
Future
Must
- improve documentation
- verify math (tables etc) again.
- write test sketches that output the tables for documentation :)
Should
- write more tests to verify values.
- test performance on more platforms.
- investigate the difference between isincos() and icos().
- investigate itan256()
- itan256(0) = 0 itan256(1) = 4 itan256(2) = 9 so there will be big steps...
- max abs error should be 0.5 or less, it might have its uses.
Could
- How to improve the accuracy of the whole degrees,
- now the table is optimized for interpolation.
- add sinc(x) = sin(x)/x function.?
- ixxx256() functions need another lookup table?
- separate .h file?
Wont
Support
If you appreciate my libraries, you can support the development and maintenance. Improve the quality of the libraries by providing issues and Pull Requests, or donate through PayPal or GitHub sponsors.
Thank you,