infinteAverage
Arduino Library to calculate an average of many samples
Description
This library is an experimental library that cascades a float and a uint32_t type. It was created from an idea when an overflow was encountered in my Statistic Class due too many samples.
Problem
As an 32 bit float has ~7 decimals precision, after 10 million additions the sum becomes 7 orders of magnitude larger than individual samples. From that moment the addition will not increase the sum correctly or even not at all.
Taking the average is taking the sum and divide that by the count of the numbers. Only if the count is fixed one could divide the samples first and then sum them. This library supports the first scenario.
Idea
To cope with the overflow problem, this lib uses an float combined with an uint32_t.
The float is used for the decimal part and the uint32_t for the whole part. In theory this should give about 15 significant digits for the average in a 9.6 format. but this precision is only internal to do some math. WHen the average() is calculated the value returned is "just" a float.
(since 0.1.2) If the library detects that there are 2 billion++ (0x8000000) added or if the whole part of the sum reaches that number, all internal counters are divided by 2. That does not affect the minimum and maximum and the average only slightly.
Since version 0.1.4 users can change this threshold and adjust it to data added. NB if you add only small values e.g between 0..100 this threshold may be at 4 billion.
Conclusion (for now)
The library allows two things
- take the average of many many samples where normally a summing float would "overflow"
- take the average of numbers that differ in order of magnitude
Notes
Note the practical meaning of the average of millions or billions of numbers is a discussion worth taking. Normally the outliers are the most interesting.
Note the library is not tested extensively, so use (with care) at your own risk.
Note library does not support negative yet. Planned for 0.2.0. First get more hands-on experience with it.
Interface
- IAVG() constructor, resets the internal counters to 0
- void reset() resets the internal counters to 0
- void add(float value) adds value to the internal float uint32_t pair.
- float decimals() returns the internal float = decimals part.
- uint32_t whole() returns the internal whole part.
- uint32_t count() returns the number of values added. Note this may be scaled back a factor of 2 or more.
- float average() returns the average in float format, or NAN if count == 0
- float minimum() returns the minimum in float format, or NAN if count == 0
- float maximum() returns the maximum in float format, or NAN if count == 0
Threshold
(since 0.1.4) User can set the value when the sum and count are divided by two, to prevent internal counters to overflow. Default at startup this value is (1UL << 31).
- void setDivideThreshold(uint32_t threshold)
- uint32_t getDivideThreshold()
Operation
See examples
Future
This library has its limits which needs to be investigated.
One could upgrade the idea to a combination of a 8 byte double and a uint64_t to get around 28 significant digits => 18.10 format
- investigate "adaptive" scaling, now there is a virtual border at 1, but that could be at value (and might be user definable) This would allow to adjust to known order of size of the numbers. (e.g. if numbers are all in the billions the uint32_t would overflow very fast)
- investigate other math with this data type, starting with + - / * ?
- printable interface?
- play if time permits.
- update documentation
- add examples
0.2.0
- add negative numbers
- test negative numbers / mixed numbers.