DSFaker aims at providing data scientists with functions for generating and streaming arrays of data such as:
- Time-series
- Date/Datetime
- Autoincrement & random autoincrement
- Trigonometric (Sin, Sinh, Cos, Cosh, Tan, Tanh)
- Random distributions
- Pattern repetition
- Noise application
Basic operators between generators and/or python int/float types (+, -, /, //, *, …) are also implemented.
To generate such data, you can specify the distribution you want amongst the ones provided by numpy.random. Or you can also implement your own distribution to generate data.
pip install dsfaker
The documentation is available here: https://eds-aphp.github.io/dsfaker/.