Try using [criterion-cycles-per-byte](https://crates.io/crates/criterion-cycles-per-byte) to better measure the performance.