Upstreaming 2024 Nyrkiö patches #27

henrikingo · 2025-01-06T19:07:40Z

No description provided.

henrikingo · 2025-01-06T19:15:18Z

Ah. I think I know what this is: We use hunter together with Nyrkiö, and the latter has pytest-benchmark, so it successfully went through all our own testing. I'll addpytest-benchmark to poetry files and try again. (But not today.)

Gerrrr · 2025-01-10T04:05:21Z

All the changes look good! We just need to remove pytest-benchmark or update the lock.

henrikingo · 2025-01-10T19:24:02Z

Ok actually I think you need to restart the test. I'm not a committer yet :-D

This can be used to compare whether an AnalyzedSeries object is more recent than the set of change points it was computed from. (Think cache invalidation, even if the AnalyzedSeries isn't necessarily a cache.)

Adds dependency pytest-benchmark

Hunter modified e-divisive such that it first does a pass using a higher p-value, then filters out all change points that have a higher p-value than the actual max_pvalue specified by the user. The initial higher pvalue is max_pvalue * 10. This means that for values higher than 0.1, the first pvalue is > 1.0. This doesn't make sense. In fact 1.0 also doesn't make sense because now every point is a weak change point. This patch modifies the call to split() such that: max_pvalue: first pass pvalue 0.0 - 0.05: max_pvalue * 10 (unchanged) 0.0 - 0.5 : max_pvalue * 2 0.5 - 1.0 : max_pvalue Since values above 0.1 didn't really make sense before this patch, the area where this could cause changes is for users using max_pvalue between 0.05 and 0.1. The merge() phase should however eventually produce approximately the same set of change points anyway, but this isn't guaranteed.

This is the default in AnalysisOptions, which most users would use, since it is required in the typical code path.

The common case is to add new data points to the end of the series. In this case we don't need to recompute all change points, we can just compute window_len points from the end. We do roughly 2 * window_len for good measure.

henrikingo · 2025-01-10T21:21:14Z

Ok actually I think you need to restart the test. I'm not a committer yet :-D

Never mind. It was just so fast I didn't realize it had ran already.

Since it is random by design, can't really unit testas usual.

henrikingo added 8 commits January 10, 2025 22:37

Update poetry.lock

aa4d4cc

Add to_json() and from_json() serialization methods.

5567470

Add timestamp to AnalyzedSeries

208d89a

This can be used to compare whether an AnalyzedSeries object is more recent than the set of change points it was computed from. (Think cache invalidation, even if the AnalyzedSeries isn't necessarily a cache.)

Add new unit test and perf tests using tigerbeetle dataset

997b3ea

Adds dependency pytest-benchmark

compute_change_points(): Default min_magnitude to 0.0

28804cf

This is the default in AnalysisOptions, which most users would use, since it is required in the typical code path.

Optimization: Incremental Hunter

fa0508d

The common case is to add new data points to the end of the series. In this case we don't need to recompute all change points, we can just compute window_len points from the end. We do roughly 2 * window_len for good measure.

One years worth of linting and formatting...

e616f64

henrikingo force-pushed the to-asf-upstream2 branch from 03f22e9 to e616f64 Compare January 10, 2025 20:38

flake8 format

1ce8bb8

Disable asserts for orig_edivisive test.

542767d

Since it is random by design, can't really unit testas usual.

Gerrrr approved these changes Jan 10, 2025

View reviewed changes

smccarthy788 approved these changes Jan 10, 2025

View reviewed changes

henrikingo merged commit 72c34f6 into apache:master Jan 10, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upstreaming 2024 Nyrkiö patches #27

Upstreaming 2024 Nyrkiö patches #27

henrikingo commented Jan 6, 2025

henrikingo commented Jan 6, 2025

Gerrrr commented Jan 10, 2025

henrikingo commented Jan 10, 2025

henrikingo commented Jan 10, 2025

Upstreaming 2024 Nyrkiö patches #27

Upstreaming 2024 Nyrkiö patches #27

Conversation

henrikingo commented Jan 6, 2025

henrikingo commented Jan 6, 2025

Gerrrr commented Jan 10, 2025

henrikingo commented Jan 10, 2025

henrikingo commented Jan 10, 2025