-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Roadmap #1
Comments
I think it makes sense for For methods that are already in
Okay, that sounds good. I've given you push access. Alternatively, if you'd like to have your repo be the main one instead of this one, that would be fine with me. |
Once #9 gets merged I think we are in a good position to officially release version 0.1.0 on crates.io - what do you think? @jturner314 |
I agree. By the way, I recently came across Julia's |
Added a bunch of tests to #9 and merged 🎉 It feels like ages since I started to work on it 😅 Your contribution was extremely helpful to get it in the shape it is right now, thanks a lot @jturner314! What do we need to do in order to release on crates.io? |
Yay! 🎉 That was a big job; great work.
Ideally, we'd eliminate the It would also be good to merge #12 and #13 before releasing. |
|
💯 💯 I think it's safe to say it would have never got there without your help 😛 |
I have drafted a tentative roadmap with the features I'd like to add in the next release - please edit it with your comments and suggestions @jturner314 |
The roadmap looks good to me. I'm not familiar with the applications of higher order central moments (I'd usually use a histogram instead), but I don't mind adding them if people find them useful. By the way, I invited you as an owner for the |
Somehow I didn't receive an email notification, but the invite was on my dashboard - accepted it! The main objective in that area is getting kurtosis and skewness, and given the kind of computation required to achieve that it makes sense to also roll out higher order central moments I'd say :) |
Hey mate, |
Thanks for your interest! You'll want to add I'd suggest starting with the existing implementation for It would also be good to add Please feel free to ask if you have any questions. |
Hey mates, I have added And what does this mean by this? |
It's because we require the data type to be This reduces the failure scope:
This can be useful when you are dealing with floats or arrays with potentially missing values (e.g. Re: quantiles - the current implementation requires |
Thanks @LukeMathWalker for the last point, if we change |
Exactly! @phungleson |
Cool thanks @LukeMathWalker so seems like everything is more or less complete? Let me know if there are any doable features, cheers. BTW |
For
|
Yes @phungleson, it basically boils down to summing together the |
I'd like to close existing work streams and cut a release - what does your bandwidth look like @jturner314 to review open PRs? |
I've been meaning to look over the open PRs but haven't had a chance. I'll reserve time on Sunday to review them. |
It seems I managed to publish I'd say we have done a major leap forward in terms of features - there are things that can be polished, the API design can be further improved and we can optimize the existing code, but I'll clean up the parent post to move items that we didn't manage to include in this release to the roadmap for the next one. I am not sure what we should be covering next in terms of major new functionality 🤔 |
Well done all 👏 |
Great job on 0.2.0 everyone!
A couple of ideas from StatsBase.jl:
We could also add statistical models (e.g. linear regression), but that might be best put in a separate crate. |
Well done! cheers! |
Unless any of you have made a start on these, I'd be interested in having a go at either, or contributing. I'll try to spend some time in the next couple of days looking at what is involved with the Deviation functions. ❓ Does anyone have any implementation suggestions other than just trying to port from StatsBase.jl? If anyone wants to collaborate on the code then let me know. |
Ok I made a start: #41 Any advice for choosing traits bounds for the |
I'd say to use |
@LukeMathWalker thanks. What led you to that decision? Is there a particular data type you've seen used in |
I see it as a tradeoff between convenience and generality - I am not personally aware of any "popular" numerical type that is not |
I wanted to code a simple
but I realize that it's too simple. This code is only useful for 1D arrays, or flattened matrices/images, etc. I can change the
What do you guys had in mind? |
I think it makes perfect sense to have two functions. @nilgoyette |
I was thinking of picking up the histogram |
@aeroaks I'd say go for it 💯 You could raise a draft PR if you get something working and want some early feedback. |
I would like to implement something like scipy.stats.binned_statistic_dd based on ndarray_stats::histogram::Histogram, allowing to caluclate running means, variances, sums, max/min value in each bin. |
It does! @RolfStierle |
According to https://docs.scipy.org/doc/numpy/reference/generated/numpy.histogram_bin_edges.html#numpy.histogram_bin_edges Some of bins building strategies are not implemented by
|
Thanks very much for sharing the good work. Would it be possible to add univariate, bivariate and multivariate kernel density estimation functions? Thanks. |
In terms of functionality, the mid-term end goal is to achieve feature parity with the statistics routine in
numpy
(here) andJulia StatsBase
(here).For the next version:
partialord
version forquantiles
methods;merge
method;For version 0.2.0:
For version 0.1.0:
max
/nanmax
(@jturner314)min
/nanmin
(@jturner314)quantile
/nanquantile
(it includespercentile
/nanpercentile
as a special case) (@LukeMathWalker & @jturner314)correlation
-methods:cov
(@LukeMathWalker) -One last fix to be made (Remove 'static bound from type[On hold for now]A
inCorrelationExt.cov
#3)corrcoef
(@LukeMathWalker - Pearson correlation #5)histogram
-methods (@LukeMathWalker - Histogram (revisited) #9)The text was updated successfully, but these errors were encountered: