Loess 0.6 fails (never completes) while Loess 0.5.4 completes in a few seconds #73

palday · 2023-08-08T18:23:23Z

Data for MWE (compressed because GitHub doesn't like .arrow as attachments)

loess.arrow.zip

using Arrow
using Loess

tbl = Arrow.Table("loess.arrow")
# runs quickly on 0.5.4; never completes on 0.6.1
@time loess(tbl.x, tbl.y)

cc @dmbates

The text was updated successfully, but these errors were encountered:

andreasnoack · 2023-08-13T19:05:12Z

I've taken a closer look at this. There seems to be two things going on here. The first thing is that the old version used a different rule for deciding on splits when building the KD tree. The rule was different from the one in the Loess papers. The old approach just used the median even when it wasn't unique. The papers search for the index where a change happens. It took some work to figure out exactly how they did it so I wrote a comment. However, this is a linear search for where the x value changes and there are a lot of ties in your data. Specifically, it searches forward and backwards for x == 8 and

julia> countmap(tbl.x)[8]
431154

and it has to do a bit of sorting for each iteration. The second issue is that the partial sort seems to be slower than it should and if I add a function barrier then it's much faster so it seem that the closure is causing some overhead, but even after that change, building the tree for your dataset is prohibitively slow. However, I tried the loess in R which is based on the original Fortran version and it also seems to be really slow for this dataset so I think it's a consequence of the algorithm. Maybe it is possible use a bisection strategy instead of the linear search. I guess the assumption in the original paper was that the number of ties would be small such that the linear search would be fine.

palday changed the title ~~Loess 0.6 fails (never completes) while Loess 0.5.4 completes in a few second~~ Loess 0.6 fails (never completes) while Loess 0.5.4 completes in a few seconds Aug 8, 2023

palday mentioned this issue Aug 13, 2023

improve performance with ties #74

Merged

andreasnoack mentioned this issue Aug 15, 2023

Add let block around call to partialsort! to avoid overhead from captured variable #76

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loess 0.6 fails (never completes) while Loess 0.5.4 completes in a few seconds #73

Loess 0.6 fails (never completes) while Loess 0.5.4 completes in a few seconds #73

palday commented Aug 8, 2023 •

edited

Loading

andreasnoack commented Aug 13, 2023 •

edited

Loading

Loess 0.6 fails (never completes) while Loess 0.5.4 completes in a few seconds #73

Loess 0.6 fails (never completes) while Loess 0.5.4 completes in a few seconds #73

Comments

palday commented Aug 8, 2023 • edited Loading

andreasnoack commented Aug 13, 2023 • edited Loading

palday commented Aug 8, 2023 •

edited

Loading

andreasnoack commented Aug 13, 2023 •

edited

Loading