
{SLmetrics}: Machine learning performance evaluation on steroids


{SLmetrics} is a lightweight R package written in C++ and {Rcpp} for memory-efficient and lightning-fast machine learning performance evaluation; it’s like using a supercharged {yardstick}, but without the risk of soft to super-hard deprecations. {SLmetrics} covers both regression and classification metrics, and provides (almost) the same array of metrics as {scikit-learn} and {PyTorch}, all without {reticulate} and the Python compile-run-(crash)-debug cycle.

Depending on the mood and alignment of the planets, {SLmetrics} stands for Supervised Learning metrics or Statistical Learning metrics. If {SLmetrics} catches on, the latter will become the core philosophy, and the package will grow to include unsupervised learning metrics. If not, it will remain a {pkg} for Supervised Learning metrics, and a sandbox for me to develop my C++ skills.


🚀 Getting Started

Below you’ll find instructions to install {SLmetrics} and get started with your first metric, the Root Mean Squared Error (RMSE).

🛡️ Installation

## install stable release
devtools::install_github(
  repo = 'serkor1/SLmetrics@*release'
)
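
If you prefer {remotes} (which {devtools} wraps for this call), the equivalent one-liner is:

## same call via {remotes}
remotes::install_github('serkor1/SLmetrics@*release')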

📚 Basic Usage

Below is a minimal example demonstrating how to compute both unweighted and weighted RMSE.

library(SLmetrics)

actual    <- c(10.2, 12.5, 14.1)
predicted <- c(9.8, 11.5, 14.2)
weights   <- c(0.2, 0.5, 0.3)

cat(
  "Root Mean Squared Error", rmse(
    actual    = actual,
    predicted = predicted
  ),
  "Root Mean Squared Error (weighted)", weighted.rmse(
    actual    = actual,
    predicted = predicted,
    w         = weights
  ),
  sep = "\n"
)
#> Root Mean Squared Error
#> 0.6244998
#> Root Mean Squared Error (weighted)
#> 0.7314369
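
The unweighted value follows the textbook definition, sqrt(mean((actual - predicted)^2)), and the weighted variant replaces the plain mean with a weighted mean. A quick hand-check in base R (assuming these conventional definitions, which reproduce the output above):

## hand-check: square root of the mean squared error
sqrt(mean((actual - predicted)^2))
#> [1] 0.6244998

## weighted mean of the squared errors instead of the plain mean
sqrt(sum(weights * (actual - predicted)^2) / sum(weights))
#> [1] 0.7314369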

That’s all! Now you can explore the rest of this README for in-depth usage, performance comparisons, and more details about {SLmetrics}.

ℹ️ Why?

Machine learning can be a complicated task; the steps from feature engineering to model deployment require carefully measured actions and decisions. One low-hanging fruit to simplify this process is performance evaluation.

At its core, performance evaluation is essentially just comparing two vectors — a programmatically and, at times, mathematically trivial step in the machine learning pipeline, but one that can become complicated due to:

  1. Dependencies and potential deprecations
  2. Needlessly complex or repetitive arguments
  3. Performance and memory bottlenecks at scale

{SLmetrics} solves these issues by being:

  1. Fast: Powered by C++ and Rcpp
  2. Memory-efficient: Everything is structured around pointers and references
  3. Lightweight: Only depends on Rcpp, RcppEigen, and lattice
  4. Simple: S3-based, minimal overhead, and flexible inputs

Performance evaluation should be plug-and-play and “just work” out of the box; with {SLmetrics} there is no need to worry about quasiquotation, dependencies, deprecations, or subtly different variants of the same function depending on its arguments.
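
To make the last two points concrete: every metric is a plain S3 function of two vectors, so it slots directly into ordinary R idioms with no tidy-eval or data-frame scaffolding. A small sketch (the candidate models are illustrative):

## evaluate several candidate models in one sapply() call;
## rmse() takes plain vectors, so no wrappers are needed
models <- list(
  small = lm(mpg ~ wt,           data = mtcars),
  large = lm(mpg ~ wt + hp + am, data = mtcars)
)

sapply(models, function(fit) rmse(mtcars$mpg, fitted(fit)))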

⚡ Performance Comparison

One, obviously, can’t build an R package on C++ and {Rcpp} without a proper pissing contest at the urinals. Below is a comparison of execution time and memory efficiency for two simple cases that any {pkg} should handle gracefully: computing a 2 x 2 confusion matrix and computing the RMSE.¹

⏩ Speed comparison

As shown in the chart, {SLmetrics} maintains consistently low(er) execution times across different sample sizes.

💾 Memory-efficiency

Below are the results for garbage collections and total memory allocations when computing a 2 x 2 confusion matrix (N = 1e7) and RMSE (N = 1e7). Notice that {SLmetrics} requires no GC calls for these operations.

| Package        | Iterations | Garbage Collections [gc()] | gc() per second | Memory Allocation (MB) |
|:---------------|-----------:|---------------------------:|----------------:|-----------------------:|
| {SLmetrics}    |        100 |                          0 |            0.00 |                       0 |
| {yardstick}    |        100 |                        186 |            4.53 |                     381 |
| {MLmetrics}    |        100 |                        186 |            4.47 |                     381 |
| {mlr3measures} |        100 |                        386 |            3.57 |                     916 |

2 x 2 Confusion Matrix (N = 1e7)

| Package        | Iterations | Garbage Collections [gc()] | gc() per second | Memory Allocation (MB) |
|:---------------|-----------:|---------------------------:|----------------:|-----------------------:|
| {SLmetrics}    |        100 |                          0 |            0.00 |                       0 |
| {yardstick}    |        100 |                        157 |            4.47 |                     420 |
| {MLmetrics}    |        100 |                         19 |            2.39 |                      76 |
| {mlr3measures} |        100 |                         12 |            1.27 |                      76 |

RMSE (N = 1e7)

In both tasks, {SLmetrics} remains extremely memory-efficient, even at large sample sizes.
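
Numbers of this kind can be reproduced with {bench}; below is a minimal sketch for the RMSE case (the data generation here is illustrative, and the full benchmark script is linked in the footnote):

## sketch: measuring allocations and GC with {bench}
library(bench)
library(SLmetrics)

N         <- 1e7
actual    <- rnorm(N)
predicted <- actual + rnorm(N)

bench::mark(
  `{SLmetrics}` = rmse(actual, predicted),
  iterations    = 100,
  check         = FALSE
)[, c("expression", "n_gc", "mem_alloc")]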

Important

From {bench} documentation: Total amount of memory allocated by R while running the expression. Memory allocated outside the R heap, e.g. by malloc() or new directly is not tracked, take care to avoid misinterpreting the results if running code that may do this.

ℹ️ Basic usage

In their simplest form, {SLmetrics} functions work directly with pairs of <numeric> vectors (for regression) or <factor> vectors (for classification). Below we demonstrate this on two well-known datasets: mtcars (regression) and iris (classification).

📚 Regression

We first fit a linear model to predict mpg in the mtcars dataset, then compute the in-sample RMSE:

# Evaluate a linear model on mpg (mtcars)
model <- lm(mpg ~ ., data = mtcars)
rmse(mtcars$mpg, fitted(model))
#> [1] 2.146905
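
The value above is in-sample and therefore optimistic; the same function evaluates held-out predictions just as easily. A sketch with an illustrative 75/25 split:

## out-of-sample RMSE (the split itself is illustrative)
set.seed(1903)
idx   <- sample(seq_len(nrow(mtcars)), size = 24)
model <- lm(mpg ~ ., data = mtcars[idx, ])

rmse(mtcars$mpg[-idx], predict(model, newdata = mtcars[-idx, ]))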

📚 Classification

Now we recode the iris dataset into a binary problem (“virginica” vs. “others”) and fit a logistic regression. We then generate predicted classes, compute the confusion matrix, and summarize it.

# 1) recode iris
# to binary problem
iris$species_num <- as.numeric(
  iris$Species == "virginica"
)

# 2) fit the logistic
# regression
model <- glm(
  formula = species_num ~ Sepal.Length + Sepal.Width,
  data    = iris,
  family  = binomial(
    link = "logit"
  )
)

# 3) generate predicted
# classes
predicted <- factor(
  as.numeric(
    predict(model, type = "response") > 0.5
  ),
  levels = c(1,0),
  labels = c("Virginica", "Others")
)

# 4) generate actual
# values as factor
actual <- factor(
  x = iris$species_num,
  levels = c(1,0),
  labels = c("Virginica", "Others")
)
# 5) compute and summarize
# the confusion matrix
summary(
  confusion_matrix <- cmatrix(
    actual    = actual,
    predicted = predicted
  )
)
#> Confusion Matrix (2 x 2) 
#> ================================================================================
#>           Virginica Others
#> Virginica        35     15
#> Others           14     86
#> ================================================================================
#> Overall Statistics (micro average)
#>  - Accuracy:          0.81
#>  - Balanced Accuracy: 0.78
#>  - Sensitivity:       0.81
#>  - Specificity:       0.81
#>  - Precision:         0.81
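
The summary prints micro-averaged statistics; individual metrics can also be computed directly from the same factor pair. A sketch, assuming accuracy() and sensitivity() are exported with the same actual/predicted interface as cmatrix() (check the package reference for the exact signatures):

## sketch: single metrics from the factor pair
## (function names assumed; verify against the package reference)
accuracy(actual = actual, predicted = predicted)
sensitivity(actual = actual, predicted = predicted)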

ℹ️ Installation

🛡️ Stable version

## install stable release
devtools::install_github(
  repo = 'serkor1/SLmetrics@*release'
)

🛠️ Development version

## install development version
devtools::install_github(
  repo = 'serkor1/SLmetrics',
  ref  = 'development'
)

ℹ️ Code of Conduct

Please note that the {SLmetrics} project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

Footnotes

  1. The source code for these benchmarks is available here.