dart_tensor_preprocessing

Tensor preprocessing library for Flutter/Dart. NumPy-like transforms pipeline for ONNX Runtime, TFLite, and other AI inference engines.

Features

PyTorch Compatible: Matches PyTorch/torchvision tensor operations
Non-blocking: Isolate-based async execution prevents UI jank
Type-safe: ONNX-compatible tensor types (Float32, Int64, Uint8, etc.)
Zero-copy: View/stride manipulation for reshape/transpose operations
Declarative: Chain operations into reusable pipelines

Installation

dependencies:
  dart_tensor_preprocessing: ^0.5.1

Quick Start

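import 'dart:typed_data';

import 'package:dart_tensor_preprocessing/dart_tensor_preprocessing.dart';

// Create a tensor from image data (HWC format, Uint8).
// height, width, and channels are the dimensions of the decoded image;
// the buffer must hold exactly height * width * channels bytes.
final imageData = Uint8List.fromList([/* pixel data in HWC order */]);
final tensor = TensorBuffer.fromUint8List(imageData, [height, width, channels]);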

// Use a preset pipeline for ImageNet models
final pipeline = PipelinePresets.imagenetClassification();
final result = await pipeline.runAsync(tensor);

// result.shape: [1, 3, 224, 224] (NCHW, Float32, normalized)

Pipeline Presets

| Preset | Output Shape | Use Case |
| --- | --- | --- |
| `imagenetClassification()` | [1, 3, 224, 224] | ResNet, VGG, etc. |
| `objectDetection()` | [1, 3, 640, 640] | YOLO, SSD |
| `faceRecognition()` | [1, 3, 112, 112] | ArcFace, FaceNet |
| `clip()` | [1, 3, 224, 224] | CLIP models |
| `mobileNet()` | [1, 3, 224, 224] | MobileNet family |

Custom Pipeline

final pipeline = TensorPipeline([
  ResizeOp(height: 224, width: 224),
  ToTensorOp(normalize: true),  // HWC -> CHW, scale to [0,1]
  NormalizeOp.imagenet(),       // ImageNet mean/std
  UnsqueezeOp.batch(),          // Add batch dimension
]);

// Sync execution
final syncResult = pipeline.run(input);

// Async execution (runs in an isolate)
final asyncResult = await pipeline.runAsync(input);

// Async with a custom isolate threshold (default: 100,000 elements).
// Small tensors skip the isolate overhead and run synchronously.
final tunedResult = await pipeline.runAsync(input, isolateThreshold: 50000);
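The threshold behaviour described in the comments above can be pictured with a small standalone sketch. This is not the library's implementation: `scaleToUnit` and `runMaybeInIsolate` are hypothetical names, and whether the real comparison is strict or inclusive is an assumption. Only `Isolate.run` (from `dart:isolate`) is existing Dart API.

```dart
import 'dart:isolate';
import 'dart:typed_data';

/// Hypothetical stand-in for a pipeline: scales uint8 values to [0, 1].
Float32List scaleToUnit(Uint8List pixels) {
  final out = Float32List(pixels.length);
  for (var i = 0; i < pixels.length; i++) {
    out[i] = pixels[i] / 255.0;
  }
  return out;
}

/// Runs [scaleToUnit] inline for small inputs and in a short-lived isolate
/// for large ones. The default mirrors the 100,000-element figure quoted
/// in the README; the strict `<` comparison is an assumption.
Future<Float32List> runMaybeInIsolate(
  Uint8List pixels, {
  int isolateThreshold = 100000,
}) async {
  if (pixels.length < isolateThreshold) {
    return scaleToUnit(pixels); // Too small to justify isolate start-up cost.
  }
  return Isolate.run(() => scaleToUnit(pixels));
}

Future<void> main() async {
  final small = Uint8List.fromList([0, 128, 255]);
  final result = await runMaybeInIsolate(small);
  print(result.length); // 3
}
```

Lowering the threshold moves more work off the calling isolate at the price of spawn and copy overhead on every call.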

Available Operations

Resize & Crop

ResizeOp - Resize to fixed dimensions (nearest, bilinear, bicubic)
ResizeShortestOp - Resize preserving aspect ratio
CenterCropOp - Center crop to fixed dimensions
ClipOp - Element-wise value clamping (presets: unit, symmetric, uint8)
PadOp - Padding with multiple modes (constant, reflect, replicate, circular)
SliceOp - Python-like tensor slicing with negative index support
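To make the idea behind center cropping concrete, here is a minimal sketch on a raw HWC buffer. It does not use the library: `centerCropHwc` is a hypothetical helper, and the floor division used for the offsets is an assumption (implementations differ on how they round odd margins).

```dart
import 'dart:typed_data';

/// Copies the central [cropH] x [cropW] window out of an HWC uint8 image.
Uint8List centerCropHwc(
  Uint8List src,
  int height,
  int width,
  int channels,
  int cropH,
  int cropW,
) {
  if (cropH > height || cropW > width) {
    throw ArgumentError('Crop size exceeds image size');
  }
  final top = (height - cropH) ~/ 2; // Rows skipped above the window.
  final left = (width - cropW) ~/ 2; // Columns skipped left of the window.
  final rowBytes = cropW * channels;
  final out = Uint8List(cropH * rowBytes);
  for (var y = 0; y < cropH; y++) {
    final srcStart = ((top + y) * width + left) * channels;
    out.setRange(y * rowBytes, (y + 1) * rowBytes, src, srcStart);
  }
  return out;
}

void main() {
  // 4x4 single-channel image with values 0..15.
  final image = Uint8List.fromList(List.generate(16, (i) => i));
  print(centerCropHwc(image, 4, 4, 1, 2, 2)); // [5, 6, 9, 10]
}
```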

Normalization

NormalizeOp - Channel-wise normalization (presets: ImageNet, CIFAR-10, symmetric)
ScaleOp - Scale values (e.g., [0-255] to [0-1])
BatchNormOp - Batch normalization for CNN inference (PyTorch compatible)
LayerNormOp - Layer normalization for Transformer inference (presets: BERT, BERT-Large)
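Channel-wise normalization computes `(x - mean[c]) / std[c]` for every element of channel `c`. The sketch below shows that arithmetic on a contiguous CHW buffer; `normalizeChw` is a hypothetical helper written for illustration, not the library's `NormalizeOp`.

```dart
import 'dart:typed_data';

/// Applies (x - mean[c]) / std[c] to every element of channel c in a
/// contiguous CHW float32 buffer.
Float32List normalizeChw(
  Float32List chw,
  int channels,
  List<double> mean,
  List<double> std,
) {
  final plane = chw.length ~/ channels; // H * W elements per channel.
  final out = Float32List(chw.length);
  for (var c = 0; c < channels; c++) {
    for (var i = 0; i < plane; i++) {
      final idx = c * plane + i;
      out[idx] = (chw[idx] - mean[c]) / std[c];
    }
  }
  return out;
}

void main() {
  // Two channels with two pixels each; statistics chosen for round results.
  final x = Float32List.fromList([0.5, 1.0, 0.25, 0.75]);
  print(normalizeChw(x, 2, [0.5, 0.25], [0.5, 0.25])); // [0.0, 1.0, 0.0, 2.0]
}
```

For reference, the ImageNet statistics commonly used with torchvision are mean `[0.485, 0.456, 0.406]` and std `[0.229, 0.224, 0.225]`; that the ImageNet preset uses exactly these values is an assumption.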

Layout

PermuteOp - Axis reordering (e.g., HWC to CHW)
ToTensorOp - HWC uint8 to CHW float32 with optional scaling
ToImageOp - CHW float32 to HWC uint8
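The HWC to CHW conversion is an index remapping: element `(y, x, c)` moves from `(y * W + x) * C + c` to `c * H * W + y * W + x`. A minimal sketch with a hypothetical `hwcToChw` helper (not the library's `ToTensorOp`) follows.

```dart
import 'dart:typed_data';

/// Converts an interleaved HWC uint8 image into a planar CHW float32 buffer,
/// optionally scaling values from [0, 255] to [0, 1].
Float32List hwcToChw(
  Uint8List hwc,
  int height,
  int width,
  int channels, {
  bool scale = true,
}) {
  final plane = height * width;
  final out = Float32List(hwc.length);
  for (var p = 0; p < plane; p++) {
    for (var c = 0; c < channels; c++) {
      final v = hwc[p * channels + c];
      out[c * plane + p] = scale ? v / 255.0 : v.toDouble();
    }
  }
  return out;
}

void main() {
  // Two RGB pixels: (255, 0, 0) and (0, 255, 0).
  final hwc = Uint8List.fromList([255, 0, 0, 0, 255, 0]);
  print(hwcToChw(hwc, 1, 2, 3)); // [1.0, 0.0, 0.0, 1.0, 0.0, 0.0]
}
```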

Data Augmentation

RandomCropOp - Random cropping with deterministic seed support
GaussianBlurOp - Gaussian blur using separable convolution
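Separable convolution means a 2D Gaussian blur is applied as two 1D passes (horizontal, then vertical), which costs O(k) per pixel per pass instead of O(k²). The sketch below only builds the normalized 1D kernel; `gaussianKernel1d` is a hypothetical helper, and sampling at integer offsets around the kernel center is an assumption about the convention.

```dart
import 'dart:math' as math;

/// Builds a 1D Gaussian kernel of [size] taps, sampled symmetrically around
/// the center and normalized so the weights sum to 1.
List<double> gaussianKernel1d(int size, double sigma) {
  final half = (size - 1) / 2.0;
  final weights = List<double>.generate(size, (i) {
    final x = i - half;
    return math.exp(-0.5 * (x / sigma) * (x / sigma));
  });
  final sum = weights.reduce((a, b) => a + b);
  return weights.map((w) => w / sum).toList();
}

void main() {
  final kernel = gaussianKernel1d(3, 1.0);
  // The center tap is the largest and the outer taps are equal.
  print(kernel[1] > kernel[0] && kernel[0] == kernel[2]); // true
}
```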

Utility

concat() - Concatenates tensors along specified axis

Shape

UnsqueezeOp - Add dimension
SqueezeOp - Remove size-1 dimensions
ReshapeOp - Reshape tensor (supports -1 for inference)
FlattenOp - Flatten dimensions
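Resolving a `-1` in a reshape target is simple arithmetic: divide the total element count by the product of the known dimensions. A minimal sketch, using a hypothetical `inferShape` helper rather than the library's `ReshapeOp`:

```dart
/// Replaces a single -1 entry in [shape] with the size that makes the
/// total element count equal [numElements].
List<int> inferShape(List<int> shape, int numElements) {
  final unknown = shape.indexOf(-1);
  if (unknown != shape.lastIndexOf(-1)) {
    throw ArgumentError('Only one dimension may be -1');
  }
  final known = shape.where((d) => d != -1).fold<int>(1, (a, b) => a * b);
  if (unknown == -1) {
    if (known != numElements) {
      throw ArgumentError('Shape $shape does not hold $numElements elements');
    }
    return List.of(shape);
  }
  if (known == 0 || numElements % known != 0) {
    throw ArgumentError('Cannot infer -1 in $shape for $numElements elements');
  }
  return List.of(shape)..[unknown] = numElements ~/ known;
}

void main() {
  print(inferShape([1, -1], 3 * 224 * 224)); // [1, 150528]
  print(inferShape([-1, 224, 224], 150528)); // [3, 224, 224]
}
```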

Type

TypeCastOp - Convert between data types

Core Classes

TensorBuffer

Tensor with shape and stride metadata over physical storage.

// Create tensors
final zeros = TensorBuffer.zeros([3, 224, 224]);
final ones = TensorBuffer.ones([3, 224, 224], dtype: DType.float32);
final fromData = TensorBuffer.fromFloat32List(data, [3, 224, 224]);

// Access elements
final value = tensor[[0, 100, 100]];

// Zero-copy operations
final transposed = tensor.transpose([2, 0, 1]);  // Changes strides only
final squeezed = tensor.squeeze();

// Copy operations
final contiguous = tensor.contiguous();  // Force contiguous memory
final cloned = tensor.clone();
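Why a transpose can be zero-copy: a strided tensor locates element `index` at `sum(index[i] * strides[i])` in flat storage, so reordering the shape and stride lists changes the logical layout without touching the data. The helpers below (`contiguousStrides`, `permute`, `offsetOf`) are hypothetical and only illustrate that bookkeeping.

```dart
/// Row-major strides for a contiguous tensor of the given shape.
List<int> contiguousStrides(List<int> shape) {
  final strides = List<int>.filled(shape.length, 1);
  for (var i = shape.length - 2; i >= 0; i--) {
    strides[i] = strides[i + 1] * shape[i + 1];
  }
  return strides;
}

/// Reorders [values] by [axes]. Applying this to both shape and strides is
/// all the metadata work a zero-copy transpose needs.
List<int> permute(List<int> values, List<int> axes) =>
    [for (final axis in axes) values[axis]];

/// Flat storage offset of a multi-dimensional index.
int offsetOf(List<int> index, List<int> strides) {
  var offset = 0;
  for (var i = 0; i < index.length; i++) {
    offset += index[i] * strides[i];
  }
  return offset;
}

void main() {
  final shape = [224, 224, 3]; // HWC
  final strides = contiguousStrides(shape);
  print(strides); // [672, 3, 1]

  final axes = [2, 0, 1]; // HWC -> CHW
  final chwStrides = permute(strides, axes);
  print(permute(shape, axes)); // [3, 224, 224]
  print(chwStrides); // [1, 672, 3]

  // The same element is reached through either view.
  print(offsetOf([100, 100, 0], strides)); // 67500
  print(offsetOf([0, 100, 100], chwStrides)); // 67500
}
```

A transposed view is generally no longer contiguous, which is why a copying operation such as `contiguous()` is listed separately.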

DType

ONNX-compatible data types with onnxId for runtime integration.

DType.float32  // ONNX ID: 1
DType.int64    // ONNX ID: 7
DType.uint8    // ONNX ID: 2
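The IDs above match the `TensorProto.DataType` values in the ONNX specification (FLOAT = 1, UINT8 = 2, INT64 = 7). As an illustration of how such a mapping can be modelled, here is a sketch with a hypothetical `SketchDType` enum; it is not the library's `DType` definition, and the `byteSize` field is added only for the example.

```dart
/// Hypothetical mirror of a DType-like enum, for illustration only.
enum SketchDType {
  float32(onnxId: 1, byteSize: 4),
  uint8(onnxId: 2, byteSize: 1),
  int64(onnxId: 7, byteSize: 8);

  const SketchDType({required this.onnxId, required this.byteSize});

  final int onnxId;
  final int byteSize;

  /// Looks up a type by its ONNX TensorProto.DataType value.
  static SketchDType fromOnnxId(int id) => values.firstWhere(
        (t) => t.onnxId == id,
        orElse: () => throw ArgumentError('Unsupported ONNX type id: $id'),
      );
}

void main() {
  print(SketchDType.fromOnnxId(7)); // SketchDType.int64
  // Bytes needed for a [3, 224, 224] float32 tensor.
  print(SketchDType.float32.byteSize * 3 * 224 * 224); // 602112
}
```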

BufferPool

Memory pooling for buffer reuse, reducing GC pressure in hot paths.

final pool = BufferPool.instance;

// Acquire buffer (reuses from pool if available)
final buffer = pool.acquireFloat32(1000);

// ... use buffer ...

// Release back to pool for reuse
pool.release(buffer);

// Monitor pool usage
print('Pooled: ${pool.pooledCount} buffers, ${pool.pooledBytes} bytes');
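The pooling idea can be sketched in a few lines: released buffers are kept in per-length free lists and handed back on the next matching request. `SimpleFloat32Pool` below is a hypothetical, simplified stand-in and says nothing about how the library's `BufferPool` is actually built (size bucketing, limits, and other dtypes are all left out).

```dart
import 'dart:typed_data';

/// Minimal buffer pool keyed by exact buffer length.
class SimpleFloat32Pool {
  final Map<int, List<Float32List>> _free = {};

  /// Returns a pooled buffer of [length] if one is available, otherwise
  /// allocates a new one. Reused buffers may still hold old values.
  Float32List acquire(int length) {
    final bucket = _free[length];
    if (bucket != null && bucket.isNotEmpty) {
      return bucket.removeLast();
    }
    return Float32List(length);
  }

  /// Makes [buffer] available for a later [acquire] of the same length.
  void release(Float32List buffer) {
    (_free[buffer.length] ??= []).add(buffer);
  }

  /// Number of buffers currently waiting in the pool.
  int get pooledCount =>
      _free.values.fold<int>(0, (count, bucket) => count + bucket.length);
}

void main() {
  final pool = SimpleFloat32Pool();
  final a = pool.acquire(1000);
  pool.release(a);
  final b = pool.acquire(1000);
  print(identical(a, b)); // true
  print(pool.pooledCount); // 0
}
```

Because a reused buffer is not cleared in this sketch, callers should overwrite every element rather than assume zeros.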

Zero-Copy View Operations

TensorBuffer extension methods for zero-copy tensor manipulation:

// Slice along first dimension (batch slicing)
final batch = tensor.sliceFirst(2, 5);  // Views elements 2..4

// Split tensor into views
final items = tensor.unbind(0);  // List of views along dim 0

// Select single index (reduces rank)
final first = tensor.select(0, 0);  // First item, shape reduced

// Narrow dimension
final narrowed = tensor.narrow(0, 1, 3);  // 3 elements starting at 1

// Format conversion without copying
final nhwc = nchwTensor.toChannelsLast();   // NCHW -> NHWC view
final nchw = nhwcTensor.toChannelsFirst();  // NHWC -> NCHW view

// Flatten to 1D view
final flat = tensor.flatten();
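View operations such as narrowing or selecting only adjust an offset, a shape, and a stride list. The sketch below models that metadata with a hypothetical `View` class; it is not the library's `TensorBuffer`, and bounds checking is omitted for brevity.

```dart
/// Metadata of a strided view: where it starts in storage and how to step.
class View {
  const View(this.offset, this.shape, this.strides);

  final int offset;
  final List<int> shape;
  final List<int> strides;

  /// Keeps [length] entries of [dim] starting at [start]; rank is unchanged.
  View narrow(int dim, int start, int length) {
    final newShape = List.of(shape)..[dim] = length;
    return View(offset + start * strides[dim], newShape, strides);
  }

  /// Fixes [dim] at [index] and drops that dimension; rank shrinks by one.
  View select(int dim, int index) {
    return View(
      offset + index * strides[dim],
      [for (var i = 0; i < shape.length; i++) if (i != dim) shape[i]],
      [for (var i = 0; i < strides.length; i++) if (i != dim) strides[i]],
    );
  }
}

void main() {
  const base = View(0, [8, 3, 4], [12, 4, 1]); // Contiguous [8, 3, 4].

  final narrowed = base.narrow(0, 1, 3);
  print('${narrowed.offset} ${narrowed.shape}'); // 12 [3, 3, 4]

  final selected = base.select(0, 2);
  print('${selected.offset} ${selected.shape} ${selected.strides}');
  // 24 [3, 4] [4, 1]
}
```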

Memory Formats

| Format | Layout | Strides (for [1,3,224,224]) |
| --- | --- | --- |
| `contiguous` | NCHW | [150528, 50176, 224, 1] |
| `channelsLast` | NHWC | [150528, 1, 672, 3] |
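The channels-last strides in the table follow from storing the data in NHWC order while still indexing it as `[n, c, h, w]`: the channel stride becomes 1, the width stride becomes `C`, and the height stride becomes `W * C`. A small sketch with a hypothetical `channelsLastStrides` helper reproduces the numbers:

```dart
/// Strides, in NCHW index order, of a tensor whose data is stored as NHWC.
List<int> channelsLastStrides(List<int> nchwShape) {
  if (nchwShape.length != 4) {
    throw ArgumentError('Expected a 4-D NCHW shape');
  }
  final c = nchwShape[1];
  final h = nchwShape[2];
  final w = nchwShape[3];
  return [h * w * c, 1, w * c, c];
}

void main() {
  print(channelsLastStrides([1, 3, 224, 224])); // [150528, 1, 672, 3]
}
```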

PyTorch Compatibility

This library is designed to produce results identical to those of the corresponding PyTorch/torchvision operations:

| Operation | PyTorch Equivalent |
| --- | --- |
| `TensorBuffer.zeros()` | `torch.zeros()` |
| `TensorBuffer.ones()` | `torch.ones()` |
| `tensor.transpose()` | `tensor.permute()` |
| `tensor.reshape()` | `tensor.reshape()` |
| `tensor.squeeze()` | `tensor.squeeze()` |
| `tensor.unsqueeze()` | `tensor.unsqueeze()` |
| `tensor.sum()` / `sumAxis()` | `tensor.sum()` |
| `tensor.mean()` / `meanAxis()` | `tensor.mean()` |
| `tensor.min()` / `max()` | `tensor.min()` / `max()` |
| `NormalizeOp.imagenet()` | `transforms.Normalize(mean, std)` |
| `ResizeOp(mode: bilinear)` | `F.interpolate(mode='bilinear')` |
| `ToTensorOp()` | `transforms.ToTensor()` |
| `ClipOp(min, max)` | `torch.clamp(min, max)` |
| `PadOp(mode: reflect)` | `F.pad(mode='reflect')` |
| `SliceOp([(start, end, step)])` | `tensor[start:end:step]` |
| `concat(tensors, axis)` | `torch.cat(tensors, dim)` |
| `RandomCropOp` | `transforms.RandomCrop()` |
| `GaussianBlurOp` | `transforms.GaussianBlur()` |
| `AddOp` / `SubOp` | `torch.add()` / `torch.sub()` |
| `MulOp` / `DivOp` | `torch.mul()` / `torch.div()` |
| `PowOp` | `torch.pow()` |
| `AbsOp` / `NegOp` | `torch.abs()` / `torch.neg()` |
| `SqrtOp` / `ExpOp` / `LogOp` | `torch.sqrt()` / `exp()` / `log()` |
| `ReLUOp` / `LeakyReLUOp` | `F.relu()` / `F.leaky_relu()` |
| `SigmoidOp` / `TanhOp` | `torch.sigmoid()` / `torch.tanh()` |
| `SoftmaxOp` | `F.softmax()` |
| `BatchNormOp` | `torch.nn.BatchNorm2d` (inference) |
| `LayerNormOp` | `torch.nn.LayerNorm` |
| `TensorBuffer.full()` | `torch.full()` |
| `TensorBuffer.random()` | `torch.rand()` |
| `TensorBuffer.randn()` | `torch.randn()` |
| `TensorBuffer.eye()` | `torch.eye()` |
| `TensorBuffer.linspace()` | `torch.linspace()` |
| `TensorBuffer.arange()` | `torch.arange()` |
| `tensor.select(dim, index)` | `tensor.select(dim, index)` |
| `tensor.narrow(dim, start, len)` | `tensor.narrow(dim, start, len)` |
| `tensor.unbind(dim)` | `tensor.unbind(dim)` |
| `tensor.flatten()` | `tensor.flatten()` |

Performance Benchmarks

Run benchmarks with `dart run benchmark/run_all.dart`.

Zero-Copy Operations (O(1))

| Operation | Time | Ops/sec |
| --- | --- | --- |
| `transpose()` | ~1µs | 700K+ |
| `reshape()` | ~1µs | 1.6M+ |
| `squeeze()` | <1µs | 3.2M+ |
| `unsqueeze()` | ~1µs | 780K+ |

Pipeline Performance

| Pipeline | Input Shape | Time |
| --- | --- | --- |
| Simple (Normalize + Unsqueeze) | [3, 224, 224] | ~3.4ms |
| ImageNet Classification | [3, 224, 224] | ~3.0ms |
| Object Detection | [3, 640, 640] | ~25ms |

Sync vs Async

| Execution | 224x224 | 640x640 |
| --- | --- | --- |
| `run()` (sync) | ~3.5ms | ~29ms |
| `runAsync()` (isolate) | ~11ms | ~93ms |
| Isolate overhead | ~7ms | ~64ms |

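Note: runAsync() is slower end to end than run() because of the isolate overhead (about 7ms at 224x224 and 64ms at 640x640), but it keeps the calling isolate free. Use it for large tensors or when UI responsiveness is critical.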

Requirements

Dart SDK ^3.0.0

License

MIT
