This project evaluates various Speech-to-Text (STT) configurations for both streaming and batch (offline) transcription, testing different model variants, container environments, model serving frameworks, deployment platforms, and hardware configurations.
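For orientation, a batch (offline) transcription with the openai-whisper Python package can be as small as the sketch below; the model size and audio file name are illustrative, not the project's defaults.

```python
import whisper

# Load a Whisper model variant; "base" is illustrative -- the benchmarks
# described below exercise multiple model sizes.
model = whisper.load_model("base")

# Offline (batch) transcription of a single audio file.
result = model.transcribe("harvard.wav")
print(result["text"])
```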
The project's purpose is twofold:
- Demonstrate how to move from individual experimentation to operational enterprise scale.
- Collect benchmarking data to guide architectural and operational decisions on model performance, system efficiency, and container security.
The journey unfolds in four stages:

- START HERE -> Crawl README — Experiment locally using Ubuntu and UBI9-minimal containers on a single server.
- Walk README — Scale from a single server to a Kubernetes cluster.
- Run README — Shift from embedded inference to decoupled model serving.
- Sprint — Integrate Model Registries and optimize for production-scale serving.
Along the journey, we introduce automation and answer common performance, security, and scalability questions.
Benchmarking captures metrics across the following dimensions (a timing sketch follows the list):
- Models: OpenAI Whisper
- Containers: Ubuntu, UBI9-minimal
- Platforms: Linux, Kubernetes
- Model Servers: OpenAI Whisper, vLLM
- CPUs: Intel Cascade Lake, AWS Graviton3, AMD EPYC, Intel Sapphire Rapids
- GPUs: T4, L4, A10, H100
- Instance Types: g4dn.12xlarge, g6.12xlarge, g5.12xlarge, p5.48xlarge
- Command Modes: basic, hyperparameters
- Start Modes: cold, warm
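As one illustration of the Start Modes and Command Modes dimensions, the hedged sketch below times a cold run (the first call after model load) against a warm run, and contrasts a basic call with one passing decoding hyperparameters. The timing approach and the specific hyperparameter values are assumptions, not the project's benchmark harness.

```python
import time
import whisper

model = whisper.load_model("base")  # first load pulls weights into memory

def timed_transcribe(path: str, **decode_options) -> float:
    """Return wall-clock seconds for a single transcription."""
    start = time.perf_counter()
    model.transcribe(path, **decode_options)
    return time.perf_counter() - start

# Cold vs. warm: the first call pays one-time initialization costs;
# repeat calls benefit from warmed caches.
cold = timed_transcribe("harvard.wav")
warm = timed_transcribe("harvard.wav")

# Basic vs. hyperparameters: whisper's transcribe() forwards decoding
# options such as beam_size and temperature (values here are illustrative).
tuned = timed_transcribe("harvard.wav", beam_size=5, temperature=0.0)

print(f"cold: {cold:.2f}s  warm: {warm:.2f}s  tuned: {tuned:.2f}s")
```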
Accuracy is scored against the following reference texts (a sample WER computation follows the list):

- Harvard.txt
- JFK Inaugural Address (official transcript)
- JFK Rice University Speech (official transcript)
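The accuracy comparison is conceptually a word error rate (WER) calculation against these official texts. Below is a minimal sketch using the jiwer library; the file names are hypothetical placeholders, and the project's compare_transcripts.py may compute additional metrics.

```python
from jiwer import wer

# Reference: the official transcript; hypothesis: Whisper's output.
# Both file names are hypothetical placeholders.
with open("jfk_inaugural_official.txt") as f:
    reference = f.read()
with open("jfk_inaugural_whisper.txt") as f:
    hypothesis = f.read()

# WER = (substitutions + deletions + insertions) / reference word count
print(f"WER: {wer(reference, hypothesis):.3f}")
```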
Key scripts (a resource-sampling sketch follows the list):

- `whisper-functional-batch-metrics.sh` - Batch Transcription Benchmarking
- `compare_transcripts.py` - Accuracy Metrics Scoring
- `system_non_functional_monitoring.py` - System Resource Monitoring
- `cleanup-benchmark-results.sh` - Benchmark Workspace Cleanup
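For a sense of what system resource monitoring involves, the sketch below samples CPU and memory utilization with psutil during a benchmark window. It is an assumption about the general approach, not the actual contents of system_non_functional_monitoring.py.

```python
import time
import psutil

def sample_resources(duration_s: float = 60.0, interval_s: float = 1.0) -> list[dict]:
    """Poll CPU and memory utilization once per interval for duration_s seconds."""
    samples = []
    deadline = time.monotonic() + duration_s
    while time.monotonic() < deadline:
        samples.append({
            "ts": time.time(),
            # cpu_percent(interval=...) blocks for the interval,
            # so the loop naturally paces itself.
            "cpu_percent": psutil.cpu_percent(interval=interval_s),
            "mem_percent": psutil.virtual_memory().percent,
        })
    return samples

if __name__ == "__main__":
    for s in sample_resources(duration_s=10.0):
        print(s)
```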