A Practical Introduction to Automatic Audio Segmentation Using Deep Learning

In Proceedings of: Summer School: Deep Learning for Language Analysis - September, 2019

Goal

Train a deep neural network on VGGish audio features to automatically segment an audio file into speech and non-speech parts.

Setup

Please follow the instructions below in order.

Install and setup the following:
1. docker
2. git
Clone this repository.
- Or, download it if you don't have git.
Start Docker on your machine.
- Check in Docker settings if appropriate sharing of drive is setup
- Check in Docker settings if appropriate CPU/RAM limits are setup

Build a docker image (Terminal on Linux/macOS, or Powershell on Windows):

# change directory to the clone, if not already
# replace ~ below with where you cloned this repository
cd ~/UoC-ml-school-2019

# Build a docker image
docker build --tag uoc:2019 $PWD

Run a docker container with the built image (serving a Jupyter Lab instance)

# assuming you are in UoC-ml-school-2019 directory

# run the created image, exposing Jupyter Lab port to 8888, 
# and mounting the current directory inside the container
docker run -it -p 8888:8888 -v $PWD/:/ml-school uoc:2019 jupyter lab

Open the served Jupyter Lab instance in the browser by following instructions in the terminal.
- There will be a URL in the terminal that you can copy and then paste in the browser.
Check setup by opening and following the instructions 00-check-setup.ipynb.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

A Practical Introduction to Automatic Audio Segmentation Using Deep Learning

Setup

Files

README.md

Latest commit

History

README.md

File metadata and controls

A Practical Introduction to Automatic Audio Segmentation Using Deep Learning

Setup