Prediction of jobs submitted to one of Purdue's central computing clusters

This assignment deals with predicting failure of application executions (referred to as “job”) on Purdue ITaP’s central computing cluster. This is data that we have collected, collated, and analyzed as part of a project from the National Science Foundation (NSF) (Computer System Failure Data Repository to Enable Data-Driven Dependability Research,” Proposal No. CNS-1513197).

For each job, we have data about the resources the job uses and whether the job succeeded or failed. The resources for which we have data are:

Memory
Network
Local IO
Network File System (NFS)

We are releasing the training data, which has about 8% failure data (this is referred to as the “positive class”). You will build Machine Learning models in Python to predict whether a job will fail or not, given the resource usage data. We will evaluate your model on some test data that we are not releasing now and that we will use later at the time of the evaluation.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
train_data.csv		train_data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prediction of jobs submitted to one of Purdue's central computing clusters

About

Releases

Packages

moiz1235/application-failure-prediction

Folders and files

Latest commit

History

Repository files navigation

Prediction of jobs submitted to one of Purdue's central computing clusters

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages