GSoC_2015_project_deep_learning
Taking last year's project to a new level.
- Theofanis Karaletsos
- Sergey (github: lisitsyn, IRC: lisitsyn)
Pretty difficult. Requires:
- C++ programming
- Understanding what neural nets are and how we learn them
- OpenCL programming is a plus
- Understanding of inference in graphical models, variational inference is a plus
- Experience with stochastic optimization is a plus
Last year we had an introductory project on deep learning, completed by Khaled Nasr, which introduced the most essential components for using neural networks with Shogun. That project implemented the most common feedforward architectures, autoencoders, and Boltzmann machines. This year we wish to devote more time to deep directed generative models with a Bayesian treatment of parameter learning. Generative models are immensely powerful for many tasks, and using deep networks to parametrize exponential family distributions in the form of sigmoid belief networks is an old idea that has recently received a lot of attention with astounding results. We aim to include the newest results in that field in Shogun to facilitate many applied settings.

The project will include implementation of variational inference machinery for graphical models with factors given by MLPs, optimization methods, and applications. The first part is heavy on mathematics and will require a good understanding of probability theory and graphical models. The second part consists of including various stochastic optimizers in Shogun, as plain stochastic gradient descent is not enough nowadays to train complex models well. The third part will consist of delivering convincing demonstrations of the speed, performance, and scalability of the implemented models in a series of applications. Given time, we will also offer the chance to explore novel developments in this field.
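To make the variational-inference part concrete, here is a minimal NumPy sketch (illustrative only, not Shogun API) of the reparameterization trick from the Kingma and Welling paper listed below: a toy encoder produces the parameters of a Gaussian q(z|x), a sample is drawn as z = mu + sigma * eps, and a single-sample Monte Carlo estimate of the evidence lower bound (ELBO) is formed. All names, shapes, and the linear "MLPs" are assumptions made for brevity.

```python
import numpy as np

# Sketch of a single-sample ELBO estimate with the reparameterization trick
# ("Auto-Encoding Variational Bayes", Kingma and Welling).
# Names and shapes are illustrative, not Shogun API.

rng = np.random.default_rng(0)

def encoder(x, W_mu, W_logvar):
    """Toy linear 'MLP' producing the parameters of q(z|x) = N(mu, diag(sigma^2))."""
    return W_mu @ x, W_logvar @ x            # mu, log sigma^2

def decoder(z, W_dec):
    """Toy decoder producing Bernoulli means p(x|z)."""
    return 1.0 / (1.0 + np.exp(-(W_dec @ z)))

def elbo_sample(x, W_mu, W_logvar, W_dec):
    mu, logvar = encoder(x, W_mu, W_logvar)
    eps = rng.standard_normal(mu.shape)
    z = mu + np.exp(0.5 * logvar) * eps       # reparameterized sample, differentiable in mu and logvar
    p = decoder(z, W_dec)
    log_lik = np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))
    # KL(q(z|x) || N(0, I)) in closed form for diagonal Gaussians
    kl = -0.5 * np.sum(1 + logvar - mu**2 - np.exp(logvar))
    return log_lik - kl

x = rng.integers(0, 2, size=8).astype(float)  # one binary "observation"
d_z = 2
W_mu = rng.normal(size=(d_z, 8)) * 0.1
W_logvar = rng.normal(size=(d_z, 8)) * 0.1
W_dec = rng.normal(size=(8, d_z)) * 0.1
print("single-sample ELBO estimate:", elbo_sample(x, W_mu, W_logvar, W_dec))
```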
- First of all, you'd have to dive into the existing code. We believe the easiest way to do that is through small iterative changes that improve code quality
- Training neural nets on the CPU alone is really slow, and we are actually pretty close to having GPU training in Shogun. The next step is to finish up GPU training and finally train a deep net on a GPU
- We expect this project to touch on active research problems, so you may even come up with something genuinely new!
Deep learning is everywhere - it is on the bleeding edge of research. Most importantly, recent work gives deep learning the benefit of a Bayesian treatment, and the goal of this project is to marry deep nets and variational inference in the spirit of Reverend Bayes.
- "Auto-Encoding Variational Bayes" by Kingma and Welling
- "Stochastic Backpropagation and Approximate Inference in Deep Generative Models" by Rezende et al.
- "Neural Variational Inference and Learning in Belief Networks" by Mnih and Gregor
- "Deep Exponential Families" by Ranganath, Blei et al.
- "Neural Networks for Machine Learning" by Geoffrey Hinton (https://www.coursera.org/course/neuralnets)
- "Deep learning" by Yoshua Bengio, Ian Goodfellow and Aaron Courville (http://www.iro.umontreal.ca/~bengioy/dlbook/)
- Speed up any NN implementation in Shogun using linalg, multicore, and GPU!
- Implement a recurrent neural network (paper)
- Implement LSTM (long short term memory) (paper)
- Implement stochastic optimizer with Hessian Free Optimization (paper)
- Implement stochastic optimizer rmsProp (paper); a sketch of the update rule appears after this list
- Implement new initializations for neural nets as per MSR paper
- Add a leaky ReLU neural layer (a sketch of the activation appears after this list)
- Reproduce the "Text understanding from scratch" neural model with a notebook
- Support DotFeatures in neural networks
- Add preprocessor layer for neural network
- Output gradient information in neural nets
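For the rmsProp entrance task, here is a minimal sketch of the update rule described in Hinton's Coursera lectures listed above: keep a decaying average of squared gradients and divide each gradient step by its root. The quadratic toy objective and all names are illustrative assumptions, not Shogun's optimizer interface.

```python
import numpy as np

# Minimal rmsProp sketch: per-parameter adaptive steps from a running
# average of squared gradients. Illustrative only, not Shogun API.

def rmsprop_step(w, grad, cache, lr=1e-2, decay=0.9, eps=1e-8):
    cache = decay * cache + (1 - decay) * grad**2   # running mean of squared gradients
    w = w - lr * grad / (np.sqrt(cache) + eps)      # scale each step by the root of that mean
    return w, cache

# Toy use: minimize f(w) = 0.5 * ||w||^2, whose gradient is simply w.
w = np.array([5.0, -3.0])
cache = np.zeros_like(w)
for _ in range(1000):
    w, cache = rmsprop_step(w, grad=w, cache=cache)
print("w after rmsProp:", w)   # oscillates near the minimizer [0, 0] with amplitude on the order of lr
```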
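For the leaky ReLU layer task, here is a minimal sketch of the activation and its derivative (standalone NumPy, not Shogun's neural layer classes): negative inputs are scaled by a small slope alpha instead of being zeroed, which avoids dead units during backpropagation.

```python
import numpy as np

# Leaky ReLU activation and its derivative; alpha and the test values are illustrative.

def leaky_relu(x, alpha=0.01):
    return np.where(x > 0, x, alpha * x)

def leaky_relu_grad(x, alpha=0.01):
    return np.where(x > 0, 1.0, alpha)     # derivative used during backpropagation

x = np.array([-2.0, -0.5, 0.0, 1.5])
print(leaky_relu(x))        # [-0.02  -0.005  0.     1.5  ]
print(leaky_relu_grad(x))   # [ 0.01   0.01   0.01   1.   ]
```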