Identifier Suggestion

Models for source code identifier suggestion built by learning from Big Code.

Architecture

VSCode Extension Demo

Setup

Requirements

Have Python 3.7 or above installed.
Have a Unix-like terminal with bash, coreutils and some other utilities like wget (required by some .sh scripts).

Installation

Run pip install -r requirements/dev.txt to install all Python package dependencies.

Installation with a virtual environment

For example, you can use the standard venv:

Run python3 -m venv .venv to create a virtual environment.
Run source .venv/bin/activate to activate the virtual environment. This step must be executed on every new terminal session.
Run pip install -r requirements/dev.txt to install all Python package dependencies.

Training the model

Run ./src/scripts/baseline/train.sh from this directory with all necessary packages pre-installed.

Serving the model

Run ./src/scripts/baseline/serve.sh from this directory with all necessary packages pre-installed.

Project Structure

Follows the Cookiecutter Data Science project structure.

├── LICENSE
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── external       <- Data from third party sources.
│   ├── interim        <- Intermediate data that has been transformed.
│   ├── meta           <- Data about the data.
│   ├── processed      <- The final, canonical data sets for modeling.
│   └── raw            <- The original, immutable data dump.
│
├── experiments        <- Generated JSON files with hyperparameters for random search
│
├── models             <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks          <- Jupyter notebooks. Naming convention is a number (for ordering),
│                         the creator's initials, and a short `-` delimited description, e.g.
│                         `1.0-jqp-initial-data-exploration`.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── thesis         <- Generated graphics and tables used in the thesis document
│
├── requirements       <- The requirements files for reproducing this environment.
│
├── src                <- Source code of the project.
│   ├── common         <- Module with common values and procedures
│   ├── data           <- Scripts for Java method name parsing and data preprocessing
│   ├── evaluation     <- Logic for evaluating model performance
│   ├── metrics        <- Metrics for evaluation of models
│   ├── models         <- Scripts to train models and then use trained models to make predictions
│   ├── pipelines      <- Training pipelines
│   ├── preprocessing  <- Data preprocessing logic
│   ├── scripts        <- Scripts for running model training or serving
│   ├── server         <- Model server
│   ├── utils          <- Utility functions
│   └── visualization  <- Scripts to create exploratory and results oriented visualizations
│
└── vscode-extension   <- Extension for VSCode

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Identifier Suggestion

Architecture

VSCode Extension Demo

Setup

Requirements

Installation

Installation with a virtual environment

Training the model

Serving the model

Project Structure

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 313 Commits
data		data
experiments		experiments
notebooks		notebooks
reports/thesis		reports/thesis
requirements		requirements
src		src
vscode-extension		vscode-extension
.gitignore		.gitignore
.jtconf		.jtconf
LICENSE		LICENSE
README.md		README.md
thesis.pdf		thesis.pdf

License

antonpetkoff/identifier-suggestion

Folders and files

Latest commit

History

Repository files navigation

Identifier Suggestion

Architecture

VSCode Extension Demo

Setup

Requirements

Installation

Installation with a virtual environment

Training the model

Serving the model

Project Structure

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages