RAMPAGE - A Training and Comparing AGD Detectors Framework

RAMPAGE: (fRAMework to comPAre aGd dEtectors) is a framework aimed at training and comparing machine learning models for the detection of Algorithmically Generated Domains (AGDs).

Features

Training and testing of machine learning models for detecting AGDs
A structured, clean and straightforward approach to comparing machine learning models
Possibility of extending the base data types to add new features to dataset elements
Automated management of datasets
Simple way to share models and results

Installation

RAMPAGE runs on Python 3.11. To use RAMPAGE, it is necessary to create a pip package and install it. For now, it has been decided not to upload it to PyPI.

To generate the package, execute the following command:

python3 setup.py sdist bdist_wheel

The package will be generated in the dist/ directory. To install it, use pip to install the file with the .whl extension that has been created. E.g.:

pip3 install dist/RAMPAGE-1.0-py3-none-any.whl

Usage

In the examples/ directory, there is a sample execution that includes a series of models. Below is an explanation of how to set them up.

Requirements

It is necessary to install the packages of the machine learning frameworks used in examples. To do this, execute the following command (it is recommended to use a virtualized Python environment):"

pip install -r requirements.txt

Example of use

`main.py`

The main.py file contains the primary execution code. It should import RAMPAGE Framework and DatasetManager class.

from RAMPAGE.Framework import Framework
from PersonaldatasetManager import PersonalDatasetManager

Next, DatasetManager is then defined within RAMPAGE Framework, and the datasets to be used are added.

# Create Framework
framework = Framework()
# Create DatasetManager1 implementation
datasetManager = PersonalDatasetManager()
# Set percentajes to use in train, validation and test
datasetManager.setPercentages(70,15,15)
# Set in framework the dataset to use
framework.defineDatasetManager(datasetManager)
# Add dga dataset
framework.addDataset(PATH_DGA, True)
# Add Non dga dataset
framework.addDataset(PATH_NON_DGA, True)

The next step is to define the classifiers, train, and test them.

# Define classifiers
classifiers = [
    LSTM_example,
    CNN_example,
    Baseline_example
]
for classifier in classifiers:
    classifierObj = classifier()
    framework.addClassifier(classifierObj)
# Train all classifiers
framework.train()
# Test all classifiers
framework.test()

After executing the classifier tests, we can retrieve and display the results.

# Get results
results = framework.getResults()
# Show results
for i in range(len(results)):
    print(framework.getClassifierByIndex(i).__class__.__name__)
    print(framework.getResultByIndex(i).toString())

Datasets Definition

For managing datasets, two classes need to be considered: DataElement and DatasetManager. DataElement represents a single unit with all its features. In the base version, it only includes the domain and a boolean indicating whether the domain should be classified as malicious or not. If new fields or features need to be added, two new classes must be created, inheriting from DataElement and DatasetManager, respectively.

In DataElement, you need to add as many attributes to the class as the number of features you want to include. In DatasetManager, the parseDataElement function must be overridden so that it can read the new fields of the updated DataElement.

Result

Result class is special, as it is empty by default. Therefore, a new class that inherits from Result should be created, where the desired metrics for the statistics to be measured will be implemented. E.g.:

class ResultPersonal(Result):

    def __init__(self, accuracy, precision, recall):
        self.accuracy = accuracy
        self.precision = precision
        self.recall = recall

    def toString(self) -> str:
        ret = " * Accuracy   -> " + str(self.accuracy) + "\n"
        ret = ret + " * Precision  -> " + str(self.precision) + "\n"
        ret = ret + " * Recall     -> " + str(self.recall)
        return ret
    
    def toCSVheader(self, separator:str) -> str:
        ret = "accuracy" + separator + "precision" + separator + "recall"
        return ret

    def toCSV(self, separator:str) -> str:
        ret = str(self.accuracy) + separator + str(self.precision)
        ret = ret + separator + str(self.recall)
        return ret

Classifier

The classifiers inherit from the RAMPAGE Classifier class. To do so, they must implement the train and test functions. A proposed implementation could be as follows:

class Example(Classifier):

    #Define your configuration values
    ...

    def __init__(self) -> None:
        # Define your model
        ...
        
    def train(self, train:set, validation:set):
        # Define your training method
        ...


    def test(self, test:set) -> Result:
        ...
        return PersonalResult(...)

License

Licensed under the GNU GPLv3 license.

How to cite

If you are using this software, please cite it as follows:

TBD

Funding support

Part of this research was supported by the Spanish National Cybersecurity Institute (INCIBE) under Proyectos Estratégicos de Ciberseguridad -- CIBERSEGURIDAD EINA UNIZAR and by the Recovery, Transformation and Resilience Plan funds, financed by the European Union (Next Generation).

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
RAMPAGE		RAMPAGE
examples		examples
misc/img		misc/img
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAMPAGE - A Training and Comparing AGD Detectors Framework

Features

Installation

Usage

Requirements

Example of use

`main.py`

Datasets Definition

Result

Classifier

License

How to cite

Funding support

About

Releases 1

Packages

Languages

License

reverseame/RAMPAGE

Folders and files

Latest commit

History

Repository files navigation

RAMPAGE - A Training and Comparing AGD Detectors Framework

Features

Installation

Usage

Requirements

Example of use

main.py

Datasets Definition

Result

Classifier

License

How to cite

Funding support

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

`main.py`

Packages