Optimizing Neural Network Training and Quantization with Rooted Logistic Objectives (AISTATS 2025) 🚀
First-order methods are widely employed for training neural networks used in practical applications. For classification of input features, Cross-Entropy-based loss functions are often preferred since they are differentiable everywhere. Recent optimization results show that the convergence properties of first-order methods such as gradient descent are intricately tied to the separability of datasets and the induced loss landscape. We introduce Rooted Logistic Objectives (RLO) to improve practical convergence behavior, with benefits for downstream tasks. We show that our proposed loss satisfies strict convexity properties and has better condition number properties that benefit practical implementations. To evaluate the proposed RLO, we compare its performance on various classification benchmarks. Our results illustrate that the training procedure converges faster with RLO in many cases. Furthermore, on two downstream tasks, viz. post-training quantization and finetuning in the quantized space, we show that RLO makes it possible to ensure lower performance degradation than state-of-the-art methods while using reduced precision for sequence prediction tasks in large language models.
If you find this project useful, please give us a star and cite:
@inproceedings{wang2025optimizing,
  title={Optimizing Neural Network Training and Quantization with Rooted Logistic Objectives},
  author={Wang, Zhu and Veluswami, Praveen Raj and Mishra, Harsh and Ravi, Sathya N},
  booktitle={The 28th International Conference on Artificial Intelligence and Statistics},
  year={2025},
  url={https://openreview.net/forum?id=g5ml9INmja}
}

@article{wang2023accelerated,
  title={Accelerated Neural Network Training with Rooted Logistic Objectives},
  author={Wang, Zhu and Veluswami, Praveen Raj and Mishra, Harsh and Ravi, Sathya N},
  journal={arXiv preprint arXiv:2310.03890},
  year={2023}
}
Datasets:
- CIFAR-10
- CIFAR-100
- Tiny-ImageNet
- Food-101
- ImageNet1k
More coming...
Models:
- VGG
- ResNet (18, 34, 50, 101)
- ViT (small, base, large)
- vit_timm for finetuning
- CaiT
- Swin (base)
- OPT
- Llama2
- Llama3
More coming...
Loss functions:
- cross entropy
- focal
- RLO 😍
Training:
Default settings: dataset: cifar10, net: ViT, loss: root, epochs: 200, k: 3, m: 3
python train.py
Other settings example:
python train.py --dataset cifar100 --net Swin --k 8 --m 10
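For reference, below is a minimal sketch of how the loss choice shown in the defaults above (loss: root, with hyperparameters k and m) could be wired into a training step. The `RootedLogisticLoss` name, its import path, and its `(k, m)` constructor are assumptions made only for illustration; the actual class and the exact form of the objective are the ones defined in this repository's loss code.
```python
# Illustrative sketch only: RootedLogisticLoss, its module path, and its (k, m)
# signature are assumptions; use the implementation shipped in this repository.
import torch
import torch.nn as nn

def build_criterion(name: str, k: int = 3, m: int = 3) -> nn.Module:
    """Map the configured loss name to a criterion (focal omitted for brevity)."""
    if name == "cross entropy":
        return nn.CrossEntropyLoss()
    if name == "root":
        from rlo.losses import RootedLogisticLoss  # hypothetical import path
        return RootedLogisticLoss(k=k, m=m)
    raise ValueError(f"unsupported loss: {name}")

# Used exactly like CrossEntropyLoss inside the training loop:
criterion = build_criterion("cross entropy")     # swap to "root" with the repo on the path
logits = torch.randn(8, 10, requires_grad=True)  # batch of 8, 10 classes (e.g. CIFAR-10)
targets = torch.randint(0, 10, (8,))
loss = criterion(logits, targets)
loss.backward()
```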
Finetune OPT with RLO:
Default settings: dataset: wikitext2, model: facebook/opt-125m, epochs: 3, k: 5, m: 5
python ft_opt.py
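As a rough illustration of what this finetuning step does, the sketch below loads OPT-125M with Hugging Face transformers and computes the shifted next-token loss explicitly; that explicit cross-entropy call is the place where the rooted objective would be swapped in. The optimizer, learning rate, and data handling here are placeholders, not the actual settings of ft_opt.py.
```python
# Sketch, not ft_opt.py itself: hyperparameters and data handling are placeholders.
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-125m")
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

batch = tokenizer("The quick brown fox jumps over the lazy dog.", return_tensors="pt")
logits = model(**batch).logits                    # (1, seq_len, vocab_size)

# Next-token prediction: token t predicts token t+1.
shift_logits = logits[:, :-1, :].contiguous()
shift_labels = batch["input_ids"][:, 1:].contiguous()
loss = F.cross_entropy(                           # <-- the line RLO would replace
    shift_logits.view(-1, shift_logits.size(-1)),
    shift_labels.view(-1),
)
loss.backward()
optimizer.step()
```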
Quantization:
CUDA_VISIBLE_DEVICES=0 python opt.py model_name wikitext2 --wbits 2 --quant ldlqRG --incoh_processing --save save_path
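To make the --wbits setting concrete, here is a generic round-to-nearest per-channel weight quantizer. This is only a baseline illustration of what low-bit weight quantization does; it is not the ldlqRG / incoherence-processing method used by opt.py.
```python
# Baseline round-to-nearest quantizer for illustration; not the repo's ldlqRG method.
import torch

def quantize_rtn(w: torch.Tensor, wbits: int = 2) -> torch.Tensor:
    """Symmetric per-output-channel quantization to `wbits` bits, returned dequantized."""
    qmax = 2 ** (wbits - 1) - 1                         # 1 for 2-bit symmetric levels
    scale = w.abs().amax(dim=1, keepdim=True).clamp(min=1e-8) / qmax
    q = torch.round(w / scale).clamp(-qmax - 1, qmax)   # snap to the integer grid
    return q * scale                                    # dequantize back to float

w = torch.randn(256, 512)                               # a weight matrix
print((w - quantize_rtn(w, wbits=2)).abs().mean())      # average quantization error
```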
We use the official PyTorch implementation of StyleGAN2-ADA from https://github.com/NVlabs/stylegan2-ada-pytorch/ to demonstrate the effect of replacing the original cross-entropy loss with the rooted loss. Clone the official StyleGAN2-ADA code with the command below.
git clone https://github.com/NVlabs/stylegan2-ada-pytorch.git
Steps to reproduce our experiments:
- Please follow the instructions in their documentation to prepare the datasets. Store the 'FFHQ' and 'Stanford Dogs' images in their respective folders under './train_dataset'.
- Make the appropriate changes to 'loss.py' as given in our repository (a sketch of where the change sits appears after this list).
- Change the values of the variables 'kparam' and 'ls' as required. Default settings: kparam=2; ls='rlo'
- Refer to the file 'commands.txt' to find example commands for respective tasks/experiments.
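For orientation, the sketch below shows where the 'kparam' / 'ls' switch described above would sit inside StyleGAN2-ADA's loss.py (StyleGAN2Loss.accumulate_gradients, which uses softplus-based non-saturating logistic losses). The branch structure and variable names follow the steps above; the rooted expression itself is deliberately left as a placeholder and should be copied from the loss.py in this repository.
```python
# Illustrative only: shows where 'kparam' / 'ls' select the rooted loss inside
# StyleGAN2-ADA's loss.py; copy the actual rooted expression from our loss.py.
import torch
import torch.nn.functional as F

kparam = 2     # root hyperparameter (default, per the steps above)
ls = 'rlo'     # 'rlo' selects the rooted loss; anything else keeps the original

def generator_main_loss(gen_logits: torch.Tensor) -> torch.Tensor:
    if ls == 'rlo':
        # Placeholder: substitute the rooted objective from our loss.py, using kparam.
        return F.softplus(-gen_logits)
    # Original StyleGAN2-ADA non-saturating loss: -log(sigmoid(gen_logits)).
    return F.softplus(-gen_logits)

print(generator_main_loss(torch.randn(4, 1)).mean())
```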