This project implements the methodology and experiments discussed in the paper. It explores how input reranking improves the performance of Large Language Models (LLMs) on symmetric tasks. The repository is organized into several subdirectories and files for clarity and modularity.
Contains plots and reports generated for the paper. These visuals illustrate the experimental results, trends, and key insights.
Contains CSV files with the results of experiment runs. These include raw and processed data used for analysis.
Task-specific code and notebooks are organized here. Each subdirectory corresponds to a different task:
Handles the Query Database Task:
db_queries:1.ipynb
: Executes the first part of the algorithm, where relevance estimations are obtained from helper models and inputs are ranked accordingly.db_queries:2.ipynb
: Executes the second part, where reranked inputs are queried using the LLM.
Handles the Graph Degree Task:
- Contains two notebooks similar to
./tasks/db
, focusing on reranking and querying for graph-related tasks.
Contains utility implementations:
- Bipartite algorithm for reranking.
- Functions for making API calls to interact with LLMs.
A standalone notebook that measures the LLM's output error for varying graph sizes in the Graph Degree Task. Provides baseline insights for the experiments.