Skip to content

Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.

License

Notifications You must be signed in to change notification settings

ggeop/DataDialogueLLM

Repository files navigation

Data Dialogue
Elevate Your Data Insights


Docker Python GitHub release (latest by date)

Python Release Workflow Black Code Formatter Check Pydantic v2 License: MIT

🎉 Welcome to Data Dialogue

Transform the way you interact with data using AI-powered conversations

DataDialogue is an innovative application that bridges the gap between your data and natural language understanding. By leveraging advanced AI capabilities, it enables intuitive data exploration and analysis through conversational interfaces.

A user can select a LLM model and connect it with a data source and do his analysis by writing simple prompts instead of writing SQL queries or code.

We're currently in an exciting preview phase! Here's what that means for you:

  • 💡 Early Access: Get a first look at cutting-edge AI-data interaction
  • 🔄 Regular Updates: Frequent improvements and new features
  • 👥 Community Driven: Your feedback shapes the future of DataDialogue
  • 🐞 Bug Reports Welcome: Help us polish the experience

Please don't hesitate to open an issue or submit a pull request.

✨ Key Features

  • Natural language querying of databases
  • Integration with multiple language models
    • Integration with HuggingFace models
    • Integration with Commercial models (currently with Google LLMs)
  • Flexible architecture for extending to different data sources
  • RESTful API for easy integration
  • Secure query execution with protection against harmful SQL commands
  • Create validated Queries (+ Copy option)
  • Results Visualization
  • Supported Models: Choose between HuggingFace & public providers
  • Data Sources: Suppoorts connection with multiple data sources

Supported Providers

ProviderStatus
Google✔️
HF✔️
OpenAI✔️
Anthropic✔️

Supported Data Sources

SourceStatus
PostgreSQL✔️
MySQL✔️
Databricks✔️
Mongo DB✔️
CSV✔️

🎯 Getting Started

UI Screenshot UI Screenshot

Prerequisites

  • Docker
  • Docker Compose

Run

  1. Clone the repository:

    git clone https://github.com/ggeop/data-dialogue.git
    cd data-dialogue
  2. Run Data Dialogue services:

    docker-compose --env-file .env.prod up
    
  3. Access the frontend at http://localhost:5000 in your web browser.

  4. Try the Demo: Click the video above to see it in action

  5. Choose Your Model:

    • Local: HuggingFace models (requires sufficient computing power)
    • Cloud: Commercial LLMs (recommended for lighter setups)

Pro Tip: Get your FREE Google Gemini API key for the best cloud-based experience!

📑 Development

For local development and setup instructions, please refer to our Local Development Setup Guide.

The guide includes:

  • Local environment setup for Windows and Linux
  • Code style configuration with Black
  • Development workflows
  • Common issues and troubleshooting
  • Contributing guidelines

🤝 Contributing

Your feedback and contributions make DataDialogue better! If you have:

  • 💭 Feature suggestions
  • 🐞 Bug reports
  • 💡 General feedback

Please read our Contributing Guidelines before submitting any pull requests.

♥️ Support and Community

✋ Need Help?

Feel free to:

  • Open an issue on GitHub
  • Ask questions in pull requests
  • Contact project maintainers

📎 License

Distributed under the MIT License. See LICENSE for more information.