[website] DSpy Blogpost (#463)

hal9ai · Jan 6, 2025 · 769694c · 769694c
1 parent 8bed595
commit 769694c

Showing 4 changed files with 146 additions and 1 deletion.
diff --git a/website/blog/2025-06-01-Hal9_DSPy/DSPy.md b/website/blog/2025-06-01-Hal9_DSPy/DSPy.md
@@ -0,0 +1,140 @@
+---
+slug: Hal9-DSpy
+title: "Building a PDF Chat Agent with DSPy and Hal9" 
+authors: [luis]
+tags: [DSPy, RAG, Chain of Thought]
+---
+
+import ThemedImage from '../../src/components/themedimg.jsx'
+
+### **Harnessing DSPy for Simplified AI Programming**
+
+DSPy is a framework for programming Large Language Models (LLMs) through high-level abstractions rather than low-level prompt engineering. Instead of manually crafting complex prompts, you describe each task with a **signature**, a declarative specification of its inputs and outputs. This makes programs easier to construct, because you no longer get bogged down in prompt formatting or manual adjustments.
+
+With DSPy, you can build advanced AI-powered systems like a PDF chat agent with much less effort, shifting the focus from managing prompts to constructing modular, efficient AI programs. This post will demonstrate how to leverage DSPy’s powerful modules to create a document-interacting chatbot, while using Hal9’s robust infrastructure to deploy and interact with OpenAI's LLMs effortlessly.
+
+---
+
+### **DSPy Fundamentals**
+
+#### **Signatures: Simplifying Tasks**
+
+At the heart of DSPy are **signatures**, which specify what an LLM should do in plain language. Instead of manually crafting detailed prompts, you declare the task’s input and output types. For example:
+
+```python
+'sentence -> sentiment: bool'
+'document -> summary'
+```
+
+This high-level abstraction allows you to focus on the desired behavior rather than the exact phrasing of the prompt. The power of signatures lies in their simplicity and adaptability, which make them an excellent tool for developers who want to interact with LLMs without needing to know the intricacies of prompt engineering. With DSPy, you can specify what you want—whether it’s summarizing a document or extracting sentiment—and let the framework handle the underlying complexity.
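+
+To make the notation concrete, the sketch below splits a string signature into named input and output fields. `describe_signature` is a hypothetical helper written for illustration, not DSPy's own parser; it assumes untyped fields default to `str` and does not handle types that contain commas.
+
+```python
+def describe_signature(signature):
+    # Inputs are listed left of "->", outputs to the right
+    inputs, outputs = signature.split("->")
+
+    def fields(side):
+        parsed = {}
+        for part in side.split(","):
+            # "sentiment: bool" -> name "sentiment", type "bool"
+            name, _, type_name = part.strip().partition(":")
+            parsed[name.strip()] = type_name.strip() or "str"
+        return parsed
+
+    return {"inputs": fields(inputs), "outputs": fields(outputs)}
+
+spec = describe_signature("sentence -> sentiment: bool")
+rag_spec = describe_signature("context, question -> response")
+```
+
+Here `spec` names one input (`sentence`) and one boolean output (`sentiment`), while `rag_spec` names two inputs and one output, matching the signature used for the RAG module later in this post.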
+
+#### **Modules: The Building Blocks**
+
+Once you've defined your task with a signature, DSPy uses **modules** to handle different prompting techniques. These modules abstract away the details of the interaction, offering an easy way to apply complex prompting strategies.
+
+Modules can incorporate advanced techniques like **Chain of Thought** (CoT) or **ReAct**, allowing you to tailor the prompt's behavior for specific tasks. For example, if you need reasoning steps for a complex task, you might use the **Chain of Thought** module, which helps the LLM break down the problem into smaller, manageable pieces.
+
+DSPy also offers flexibility by enabling you to create custom modules to suit more specialized use cases. These modules work as building blocks that can be connected together to form more complex pipelines.
+
+#### **Chain of Thought: Structured Reasoning**
+
+One key module, **Chain of Thought**, facilitates structured reasoning, which can enhance the performance of the model for complex tasks. This method allows the LLM to think step-by-step before generating an answer, reducing errors that might arise from direct responses. For example, when asked a complex question, the LLM may break the process into logical steps, providing a more comprehensive response.
+
+By using Chain of Thought, you can enable the model to deliver responses with more nuance and accuracy, perfect for applications requiring deeper understanding or reasoning.
+
+#### **Plugging in Any LLM: Focused on Programming, Not Prompting**
+
+One of DSPy’s key strengths is the ability to integrate any LLM of your choice with ease. In this case, we’ve seamlessly integrated DSPy with Hal9's proxy infrastructure.
+
+```python
+import dspy
+# Route OpenAI requests through the Hal9 proxy
+lm = dspy.LM('openai/gpt-4-turbo', api_key='hal9', api_base='https://api.hal9.com/proxy/server=https://api.openai.com/v1/')
+dspy.configure(lm=lm)
+```
+
+#### **Retrieval-Augmented Generation (RAG)**
+
+RAG combines LLM reasoning with retrieval mechanisms. Instead of relying solely on the model's internal knowledge, RAG searches an external corpus (e.g., document chunks) to retrieve relevant context. This retrieved information is then fed into the LLM for accurate and context-aware answers. DSPy simplifies the creation of RAG pipelines, as you'll see in the implementation.
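+
+To illustrate the retrieval step, here is a minimal sketch of top-k retrieval by cosine similarity. The three-dimensional vectors and the `top_k` helper are made up for illustration; a real pipeline would use embeddings produced by a model and a proper retriever.
+
+```python
+import math
+
+def cosine_similarity(a, b):
+    # Dot product divided by the product of the vector norms
+    dot = sum(x * y for x, y in zip(a, b))
+    norm_a = math.sqrt(sum(x * x for x in a))
+    norm_b = math.sqrt(sum(y * y for y in b))
+    return dot / (norm_a * norm_b)
+
+def top_k(query_vec, corpus_vecs, corpus_texts, k=2):
+    # Score every chunk against the query and keep the k best matches
+    scored = sorted(
+        zip(corpus_texts, corpus_vecs),
+        key=lambda pair: cosine_similarity(query_vec, pair[1]),
+        reverse=True,
+    )
+    return [text for text, _ in scored[:k]]
+
+# Toy "embeddings" (hypothetical values, not produced by a real model)
+corpus_texts = ["refund policy", "shipping times", "privacy notice"]
+corpus_vecs = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.0], [0.0, 0.1, 0.9]]
+query_vec = [0.8, 0.2, 0.0]  # stands in for an embedded user question
+
+context = top_k(query_vec, corpus_vecs, corpus_texts, k=2)
+```
+
+The chunks in `context` are what would be passed to the LLM alongside the question.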
+
+### **Building a Document-Interacting Chatbot**
+
+Now that we understand the principles of DSPy, Chain of Thought, and RAG, let’s dive into building a simple chatbot that interacts with PDF documents. We’ll break this process down into manageable sections.
+
+#### **Step 1: Text Processing**
+
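+Before we can interact with the document, we need to extract its text and split it into chunks. Here, we download a PDF, extract its text, and split it into smaller chunks that can be searched efficiently. The `overlap` parameter lets consecutive chunks share words so that context at a chunk boundary is not lost; it defaults to 0, which produces no overlap.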
+
+```python
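+from io import BytesIO
+
+import fitz  # PyMuPDF
+import requests
+
+def extract_text_from_pdf(pdf_url):
+    # Download the PDF and concatenate the text of every page
+    response = requests.get(pdf_url)
+    response.raise_for_status()
+    pdf_document = fitz.open(stream=BytesIO(response.content), filetype="pdf")
+    return "".join(page.get_text() for page in pdf_document)
+
+def split_text(text, n_words=300, overlap=0):
+    # overlap must be smaller than n_words, otherwise the loop never advances
+    if not 0 <= overlap < n_words:
+        raise ValueError("overlap must satisfy 0 <= overlap < n_words")
+    words = text.split()
+    chunks = []
+    start = 0
+    while start < len(words):
+        chunk = words[start:start + n_words]
+        chunks.append(" ".join(chunk))
+        start += n_words - overlap
+    return chunks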
+```
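+
+As a quick check of how `n_words` and `overlap` interact, the sketch below runs a compact copy of `split_text` on a ten-word string. The sample sentence and the small parameter values are chosen only for illustration.
+
+```python
+def split_text(text, n_words=300, overlap=0):
+    # Same logic as above: advance by n_words - overlap words per chunk
+    words = text.split()
+    chunks = []
+    start = 0
+    while start < len(words):
+        chunks.append(" ".join(words[start:start + n_words]))
+        start += n_words - overlap
+    return chunks
+
+sample = "one two three four five six seven eight nine ten"
+
+no_overlap = split_text(sample, n_words=4, overlap=0)
+# ['one two three four', 'five six seven eight', 'nine ten']
+
+with_overlap = split_text(sample, n_words=4, overlap=1)
+# ['one two three four', 'four five six seven', 'seven eight nine ten', 'ten']
+```
+
+Note that with a nonzero overlap the final chunk can be a short tail (`'ten'` here) that is already fully contained in the previous chunk.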
+
+#### **Step 2: Embedding Generation**
+
+Once we’ve split the document into chunks, the next step is generating an embedding for each chunk of text. These embeddings let the retriever find the passages most relevant to a question about the document.
+
+```python
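+import numpy as np
+
+# `openai_client` is assumed to be an `openai.OpenAI` client created elsewhere
+def generate_embeddings(texts):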
+    embeddings = []
+    for text in texts:
+        response = openai_client.embeddings.create(input=text, model="text-embedding-3-small")
+        embeddings.append(response.data[0].embedding)
+    return np.array(embeddings)
+```
+
+#### **Step 3: Setting Up the RAG Module**
+
+Now, we can define the **RAG (Retrieval-Augmented Generation)** module, which will handle the process of searching through document chunks and generating responses. In this example, we'll use a simple Chain of Thought module to handle question-answering based on document context.
+
+```python
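+class RAG(dspy.Module):
+    def __init__(self):
+        super().__init__()
+        # Chain of Thought predictor: reasons over the retrieved context before answering
+        self.respond = dspy.ChainOfThought('context, question -> response')
+
+    def forward(self, question, search):
+        # `search` is a retriever callable whose result exposes `.passages`
+        context = search(question).passages
+        return self.respond(context=context, question=question)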
+```
+
+#### **Step 4: Generate a Response**
+
+Finally, we bring everything together into a working example. The user provides a prompt (e.g., a question), and the chatbot uses the RAG system to retrieve relevant document chunks and generate a coherent response:
+
+```python
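+# Assumption: `pdf_url` and `prompt` are supplied by the user
+chunks = split_text(extract_text_from_pdf(pdf_url))
+embedder = dspy.Embedder(generate_embeddings)
+search = dspy.retrievers.Embeddings(embedder=embedder, corpus=chunks, k=5)
+rag = RAG()
+response = rag(question=prompt, search=search)
+print(response.response)  # the generated answer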
+```
+
+### **Check the Results**
+
+Curious about how it performs? Try the PDF-interacting chatbot yourself! Upload a PDF or provide a link, and see how it answers questions based on the document's content. [Click here to test the chatbot](https://hal9.com/luis/dspy).
+
+<center><a href="https://hal9.com/luis/dspy"><ThemedImage src="dspy"/></a></center>
+
+---
+
+### **Conclusion**
+
+DSPy shifts the work of programming large language models (LLMs) from manual prompt engineering to program design. With abstractions like signatures and modules, it simplifies the creation of complex systems, such as a chatbot that interacts with documents using Retrieval-Augmented Generation (RAG).
+
+This integration between DSPy and Hal9 showcases how you can develop scalable, modular applications with ease. By following the steps in this blog, you’ve seen how DSPy enables a seamless approach to AI programming, helping developers stay focused on building logic rather than managing prompts.
+
+### **References**
+
+- [DSPy GitHub Repository](https://github.com/stanfordnlp/dspy)  
+- [Pipelines and Prompt Optimization with DSPy](https://www.dbreunig.com/2024/12/12/pipelines-prompt-optimization-with-dspy.html)  
+- [Exploratory Tour of DSPy](https://medium.com/the-modern-scientist/an-exploratory-tour-of-dspy-a-framework-for-programing-language-models-not-prompting-711bc4a56376)
diff --git a/website/blog/authors.yml b/website/blog/authors.yml
@@ -11,4 +11,9 @@ diego:
 karan:
   name: Karan Saxena
   title: Data Analyst
-  url: https://github.com/Karan0310
+  url: https://github.com/Karan0310
+
+luis:
+  name: Luis Guillen
+  title: Generative AI Engineer
+  url: https://github.com/LuisGuillen03
diff --git a/website/static/screenshots/dspy-dark.png b/website/static/screenshots/dspy-dark.png
diff --git a/website/static/screenshots/dspy-light.png b/website/static/screenshots/dspy-light.png