Skip to main content

2 posts tagged with "RAG"

View All Tags

Inverted HyDE: Solving Real-World Dense Retrieval Challenges

· 9 min read
Hieu Nguyen
AI Engineer

Dense retrieval systems have revolutionized how we search through large document collections, but the gap between theoretical breakthroughs and production reality often reveals unexpected challenges. While HyDE (Hypothetical Document Embeddings) showed impressive results in research settings, its real-world deployment faces critical bottlenecks that limit its practical adoption. Enter Inverted HyDE - a clever twist that maintains the core benefits while addressing the fundamental production constraints.

RAG In Production - Best Practices Notes

· 10 min read
Hieu Nguyen
AI Engineer

Retrieval-Augmented Generation (RAG) method are transforming the landscape of natural language processing by combining the strengths of retrieval-based and generative models (LLMs). When deployed in production, RAG systems can provide more accurate and contextually relevant responses. This guide outlines best practices for implementing RAG models in a production environment, ensuring robustness, scalability, and efficiency.