March 16, 2025 7:00 PM PDT

The session titled "From Fantasy to Fact: The Secret Weapon that Crushes AI Hallucinations" provided insights into the challenges of AI hallucinations and introduced retrieval augmented generation (RAG) as a solution. The workshop aimed to enhance understanding of AI's capabilities and limitations, particularly in enterprise applications, emphasizing the importance of accurate and reliable AI responses.

Presenter: Coach Cindy, Director of Data Science, Machine Learning and Frontend Engineering

Meeting Notes

Introduction

AI support agents can enhance customer communication but may generate incorrect responses, known as hallucinations.
The presenter introduced retrieval augmented generation (RAG) as a technology to mitigate these issues by allowing AI agents to access up-to-date information.

AI Hallucinations

Hallucinations occur when AI generates responses that are incorrect or non-existent.
Examples included a fictional scenario where a user inquired about remote work policies in a fictional land, highlighting the AI's inability to recognize the fictional context.
Hallucinations can arise from:
- Domain Shift: AI struggles to apply real-world knowledge to fictional or non-standard scenarios.
- Task Shift: Misunderstanding the user's intent can lead to irrelevant or inaccurate responses.
- Data Restrictions: Limited access to relevant data can result in vague or inaccurate answers.

RAG Architecture

RAG combines a query, context, and response generation to ensure that AI answers are grounded in relevant information.
The architecture includes:
1. Document Ingestion: Splitting documents into chunks for effective processing.
2. Embedding Creation: Converting text into numerical representations (embeddings) for machine learning models.
3. Vector Database: Storing embeddings for efficient retrieval based on semantic similarity.
4. Search and Reranking: Retrieving relevant chunks and ranking them to provide the most accurate response.

Chunking and Embeddings

Documents are split into manageable chunks to reduce noise and improve retrieval accuracy.
The size and overlap of chunks are critical for maintaining context and relevance in responses.
Embeddings transform text into numerical vectors that can be processed by machine learning models.

Search Mechanism

The search process involves querying the vector database to find the nearest relevant chunks.
Reranking ensures that the most relevant results are prioritized in the final response.

Challenges and Considerations

Mismatch and coverage issues can arise if the AI does not retrieve the correct or complete information.
Bias and fairness in AI responses are crucial, as dominant data points can skew results.
Continuous tuning and evaluation of the RAG system are necessary to improve accuracy and reliability.

Future Learning Opportunities

The presenter announced upcoming classes focused on advanced topics related to RAG, including query expansion and hybrid search mechanisms.
Participants were encouraged to engage in hands-on projects to solidify their understanding of the concepts discussed.

Conclusion

The session concluded with a call to action for participants to join future training sessions and engage with the community for further learning and development in AI and machine learning.

From Fantasy to Fact: The Secret Weapon that Crushes AI Hallucinations