Developed RAG Pipeline and OpenAI API Enhancements
- Day: 2025-05-11
- Time: 14:00 to 15:10
- Project: Dev
- Workspace: WP 2: Operational
- Status: In Progress
- Priority: MEDIUM
- Assignee: Matías Nehuen Iglesias
- Tags: RAG, Openai, API, Development, Conference
Description
Session Goal
The session aimed to enhance the Retrieval-Augmented Generation (RAG) system and integrate new functionalities using OpenAI’s Assistants API.
Key Activities
- Explored potential contributions to the LSFA 2025 conference, focusing on semantic pipelines and networking opportunities.
- Developed a roadmap for the RAG mini-suite, including phases for MVP completion and productization.
- Implemented a Gradio UI module for RAG, enabling context-based question answering with LLMs.
- Fixed bugs in the RAG UI, including missing functions and tab configuration issues.
- Updated the
run_llm()function to comply with OpenAI API version 1.0.0. - Explored manual vs. native in-model RAG retrieval methods.
- Created a starter script for an OpenAI-native Assistant-based RAG pipeline.
- Updated OpenAI Assistants API tool types and methods for message creation with attachments.
- Addressed file attachment limits in the Assistants v2 API and proposed a hybrid RAG approach using a vector database.
Achievements
- Completed the implementation of the RAG UI module and fixed related bugs.
- Successfully updated the
run_llm()function and integrated it into the existing system. - Developed strategic options for RAG document retrieval using OpenAI Assistants.
Pending Tasks
- Further exploration and implementation of strategic RAG options for enhanced document retrieval.
- Continued development of the RAG mini-suite for full productization and outreach.
- Preparation and submission of proposals for LSFA 2025.
Evidence
- source_file=2025-05-11.sessions.jsonl, line_number=1, event_count=0, session_id=e05a36cb258201154db58f5341112ca3d3742107a0fde26e4861e427489c1448
- event_ids: []