Developed RAG Pipeline and OpenAI API Enhancements

  • Day: 2025-05-11
  • Time: 14:00 to 15:10
  • Project: Dev
  • Workspace: WP 2: Operational
  • Status: In Progress
  • Priority: MEDIUM
  • Assignee: Matías Nehuen Iglesias
  • Tags: RAG, Openai, API, Development, Conference

Description

Session Goal

The session aimed to enhance the Retrieval-Augmented Generation (RAG) system and integrate new functionalities using OpenAI’s Assistants API.

Key Activities

  • Explored potential contributions to the LSFA 2025 conference, focusing on semantic pipelines and networking opportunities.
  • Developed a roadmap for the RAG mini-suite, including phases for MVP completion and productization.
  • Implemented a Gradio UI module for RAG, enabling context-based question answering with LLMs.
  • Fixed bugs in the RAG UI, including missing functions and tab configuration issues.
  • Updated the run_llm() function to comply with OpenAI API version 1.0.0.
  • Explored manual vs. native in-model RAG retrieval methods.
  • Created a starter script for an OpenAI-native Assistant-based RAG pipeline.
  • Updated OpenAI Assistants API tool types and methods for message creation with attachments.
  • Addressed file attachment limits in the Assistants v2 API and proposed a hybrid RAG approach using a vector database.

Achievements

  • Completed the implementation of the RAG UI module and fixed related bugs.
  • Successfully updated the run_llm() function and integrated it into the existing system.
  • Developed strategic options for RAG document retrieval using OpenAI Assistants.

Pending Tasks

  • Further exploration and implementation of strategic RAG options for enhanced document retrieval.
  • Continued development of the RAG mini-suite for full productization and outreach.
  • Preparation and submission of proposals for LSFA 2025.

Evidence

  • source_file=2025-05-11.sessions.jsonl, line_number=1, event_count=0, session_id=e05a36cb258201154db58f5341112ca3d3742107a0fde26e4861e427489c1448
  • event_ids: []