πŸ“… 2025-06-10 β€” Session: Developed Editorial Workflow and AzureML Pipelines

πŸ•’ 21:20–23:15
🏷️ Labels: Editorial Workflow, Azureml, Jinja, Openai Api, Data Processing
πŸ“‚ Project: Dev
⭐ Priority: MEDIUM

Session Goal: The session aimed to develop structured workflows for editorial processes and enhance AzureML pipelines for data processing and automation.

Key Activities:

  • Designed a structured editorial workflow pipeline to process articles from raw input to publication, detailing each stage’s inputs, processes, and outputs.
  • Defined systematic prompt templates for article processing, including CSV parsing, agenda generation, and annotation.
  • Explored function calling in the OpenAI API, focusing on defining functions for structured data handling.
  • Implemented fuzzy row selection in AzureML using OpenAI’s function calling, modifying YAML-based DAGs.
  • Addressed limitations in function calling with LLMs, improving prompt engineering techniques.
  • Developed robust function call schemas for article parsing and clustering, and agenda generation.
  • Created Jinja prompts for parsing, clustering, and generating seed ideas and articles, ensuring structured output and data fidelity.
  • Designed minimal starter pipelines for LLM screening and streamlined AzureML PromptFlow pipelines.
  • Provided guidance on saving pandas DataFrames as JSONL and adjusted column mappings in AzureML pipelines.

Achievements:

  • Successfully outlined editorial and AzureML workflows, enhancing automation and data processing capabilities.

Pending Tasks:

  • Further testing and integration of the designed pipelines and prompts to ensure seamless operation and data integrity.