π 2025-06-10 β Session: Developed Editorial Workflow and AzureML Pipelines
π 21:20β23:15
π·οΈ Labels: Editorial Workflow, Azureml, Jinja, Openai Api, Data Processing
π Project: Dev
β Priority: MEDIUM
Session Goal: The session aimed to develop structured workflows for editorial processes and enhance AzureML pipelines for data processing and automation.
Key Activities:
- Designed a structured editorial workflow pipeline to process articles from raw input to publication, detailing each stageβs inputs, processes, and outputs.
- Defined systematic prompt templates for article processing, including CSV parsing, agenda generation, and annotation.
- Explored function calling in the OpenAI API, focusing on defining functions for structured data handling.
- Implemented fuzzy row selection in AzureML using OpenAIβs function calling, modifying YAML-based DAGs.
- Addressed limitations in function calling with LLMs, improving prompt engineering techniques.
- Developed robust function call schemas for article parsing and clustering, and agenda generation.
- Created Jinja prompts for parsing, clustering, and generating seed ideas and articles, ensuring structured output and data fidelity.
- Designed minimal starter pipelines for LLM screening and streamlined AzureML PromptFlow pipelines.
- Provided guidance on saving pandas DataFrames as JSONL and adjusted column mappings in AzureML pipelines.
Achievements:
- Successfully outlined editorial and AzureML workflows, enhancing automation and data processing capabilities.
Pending Tasks:
- Further testing and integration of the designed pipelines and prompts to ensure seamless operation and data integrity.