πŸ“… 2024-12-28 β€” Session: Developed GitHub Repository Analysis Framework

πŸ•’ 17:20–17:45
🏷️ Labels: Github, Portfolio, Data Science, Automation, Software Development
πŸ“‚ Project: Business
⭐ Priority: MEDIUM

Session Goal

The primary objective of this session was to enhance GitHub portfolios for professionals in data science, policy analysis, and software development through the β€˜Repos Revamp and Curation’ project.

Key Activities

  • Project Overview: Reviewed the objectives and expected outcomes of the β€˜Repos Revamp and Curation’ project.
  • Parser Design: Designed parsers for extracting structured information from repository markdown files, detailing key information and output schema.
  • Financial Impact Calculation: Explored methods for calculating the impact of inflation on ABL and other taxes, including Python code for iterative calculations.
  • Database Design: Developed a reconciled table structure for integrating parsing goals and managing repositories, technical insights, and portfolio strategies.
  • API Data Management: Designed a reference table for GitHub API data, enhancing parsers table, and creating a merged view for analysis.
  • Schema Development: Created a unified repository analysis schema and a consolidated repository analysis table, integrating insights and technical details.

Achievements

  • Completed the design of parsers and schemas for repository markdown files and GitHub API data.
  • Developed a comprehensive framework for repository analysis, ensuring no redundancy and clear separation of data sources.

Pending Tasks

  • Implement the designed parsers and schemas into the existing workflow.
  • Validate the impact calculation method for inflation on ABL with real data examples.