Developed News Database with Google Tools

  • Day: 2024-05-23
  • Time: 20:45 to 21:45
  • Project: Media
  • Workspace: WP 1: Strategic / Growth & Development
  • Status: In Progress
  • Priority: MEDIUM
  • Assignee: Matías Nehuen Iglesias
  • Tags: Google Cloud, BigQuery, News Database, Data Collection, Automation

Description

Session Goal

The session aimed to produce a plan for a news database built on Google tools, specifically Google News RSS for data collection and Google BigQuery for storage and analysis.
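
As a rough illustration of the collection side of this goal, the sketch below builds a Google News RSS search URL and flattens a feed into rows that could later be loaded into BigQuery. The endpoint, the query parameter names (q, hl, gl, ceid), the function names, and the row fields are all assumptions made for this example; the session log does not record the actual code.

```python
import urllib.parse
import xml.etree.ElementTree as ET

# Assumed feed endpoint; not taken from the session log.
GOOGLE_NEWS_RSS = "https://news.google.com/rss/search"


def build_feed_url(query: str, lang: str = "en-US", country: str = "US") -> str:
    """Build a search feed URL (parameter names are an assumption)."""
    params = {
        "q": query,
        "hl": lang,
        "gl": country,
        "ceid": f"{country}:{lang.split('-')[0]}",
    }
    return f"{GOOGLE_NEWS_RSS}?{urllib.parse.urlencode(params)}"


def parse_feed(xml_text: str) -> list[dict]:
    """Turn RSS 2.0 XML into a list of flat dicts, one per <item>."""
    root = ET.fromstring(xml_text)
    rows = []
    for item in root.iter("item"):
        rows.append({
            # findtext returns the default when the child element is missing.
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "published": item.findtext("pubDate", default=""),
            "source": item.findtext("source", default=""),
        })
    return rows
```

Keeping the parsing separate from the download means the parser can be exercised on a saved feed file without network access; the fetch step (for example with urllib.request.urlopen) is left out here on purpose.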

Key Activities

  • Discussed the options for replacing or removing the catalytic converter in a Peugeot 207, including legal and cost considerations.
  • Explored a scientific approach to human rights issues, inspired by Néstor Kirchner, emphasizing evidence-based analysis and international solidarity.
  • Outlined a detailed project plan for a news database using Google Cloud tools, focusing on data collection, storage, and analysis.
  • Provided a modular project structure for news data collection and analysis, with directories for data, scripts, notebooks, configuration, and tests.
  • Detailed the setup of an exploratory data analysis notebook for Google News data, including initial configuration and data storage in BigQuery.
  • Offered guidance on resolving import issues with Google BigQuery libraries in Python notebooks.
  • Provided a step-by-step guide for setting up a project on Google Cloud Platform, enabling necessary APIs, creating service credentials, and configuring the BigQuery client.
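
The import check and client configuration steps above might look roughly like the following. This is a minimal sketch, assuming the google-cloud-bigquery and google-auth packages are installed and a service account key file has been downloaded; the helper names, the key path, and the project, dataset, and table names are hypothetical and do not come from the session.

```python
import importlib.util


def bigquery_available() -> bool:
    """Report whether the google-cloud-bigquery package can be imported."""
    try:
        return importlib.util.find_spec("google.cloud.bigquery") is not None
    except ModuleNotFoundError:
        # Raised when a parent package ("google" or "google.cloud") is missing.
        return False


def full_table_id(project: str, dataset: str, table: str) -> str:
    """Build the project.dataset.table identifier BigQuery expects."""
    return f"{project}.{dataset}.{table}"


def make_client(key_path: str, project: str):
    """Create a BigQuery client from a service account key file."""
    # Imported lazily so this module still loads in a notebook where the
    # library is not installed yet; run `pip install google-cloud-bigquery`
    # in that environment if bigquery_available() returns False.
    from google.cloud import bigquery
    from google.oauth2 import service_account

    credentials = service_account.Credentials.from_service_account_file(key_path)
    return bigquery.Client(project=project, credentials=credentials)


def insert_rows(client, table_id: str, rows: list[dict]) -> None:
    """Stream rows into an existing table; raise if BigQuery reports errors."""
    errors = client.insert_rows_json(table_id, rows)
    if errors:
        raise RuntimeError(f"BigQuery rejected some rows: {errors}")
```

A notebook cell could then call make_client("config/service-account.json", "my-project") and insert_rows(client, full_table_id("my-project", "news", "articles"), rows), where every one of those names is a placeholder. Note that insert_rows_json expects the target table to exist already.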

Achievements

  • Established a clear framework and execution plan for creating a news database using Google Cloud tools.
  • Developed a modular structure for organizing the project and conducting exploratory data analysis.
  • Resolved technical issues related to library imports and Google Cloud configuration.
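
For reference, the modular structure mentioned above could be scaffolded with a few lines of Python. The directory names below mirror the categories listed in this log (data, scripts, notebooks, configuration, tests), but the exact names and the scaffold function are assumptions rather than the layout actually created in the session.

```python
from pathlib import Path

# Assumed directory names, one per category named in the log.
PROJECT_DIRS = ("data", "scripts", "notebooks", "config", "tests")


def scaffold(root: str) -> list[Path]:
    """Create the project directories under root and return their paths."""
    created = []
    for name in PROJECT_DIRS:
        path = Path(root) / name
        # exist_ok makes the call safe to repeat on an existing project.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```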

Pending Tasks

  • Implement the outlined plan for the news database, focusing on automation and AI integration for data collection and analysis.

Evidence

  • source_file=2024-05-23.sessions.jsonl, line_number=0, event_count=0, session_id=aa4a68bd23472f27fe0d0308d3b1773a9a0c21e22b45d2b5e96f85af65955175
  • event_ids: []