Developed News Database with Google Tools
- Day: 2024-05-23
- Time: 20:45 to 21:45
- Project: Media
- Workspace: WP 1: Strategic / Growth & Development
- Status: In Progress
- Priority: MEDIUM
- Assignee: Matías Nehuen Iglesias
- Tags: Google Cloud, Bigquery, News Database, Data Collection, Automation
Description
Session Goal
The primary aim of this session was to develop a comprehensive plan for creating a news database using Google tools, specifically Google News RSS and Google BigQuery.
Key Activities
- Discussed the options for replacing or removing the catalytic converter in a Peugeot 207, including legal and cost considerations.
- Explored a scientific approach to human rights issues, inspired by Néstor Kirchner, emphasizing evidence-based analysis and international solidarity.
- Outlined a detailed project plan for a news database using Google Cloud tools, focusing on data collection, storage, and analysis.
- Provided a modular project structure for news data collection and analysis, with directories for data, scripts, notebooks, configuration, and tests.
- Detailed the setup of an exploratory data analysis notebook for Google News data, including initial configuration and data storage in BigQuery.
- Offered guidance on resolving import issues with Google BigQuery libraries in Python notebooks.
- Provided a step-by-step guide for setting up a project on Google Cloud Platform, enabling necessary APIs, creating service credentials, and configuring the BigQuery client.
Achievements
- Established a clear framework and execution plan for creating a news database using Google Cloud tools.
- Developed a modular structure for organizing the project and conducting exploratory data analysis.
- Resolved technical issues related to library imports and Google Cloud configuration.
Pending Tasks
- Implement the outlined plan for the news database, focusing on automation and AI integration for data collection and analysis.
Evidence
- source_file=2024-05-23.sessions.jsonl, line_number=0, event_count=0, session_id=aa4a68bd23472f27fe0d0308d3b1773a9a0c21e22b45d2b5e96f85af65955175
- event_ids: []