Developed LinkedIn Web Scraping Strategy
- Day: 2024-01-19
- Time: 15:45 to 16:00
- Project: Dev
- Workspace: WP 2: Operational
- Status: In Progress
- Priority: MEDIUM
- Assignee: Matías Nehuen Iglesias
- Tags: Web Scraping, Linkedin, Python, Beautifulsoup, Requests
Description
Session Goal
The session aimed to develop a comprehensive strategy for scraping job postings from LinkedIn using Python, while ensuring compliance with LinkedIn’s terms of service.
Key Activities
- Web Scraping LinkedIn Job Postings: Explored a structured approach using Python libraries like BeautifulSoup and requests, handling authentication, and making HTTP requests.
- Analysis of LinkedIn Redirection Script: Reflected on a JavaScript snippet used for redirection, understanding its tracking and redirection mechanisms and implications for scraping.
Achievements
- Developed a clear strategy for LinkedIn job scraping, including necessary technical considerations and compliance aspects.
- Gained insights into LinkedIn’s redirection scripts and their impact on web scraping efforts.
Pending Tasks
- Further exploration of LinkedIn’s terms of service to ensure full compliance in web scraping activities.
Evidence
- source_file=2024-01-19.sessions.jsonl, line_number=0, event_count=0, session_id=6459a8286f80a43a6a3b4a6071f43e11889416985931e9b5e84c44e3422e23e9
- event_ids: []