Developed LinkedIn Web Scraping Strategy

  • Day: 2024-01-19
  • Time: 15:45 to 16:00
  • Project: Dev
  • Workspace: WP 2: Operational
  • Status: In Progress
  • Priority: MEDIUM
  • Assignee: Matías Nehuen Iglesias
  • Tags: Web Scraping, Linkedin, Python, Beautifulsoup, Requests

Description

Session Goal

The session aimed to develop a comprehensive strategy for scraping job postings from LinkedIn using Python, while ensuring compliance with LinkedIn’s terms of service.

Key Activities

  • Web Scraping LinkedIn Job Postings: Explored a structured approach using Python libraries like BeautifulSoup and requests, handling authentication, and making HTTP requests.
  • Analysis of LinkedIn Redirection Script: Reflected on a JavaScript snippet used for redirection, understanding its tracking and redirection mechanisms and implications for scraping.

Achievements

  • Developed a clear strategy for LinkedIn job scraping, including necessary technical considerations and compliance aspects.
  • Gained insights into LinkedIn’s redirection scripts and their impact on web scraping efforts.

Pending Tasks

  • Further exploration of LinkedIn’s terms of service to ensure full compliance in web scraping activities.

Evidence

  • source_file=2024-01-19.sessions.jsonl, line_number=0, event_count=0, session_id=6459a8286f80a43a6a3b4a6071f43e11889416985931e9b5e84c44e3422e23e9
  • event_ids: []