π 2024-07-12 β Session: Automated Google Search Script Enhancement
π 19:50β20:05
π·οΈ Labels: Python, Web Scraping, Automation, Google Search, CSV
π Project: Dev
β Priority: MEDIUM
Session Goal
The goal of this session was to enhance and automate the process of retrieving profile links from a CSV file using Google searches.
Key Activities
- Developed a Python script to automate Google searches for profile links using libraries like
requests
,BeautifulSoup
, andgooglesearch-python
. - Corrected the keyword argument in the
google_search
function fromnum
tonum_results
to match the libraryβs function definition. - Updated the
google_search
function by removing thepause
argument and usingsleep_interval
for request delays. - Managed Google search usage limits by considering IP bans and legal issues, and explored solutions like using official APIs, rate limiting, and proxies.
- Created a script to structure extracted profile links into a DataFrame with separate columns.
- Set up a rotating proxy pool for ethical web scraping.
Achievements
- Successfully automated the Google search process for profile links.
- Ensured compliance with search limits and ethical standards.
Pending Tasks
- Further testing and validation of the proxy pool setup.
- Continuous monitoring of script performance and compliance with Googleβs terms of service.