Qualifications
● Bachelor’s degree in Computer Science, Software Engineering, Data Science, or a related
field (or equivalent work experience).
● Proven experience in web scraping and data extraction techniques.
● Strong proficiency in Python and web scraping libraries (e.g., Scrapy, BeautifulSoup,
Selenium, Playwright).
● Hands-on experience with JSONPath and XPath for structured data extraction.
● Knowledge of task/job scheduling tools (e.g., Celery, Apache Airflow, Cron, Redis Queue).
● Experience with large-scale data scraping, including proxy management and handling anti-bot mechanisms.
● Understanding of web technologies such as HTML, CSS, and JavaScript, and of the HTTP protocol.
● Strong problem-solving and debugging skills.
● Ability to work independently and in a collaborative team environment.
● Excellent communication skills, with the ability to explain technical concepts clearly.
Nice to Have
● Experience with cloud platforms (AWS, Google Cloud, Azure) for scalable scraping
solutions.
● Knowledge of database management (SQL, NoSQL) and data processing pipelines.
● Familiarity with containerization and orchestration tools (Docker, Kubernetes).
● Experience with CI/CD pipelines for automated deployment of web scraping scripts.
● Contributions to open-source web scraping projects.
● Understanding of AI and machine learning applications in web data extraction.