Responsibilities
- Design, build, test, and deploy reliable web scraping tools using Python libraries and frameworks such as Playwright, Selenium, Requests, and BeautifulSoup
- Create and manage scalable asynchronous scraping systems designed for high-volume data collection
- Develop and refine anti-detection techniques, including proxy rotation, to maintain consistent scraper operation
- Ensure uninterrupted data flow by building automated pipelines and integrating with external RESTful services
- Continuously monitor, troubleshoot, and enhance scraper efficiency, stability, and output accuracy
- Work closely with engineering teams to improve internal scraping platforms, logging, and monitoring solutions
- Support DevOps initiatives involving containerization with Docker, CI/CD pipelines, and Linux-based server management
Work Arrangement
Remote (Worldwide)