What You'll Do
Design and maintain scalable web scraping solutions that supply accurate, timely data for business-critical reporting. Refactor legacy scripts to improve efficiency, readability, and long-term sustainability. Implement resilient architectures that handle dynamic web content, complex DOM hierarchies, and session management with precision.
Employ advanced techniques such as header manipulation, user-agent cycling, and proxy rotation—including residential and data center networks—to maintain access and avoid detection. Develop monitoring systems that detect failures early and trigger alerts, ensuring high uptime and reliability.
Collaborate directly with data analysts and internal teams to define data needs, validate outputs, and refine collection strategies. Share best practices and documentation to support proper use of scraped data across workflows. Continuously assess performance, identify bottlenecks, and propose improvements in tooling, frameworks, or methodologies.
Requirements
- At least three years of hands-on experience with tools like Selenium, Playwright, or Puppeteer for automated web data extraction
- Solid grasp of HTTP protocols, RESTful APIs, HTML parsing, and browser rendering behavior, including TLS/SSL interactions
- Proven ability to implement fingerprint spoofing, request signature tuning, and other anti-detection strategies
- Experience managing cookies, headers, session lifecycles, and rotating proxy infrastructures
- Strong troubleshooting skills to diagnose and resolve performance, scalability, and reliability issues
- Proficiency in logging, metrics collection, and alerting systems to maintain system health
- Clear communication skills in English, both with technical peers and non-technical collaborators
Benefits
- Competitive salary and comprehensive benefits package
- Vacation time and parental leave
- Reimbursement for learning and professional development
- Regular team events and collaborative opportunities
- Perks designed to support well-being and productivity
- Clear growth path based on impact, not tenure or internal politics
- Supportive environment that values mastery, ownership, and continuous improvement
Work Environment
This is a fully remote position for candidates based in India. Standard working hours are from 11:00 AM to 8:00 PM IST, with some flexibility allowed. You’ll operate within a culture rooted in transparency, respect, trust, and personal accountability. The company emphasizes learning, skill development, and meaningful contributions over hierarchy.
Equal Opportunity Employer
We uphold equal employment opportunity for all individuals, regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender identity, veteran status, or any other protected characteristic. Our workplace is inclusive, respectful, and committed to fairness.