Responsibilities
- Design and build robust backend systems for data processing using Apache Spark
- Write and refine SQL queries to extract, transform, and analyze large datasets
- Create and manage scalable data workflows and ETL pipelines
- Work closely with team members to understand project needs and deliver reliable solutions
- Participate in peer code reviews and follow established software development standards
- Use Git for version control and team collaboration
- Identify, diagnose, and fix technical problems in data systems
- Actively suggest and implement improvements to processes, infrastructure, and teamwork