Responsibilities
- Develop backend components and interfaces that enable AI model inference operations
- Support deployment, evaluation, and performance testing of models on specialized AI hardware
- Evaluate inference efficiency and contribute to identifying areas for improvement
- Produce clear, sustainable code with mentorship from experienced engineers
- Work with the engineering team to enhance the stability, user experience, and speed of the inference server infrastructure
Work Arrangement
Hybrid — Belgrade, Serbia