Nagarro is looking for a Senior Staff Engineer, Generative AI to architect and implement scalable, enterprise-grade GenAI and Agentic AI solutions. You will be responsible for converting business needs into robust technical designs and ensuring the performance and reliability of our AI systems within a dynamic, non-hierarchical work culture.
What You'll Do
- Understand client business use cases and technical requirements and convert them into a technical design.
- Map decisions with requirements and translate them to developers.
- Identify different solutions and narrow down the best option that meets client requirements.
- Define guidelines and benchmarks for non-functional requirement considerations during project implementation.
- Write and review design documents explaining overall architecture, framework, and high-level design.
- Review architecture and design on aspects like extensibility, scalability, security, and design patterns.
- Develop and design the overall solution for defined functional and non-functional requirements.
- Understand and relate technology integration scenarios and apply these learnings in projects.
- Resolve issues raised during code review through systematic root cause analysis.
- Carry out proof-of-concepts to ensure suggested designs or technologies meet requirements.
What We're Looking For
- Total experience of 8+ years.
- Deep understanding of LLMs (e.g., GPTs, Llama, Claude, Gemini, Qwen, Mistral, BERT-family models) and their architectures (Transformers).
- Expert-level prompt engineering skills and proven experience implementing RAG patterns.
- High proficiency in Python and standard AI/ML libraries (e.g., LangChain, LlamaIndex, LangGraph, LangSmith, Hugging Face Transformers, Scikit-learn, PyTorch/TensorFlow).
- Strong experience with fine-tuning and distillation techniques and evaluation.
- Strong experience using managed AI/ML services on the target cloud platform (e.g., Azure Machine Learning Studio, AI Foundry).
- Strong understanding of vector databases (e.g., Weaviate, Neo4j).
- Understanding of GenAI evaluation metrics (e.g., BLEU, ROUGE, perplexity, semantic similarity, human evaluation).
- Ability to write high-quality, production-ready Python code with strong testing and maintainability practices.
- Ability to productionize AI systems on Azure or AWS, ensuring enterprise-grade reliability and performance.
- Ability to build and expose APIs using FastAPI, integrating with databases through an ORM.
- Ability to scale GenAI solutions to support enterprise workloads.
- Strong ability to both architect and code GenAI/Agentic AI solutions.
- Proven production experience with GenAI deployments on Azure or AWS.
- Ability to build & deploy AI pipelines using SageMaker, Vertex AI, or Azure ML.
- Hands-on experience with Docker, Kubernetes, and CI/CD pipelines (GitHub Actions, Argo) for scalable AI infrastructure.
- Hands-on experience with serverless AI APIs, containerized model serving, and GPU orchestration.
- Experience with Infrastructure as Code (Terraform / Bicep) and cloud monitoring tools.
- Experience with data pipelines via Airflow, Kafka, or Databricks.
- Strong experience in scaling AI solutions in live environments.
- Very strong Python programming skills with a track record of clean, efficient, and maintainable code.
- Must have successfully delivered at least one production GenAI/Agentic AI solution.
- Must have proficiency with FastAPI and at least one ORM (e.g., SQLAlchemy, Tortoise ORM).
- Must have experience with Model Context Protocol (MCP).
- Must have contributions to open-source GenAI projects.
- Excellent communication skills and the ability to collaborate effectively with cross-functional teams.
- Bachelor’s or master’s degree in computer science, Information Technology, or a related field.
Nice to Have
- Experience with React (or some other JavaScript frameworks) for building user-facing interfaces and front-end integrations.
Technical Stack
- Languages & Core Frameworks: Python, LangChain, LlamaIndex, LangGraph, LangSmith, Hugging Face Transformers, Scikit-learn, PyTorch/TensorFlow, FastAPI, SQLAlchemy, Tortoise ORM
- Cloud & AI Services: Azure Machine Learning Studio, AI Foundry, Azure, AWS, SageMaker, Vertex AI, Azure ML
- Infrastructure & Orchestration: Docker, Kubernetes, GitHub Actions, Argo, Terraform, Bicep, Airflow, Kafka, Databricks
- Databases: Weaviate, Neo4j
Nagarro is an equal opportunity employer.





