Nagarro is hiring a Senior Staff Engineer, Generative AI

About the Role

Nagarro is looking for a Senior Staff Engineer, Generative AI to architect and implement scalable, enterprise-grade GenAI and Agentic AI solutions. You will be responsible for converting business needs into robust technical designs and ensuring the performance and reliability of our AI systems within a dynamic, non-hierarchical work culture.

What You'll Do

  • Understand client business use cases and technical requirements and convert them into a technical design.
  • Map decisions with requirements and translate them to developers.
  • Identify different solutions and narrow down the best option that meets client requirements.
  • Define guidelines and benchmarks for non-functional requirement considerations during project implementation.
  • Write and review design documents explaining overall architecture, framework, and high-level design.
  • Review architecture and design on aspects like extensibility, scalability, security, and design patterns.
  • Develop and design the overall solution for defined functional and non-functional requirements.
  • Understand and relate technology integration scenarios and apply these learnings in projects.
  • Resolve issues raised during code review through systematic root cause analysis.
  • Carry out proof-of-concepts to ensure suggested designs or technologies meet requirements.

What We're Looking For

  • Total experience of 8+ years.
  • Deep understanding of LLMs (e.g., GPTs, Llama, Claude, Gemini, Qwen, Mistral, BERT-family models) and their architectures (Transformers).
  • Expert-level prompt engineering skills and proven experience implementing RAG patterns.
  • High proficiency in Python and standard AI/ML libraries (e.g., LangChain, LlamaIndex, LangGraph, LangSmith, Hugging Face Transformers, Scikit-learn, PyTorch/TensorFlow).
  • Strong experience with fine-tuning and distillation techniques and evaluation.
  • Strong experience using managed AI/ML services on the target cloud platform (e.g., Azure Machine Learning Studio, AI Foundry).
  • Strong understanding of vector databases (e.g., Weaviate, Neo4j).
  • Understanding of GenAI evaluation metrics (e.g., BLEU, ROUGE, perplexity, semantic similarity, human evaluation).
  • Ability to write high-quality, production-ready Python code with strong testing and maintainability practices.
  • Ability to productionize AI systems on Azure or AWS, ensuring enterprise-grade reliability and performance.
  • Ability to build and expose APIs using FastAPI, integrating with databases through an ORM.
  • Ability to scale GenAI solutions to support enterprise workloads.
  • Strong ability to both architect and code GenAI/Agentic AI solutions.
  • Proven production experience with GenAI deployments on Azure or AWS.
  • Ability to build & deploy AI pipelines using SageMaker, Vertex AI, or Azure ML.
  • Hands-on experience with Docker, Kubernetes, and CI/CD pipelines (GitHub Actions, Argo) for scalable AI infrastructure.
  • Hands-on experience with serverless AI APIs, containerized model serving, and GPU orchestration.
  • Experience with Infrastructure as Code (Terraform / Bicep) and cloud monitoring tools.
  • Experience with data pipelines via Airflow, Kafka, or Databricks.
  • Strong experience in scaling AI solutions in live environments.
  • Very strong Python programming skills with a track record of clean, efficient, and maintainable code.
  • Must have successfully delivered at least one production GenAI/Agentic AI solution.
  • Must have proficiency with FastAPI and at least one ORM (e.g., SQLAlchemy, Tortoise ORM).
  • Must have experience with Model Context Protocol (MCP).
  • Must have contributions to open-source GenAI projects.
  • Excellent communication skills and the ability to collaborate effectively with cross-functional teams.
  • Bachelor’s or master’s degree in computer science, Information Technology, or a related field.

Nice to Have

  • Experience with React (or some other JavaScript frameworks) for building user-facing interfaces and front-end integrations.

Technical Stack

  • Languages & Core Frameworks: Python, LangChain, LlamaIndex, LangGraph, LangSmith, Hugging Face Transformers, Scikit-learn, PyTorch/TensorFlow, FastAPI, SQLAlchemy, Tortoise ORM
  • Cloud & AI Services: Azure Machine Learning Studio, AI Foundry, Azure, AWS, SageMaker, Vertex AI, Azure ML
  • Infrastructure & Orchestration: Docker, Kubernetes, GitHub Actions, Argo, Terraform, Bicep, Airflow, Kafka, Databricks
  • Databases: Weaviate, Neo4j

Nagarro is an equal opportunity employer.

Required Skills
PythonLangChainLlamaIndexLangGraphLangSmithHugging Face TransformersPyTorchTensorFlowFastAPISQLAlchemyScikit-learnGenerative AILLMsMachine LearningAPI Development
Invoicing holding you back?

Focus on work, not paperwork

Stop worrying about invoicing, taxes, and compliance. Glopay handles the business setup, you handle the client work. Get paid faster and look professional.

Auto-generated compliant invoices
Built-in expense management
Income reports for tax season
95% of earnings stay with you
Try Glopay free
No credit card needed
About company
Nagarro

A Digital Product Engineering company that builds products, services, and experiences that inspire, excite, and delight. They work at scale across all devices and digital mediums, with over 17000 experts across 39 countries.

Visit website
Job Details
Category data
Posted a month ago