Reply is hiring a Graduate AI Engineer to deliver experience-led, value-focused AI solutions for innovative organizations. You will build bespoke large language models tailored to client business processes, turning proprietary knowledge into competitive advantage by deploying custom LLMs at scale with enterprise-grade performance.
What You'll Do
- Design, develop, and train large language models and AI systems.
- Fine-tune pre-trained LLMs (e.g., GPT, LLaMA, Mistral, Falcon) for specific use cases.
- Build and optimize prompting strategies, Retrieval-Augmented Generation (RAG), and agent-based systems.
- Prepare, clean, and manage large-scale datasets for model training.
- Implement model evaluation, benchmarking, and performance optimization.
- Deploy models into production using scalable and secure architectures.
- Collaborate with cross-functional teams to translate business needs into AI solutions.
- Monitor model performance, manage model drift, iterate improvements, and stay current with the latest research and advancements in AI and LLMs.
What We're Looking For
- A Bachelor’s or Master’s degree (2:1 or higher) in Computer Science, AI, Machine Learning, or a related field (or equivalent experience).
- Strong experience with Python and ML frameworks such as PyTorch or TensorFlow, and hands-on experience training, fine-tuning, or deploying LLMs.
- Solid understanding of NLP, transformers, attention mechanisms, embeddings, and experience with data preprocessing, tokenization, and dataset pipelines.
- Familiarity with REST APIs, microservices, model serving, and MLOps tools (e.g., MLflow, Kubeflow, Airflow, Weights & Biases).
- Experience with cloud platforms (AWS, GCP, Azure), distributed training, model parallelism, inference optimization, and GPU/TPU infrastructure.
- Knowledge of vector databases (e.g., FAISS, Pinecone), security, privacy, and responsible AI practices.
- Strong problem-solving, analytical, and communication skills, with a positive, team-oriented attitude and a passion for continuous learning.
- Willingness to travel within the UK and EU for client engagements as required.
Nice to Have
- Experience with RLHF, open-source contributions, building AI copilots/chatbots, client and stakeholder management, and use of Atlassian tools like Jira and Confluence.
Technical Stack
- Programming & Frameworks: Python, PyTorch, TensorFlow
- Integration & DevOps: REST APIs, MLOps tools (MLflow, Kubeflow, Airflow, Weights & Biases)
- Cloud Infrastructure: AWS, GCP, Azure
- Data & Tools: Vector databases (FAISS, Pinecone), Atlassian tools (Jira, Confluence)
Work Mode
This role follows a local-country work mode and is open to candidates based in the UK and EU.
Reply is an Equal Opportunities Employer and committed to embracing diversity in the workplace. We provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type regardless of age, sexual orientation, gender, identity, pregnancy, religion, nationality, ethnic origin, disability, medical history, skin colour, marital status, parental status, or any other characteristic protected by the Law.


