Hybrid Full-time

fastino.ai is hiring an ML Engineer - Small Language Models

About the Role

Fastino.ai is looking for an ML Engineer to join its mission to develop specialized, efficient AI. You will own the full lifecycle of state-of-the-art small language models, from prototyping and data analysis to deployment and continuous improvement.

What You'll Do

  • Design, build, and deploy the critical small language models foundational to Fastino’s product.
  • Own the full lifecycle of state-of-the-art models, from prototyping to deployment, monitoring, and continuous improvement.
  • Drive the data strategy to improve model performance by analyzing distribution gaps, contributing to synthetic data pipelines, and creating automated annotation systems.
  • Experiment with novel language model architectures, helping drive and execute Fastino's research roadmap.
  • Implement reinforcement learning techniques, including Direct Preference Optimization (DPO) and Group Relative Policy Optimization (GRPO), to align model outputs with human preferences.
  • Build robust evaluations motivated by real-world use.
  • Partner with the Fastino engineering team to ship model updates directly to customers.
  • Establish best practices for code health and documentation to facilitate collaboration and reliable development.

What We're Looking For

  • Advanced degree (Bachelor's or Master's) in Computer Science, Artificial Intelligence, Machine Learning, or a related technical discipline with concentrated study in deep learning or computer vision methodologies.
  • Demonstrated ability to do independent research in academic or industry settings.
  • Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures.
  • Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization.

Technical Stack

  • PyTorch
  • JAX
  • TensorFlow

Team & Environment

You will report directly to the company's founders.

Work Mode

This role operates on a hybrid work model.

Required Skills
PyTorch, JAX, TensorFlow, Machine Learning, Language Models, Small Language Models, ML Engineering, Model Training, Model Optimization, Model Deployment, ML Infrastructure, Software Engineering, Python
About company
fastino.ai

Fastino builds the next generation of LLMs and specializes in efficient AI. Its open-source GLiNER family of models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb.

Job Details
Category: data
Posted a month ago