Work on fundamental problems in LLM-based agentic systems and efficient AI infrastructure, with opportunities to publish your research while making direct impact on production systems serving enterprise customers.
Conduct original research on LLM agent architectures and optimization techniques
Develop and evaluate novel algorithms with both academic rigor and production feasibility
Present your work at internal research seminars and external conferences
Mentor and collaborate with LLM engineers on implementation and deployment

Currently pursuing MS/PhD in Computer Science, Machine Learning, Natural Language Processing, or related fields
Publications at top-tier venues (ICML, NeurIPS, ICLR, ACL, EMNLP, NAACL)
Strong programming skills in Python and PyTorch
Ability to work in-person at our San Francisco office
Ability to work independently and collaborate across research and engineering teams

Experience with self-evolving agent systems
Proficiency in CUDA programming and custom kernel development for LLM operations
Background in reinforcement learning-based LLM fine-tuning
Track record of contributions to production inference systems such as vLLM, TensorRT-LLM, SGLang, or Hugging Face ecosystem
Experience bridging academic research with production systems
Open-source contributions to widely-used ML infrastructure projects

Workato is hiring an Intern, AI Engineering