San Francisco, California, United States | Hybrid | Employment | USD 145,000 - 225,000 yearly

Baselayer is hiring a Machine Learning Engineer

About the Role

Baselayer is seeking a Machine Learning Engineer to build and maintain the machine learning models that power autonomous agents in the go-to-market (GTM) space, with a specific focus on Know Your Customer and Know Your Business (KYC/KYB) processes. You will architect robust ML systems, develop scalable data pipelines, experiment with current LLM fine-tuning techniques, and ensure model governance and compliance.

What You'll Do

  • Build and maintain ML models, integrating them with various data sources to ensure scalability, high performance, and adaptability for autonomous GTM agents.
  • Architect and design core ML services supporting KYC/KYB processes, leveraging knowledge graphs and LLMs for dynamic use cases.
  • Develop and maintain robust data pipelines for feature extraction and transformation, focusing on scalability and performance with large-scale, high-dimensional data.
  • Implement and experiment with state-of-the-art techniques like reinforcement learning from human feedback (RLHF) and parameter-efficient fine-tuning (e.g., LoRA) to improve LLMs for specific identity-related use cases.
  • Build and maintain infrastructure for model training, evaluation, and deployment, creating a scalable platform foundation for innovation.
  • Ensure ML systems meet industry standards for fairness, explainability, and compliance, particularly around KYC/KYB regulations.
  • Implement optimizations for model inference and training, ensuring ML services can efficiently process identity data while maintaining reliability.
  • Design and conduct experiments to evaluate model performance, debug issues, monitor ML services, and continuously improve architectures for diverse data and use cases.

What We're Looking For

  • 4–8 years of experience in machine learning development, working with Python and building ML models.
  • Comfort working with large-scale data and optimizing performance for computationally intensive ML systems.
  • Strong foundation in AI/ML fundamentals, particularly with LLMs, and an eagerness to experiment with emerging techniques.
  • A commitment to responsible AI practices and model governance, especially in regulated environments such as KYC/KYB.
  • A keen eye for detail and pride in writing clean, maintainable code while optimizing for model performance.
  • Ability to thrive in a high-trust, ownership-focused environment and comfort working across different levels of abstraction.
  • A problem-solver who navigates the unknown confidently.
  • A proactive self-starter who thrives in dynamic settings.
  • A strong orientation toward feedback.

Technical Stack

  • Python
  • LLMs
  • Knowledge graphs

Benefits & Compensation

  • Equity package
  • Unlimited vacation
  • Fully paid health, dental, and vision insurance
  • 401(k) with company match
  • Compensation range: $145,000 to $225,000

Work Mode

This is a hybrid role based in San Francisco.

Baselayer is an equal opportunity employer.

Required Skills
Python, LLMs, Knowledge Graphs, Machine Learning, Large-scale Data, Model Performance Optimization, AI/ML Fundamentals, Model Governance, Responsible AI, KYC/KYB
About company
Baselayer

Baselayer is the intelligent business identity platform that helps verify any business, automate KYB, and monitor real-time risk. Trusted by 2,200+ financial institutions, its B2B risk solutions & identity graph network leverage state & federal government filings and proprietary data sources to prevent fraud, accelerate onboarding, and lower credit losses.

Job Details
Department: Data and Analytics
Category: Data
Posted: 14 days ago