Hybrid, Full-time

Bjak is hiring an MLOps Engineer

About the Role

BJAK is looking for an MLOps Engineer focused on building and scaling impactful AI solutions. In this role, you will run and optimize state-of-the-art open-source models, ensuring they are safe, trustworthy, and performant at scale. You will collaborate closely with cross-functional teams across product, engineering, operations, infrastructure, and data.

What You'll Do

  • Run and manage open-source models efficiently, optimizing for cost and reliability.
  • Ensure high performance and stability across GPU, CPU, and memory resources.
  • Monitor and troubleshoot model inference to maintain low latency and high throughput.
  • Collaborate with engineers to implement scalable and reliable model serving solutions.

What We're Looking For

  • Experience with model serving platforms such as vLLM or HuggingFace TGI.
  • Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, or LambdaLabs.
  • Ability to monitor latency and costs, and to scale systems efficiently as traffic demands change.
  • Experience setting up inference endpoints for backend engineers.

Technical Stack

  • vLLM, HuggingFace TGI
  • Kubernetes, Ray, Modal, RunPod, LambdaLabs

Team & Environment

You will work in a flat structure, collaborating closely with regional teams across product, engineering, operations, infrastructure, and data.

Benefits & Compensation

  • Health, dental & vision insurance.
  • Global travel insurance (for you & your dependents).
  • Unlimited, flexible time off.
  • Housing rental subsidies.
  • Quality company cafeteria.
  • Overtime meals.

Work Mode

This is a hybrid role. It is a global position, with the company headquarters located in Malaysia.

BJAK values speed, clarity, and relentless ownership. Our high-density, high-performance team is focused on high-quality work and global impact.

Required Skills
vLLM, HuggingFace TGI, Kubernetes, Ray, Modal, RunPod, LambdaLabs, MLOps, Machine Learning, Infrastructure, Cloud, Distributed Systems, Model Serving, DevOps

About company
Bjak

Bjak is focused on providing access to affordable and sustainable financial services for people in ASEAN. Headquartered in Malaysia, Bjak is the largest insurance portal in Southeast Asia. Its main portal, Bjak.com, helps millions find the insurance policy with the best value and highest coverage.

Job Details
Category: Infrastructure
Posted: 4 months ago