Islamabad, Islamabad Capital Territory, Pakistan | On-site | Full-time

Senior Backend Engineer | Skylabs AI

About the Role

Skylabs AI is seeking a Senior Backend Engineer to lead the architecture and implementation of our entire cloud-native infrastructure for AI agents and developer tools. You will build a scalable, resilient, and secure microservices platform that powers the next generation of intelligent software.

What You'll Do

  • Design, build, deploy, and maintain the platform as a set of resilient, scalable microservices (e.g., auth, payments, agent orchestration).
  • Build the critical systems that manage the lifecycle of long-running agentic tasks, including state management and asynchronous communication.
  • Architect and implement containerized execution environments (Docker, Kubernetes) where agents can safely build, test, and run code.
  • Deploy and manage high-throughput LLM inference servers (NVIDIA Triton, vLLM), and wrap them in a backend service that serves models to our agent systems.
  • Design, secure, and manage core APIs (REST and gRPC), including public-facing MCP/ACP and internal service-to-service communication.
  • Build the high-throughput data ingestion pipeline to process and store massive volumes of telemetry and training data from IDE clients.
  • Implement and manage authentication (OAuth2/OIDC) and payment/subscription (Stripe) systems.
  • Own Infrastructure as Code (Terraform) and build out comprehensive observability (Prometheus, Grafana, Jaeger) across the entire stack.

What We're Looking For

  • Proven ability to design, build, deploy, and maintain a complex system as a set of resilient, scalable, and independent microservices.
  • Mastery of designing clean, secure, and high-performance APIs (both REST and gRPC). Experience with API gateways, versioning, and documentation.
  • Deep practical experience with both SQL (e.g., PostgreSQL, MySQL) and NoSQL (e.g., MongoDB, DynamoDB, Redis) databases, including data modeling, query optimization, and scaling.
  • Experience building and scaling backend systems specifically for LLM use cases, understanding the unique demands of stateful, long-running agentic tasks.
  • Experience building and managing full-stack observability (e.g., using Prometheus, Grafana, ELK/OpenSearch, and distributed tracing like Jaeger).
  • Expertise in implementing robust identity systems, including sign-up, OAuth2/OIDC, JWTs, and fine-grained Role-Based Access Control (RBAC).
  • Experience integrating and managing payment and subscription systems (e.g., Stripe), including metering and subscription logic.
  • Experience designing systems that manage the lifecycle of complex, long-running 'agent' tasks.
  • Deep experience with Docker and Kubernetes (K8s) for orchestrating 'remote sandbox' environments.
  • Expertise in using systems like Kafka, RabbitMQ, or gRPC streams to manage communication between microservices, the IDE plugin, and the AI agents.
  • Hands-on experience deploying and managing high-throughput LLM inference servers like NVIDIA Triton, TGI, or vLLM on GPU-enabled infrastructure.
  • Experience building the backend service that securely wraps inference endpoints, handles request batching, and serves them to the agent orchestrator.
  • Experience designing and building a high-throughput data pipeline to receive, process, and store telemetry and training data.
  • Mastery of Terraform or CloudFormation to provision and manage the entire cloud infrastructure.
  • Experience implementing rate limiting, request validation, and service-to-service authentication (e.g., mTLS).
  • A strong DevOps mindset and experience building automated CI/CD pipelines (e.g., GitLab CI, GitHub Actions) for a microservices environment.

Nice to Have

  • Familiarity with lightweight virtualization (e.g., Firecracker) for remote sandboxing.

Technical Stack

  • Docker, Kubernetes, Kafka, RabbitMQ, gRPC, NVIDIA Triton, vLLM
  • PostgreSQL, MySQL, MongoDB, DynamoDB, Redis
  • OAuth2/OIDC, JWT, Stripe
  • Terraform, CloudFormation, Prometheus, Grafana, Jaeger, ELK/OpenSearch
  • GitLab CI, GitHub Actions, Firecracker

Team & Environment

Work alongside world-class AI researchers and engineers.

Benefits & Compensation

  • Competitive salary in USD
  • Comprehensive health allowance
  • Relocation allowance (if moving to Islamabad)
  • Monthly team events and offsites
  • A beautiful, collaborative office space

Work Mode

This is an on-site position located in Islamabad.

Skylabs AI is an equal opportunity employer.

Required Skills
Docker, Kubernetes, Kafka, RabbitMQ, gRPC, NVIDIA Triton, vLLM, PostgreSQL, MySQL, MongoDB, Python, FastAPI, System Design, Machine Learning, CI/CD
About company
Skylabs AI

A US-headquartered company conducting applied research on AI for intelligent reasoning. Specializes in complex neurosymbolic AI to solve intricate problems within software engineering.

Job Details
Category: Backend
Posted: 3 months ago