Palo Alto, California, United States USD 180,000 – 440,000 / year

xAI is hiring a Software Engineer

Responsibilities

Build and maintain distributed systems handling petabyte to exabyte-scale data, supporting web crawling, preprocessing, classification, and multimodal pipelines across CPU and GPU environments.
Design high-throughput search and retrieval systems capable of processing trillions of documents using vector, hybrid, and semantic methods, integrated with large language models for accurate, real-time knowledge extraction.
Create robust inference serving platforms with load balancing, auto-scaling, key-value caching, batching, fault tolerance, monitoring via Prometheus and Grafana, CI/CD pipelines using Buildkite and ArgoCD, and performance benchmarking to ensure consistent uptime and low latency.
Improve low-level system performance through optimization of CUDA kernels including GeMM and attention mechanisms, development of Triton and CUTLASS extensions, and techniques like quantization, distillation, and speculative decoding, alongside co-design of models and hardware for future architectures.
Advance compiler and runtime technologies for machine learning frameworks such as JAX/XLA/MLIR, add custom support for next-generation GPUs, and build distributed profiling and debugging tools, while exploring high-speed interconnects including copper and optical solutions, SerDes, photonics, topology simulation, and vendor roadmaps.
Orchestrate complex computing workloads across multiple clusters and cloud environments using Kubernetes, ensure data traceability and integrity, validate high-speed network fabrics, and implement telemetry, automation, and failure analysis systems to maintain production reliability.

Benefits

Total compensation includes base pay, equity, full medical, vision, and dental coverage, 401(k) plan access, short- and long-term disability insurance, life insurance, and additional discounts and employee perks.

Compensation

Base salary is part of a broader rewards package including equity and benefits.

Work Arrangement

Not specified

Team

Not specified

Other

All team members must demonstrate strong communication abilities and effectively convey technical concepts clearly and concisely to colleagues.
A strong work ethic and the ability to prioritize tasks efficiently are essential.

Not specified

Required Skills

PythonRustC++GoJavaApache SparkKubernetesCUDADistributed SystemsPerformance OptimizationData Pipelines

About company

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

All jobs at xAI Visit website

Job Details

Department Software Development

Category other

Posted 4 months ago

Similar Jobs

Other opportunities you might be interested in

MLOps Field Engineer

Canonical

EMEA Remote (Global)

Implementation Engineer

EngFlow

Auckland Remote (Global)

Senior Software Engineer - Cloud

Bitdeer Technologies Group

United States of America Remote (Global)

Customer Success Engineer

Redpanda

Austin Remote (Global)

Lead Engineer – Platform & Infrastructure

Stream

Amsterdam Hybrid

Software Engineer, New Grad - Infrastructure

Palantir Technologies

Related Articles

Insights related to this role

Remote data scientist working with Kubernetes through a low-code platform, enabling cloud-native tools without backend expertise

Platform Engineering: Kubernetes for All

3 min 3 months ago

Workspace setup for an AI developer, showing dual monitors with code and neural networks, symbolizing the AI developer career path.

Become an AI Developer: Your Career Guide

5 min 3 months ago

Home office setup with dual monitors showing Kubernetes dashboards, representing the rise of Kubernetes remote jobs in AI and cloud-native careers 2026.

Kubernetes Remote Jobs: AI & Cloud-Native Careers in 2026

5 min 3 months ago