What You'll Do
Own the development and optimization of the inference infrastructure for on-device AI, ensuring models run efficiently and consistently across diverse hardware. You'll focus on runtime quality, tuning system behavior for fast startup, a low memory footprint, and balanced throughput and latency during extended use.
Work directly with machine learning models using frameworks such as llama.cpp, ggml, and ONNX Runtime, deploying them to edge environments with a strong emphasis on performance and reliability. Partner with research teams to bridge the gap between experimental models and production-ready implementations, helping refine models for real-world deployment.
Integrate advanced AI capabilities into existing software products, ensuring seamless performance and user privacy by design.
Requirements
- Strong proficiency in C++ with a focus on systems-level programming and runtime efficiency
- Hands-on experience deploying machine learning models to edge or resource-constrained devices
- Familiarity with inference frameworks such as llama.cpp, ggml, and ONNX Runtime
- Excellent written and verbal communication skills in English
- Ability to collaborate across disciplines, especially with research and product teams
Benefits
- Work 100% remotely from anywhere in the world
- Collaborate with a lean, high-impact team at the forefront of fintech innovation
- Contribute to a transparent, globally distributed organization committed to technological empowerment
- Be part of a mission-driven effort advancing blockchain-based financial systems