Remote (Global) Full-time

DevSavant is hiring a Senior Computer Vision Engineer

About the Role

DevSavant seeks a Senior Computer Vision Engineer to drive the development of advanced vision systems across video-based applications. You will work on bleeding-edge technologies in pose estimation, object detection, and motion tracking—optimizing deep learning models and deploying them at scale.

What You'll Do

  • Build and optimize pose estimation pipelines using HRNet, including multi-person and sports use cases.
  • Implement keypoint detection models with multi-stage refinement and heatmap generation.
  • Customize and train YOLOv4/X for object detection, optimizing anchor tuning, NMS, and accuracy-speed balance.
  • Evaluate and compare top-down vs bottom-up pose estimation approaches.
  • Apply tracking techniques including Kalman Filters, DeepSort, and custom tracking heuristics.
  • Analyze temporal coherence in high-FPS sequences (30–120fps).
  • Implement segmentation models for semantic and instance-level use cases using both classical and deep learning methods.
  • Convert models from PyTorch → ONNX → TensorRT, leveraging ONNXRuntime for inference.
  • Apply quantization, pruning, batching, and memory optimization techniques to deploy models efficiently at scale.
  • Optimize GPU memory usage and runtime throughput for large-scale inference workloads.
  • Manage full-frame inference for high-FPS video using tools like FFmpeg, OpenCV, and Decord.
  • Implement efficient clip segmentation, sampling strategies, and multi-threaded pre/post-processing pipelines.
  • Deploy models in AWS, GCP, or Azure cloud environments.
  • Develop scalable, containerized inference systems (Docker, REST APIs, async queues).
  • Build video ingestion pipelines with automated processing and feedback loops.
  • Develop evaluation pipelines for pose estimation (e.g., PCKh, mAP), detection accuracy, and frame-wise precision.
  • Integrate scoring models to map visual outputs to business or sports-specific metrics.

What We're Looking For

  • 5+ years of experience in computer vision and deep learning.
  • Strong track record of delivering production-ready vision pipelines.
  • Proven ability to work independently and in a cross-functional remote team.
  • Expert Python.
  • Experience with PyTorch, OpenCV, ONNXRuntime.
  • Experience with video tools FFmpeg, MoviePy.
  • Experience with infrastructure Docker, REST APIs, cloud deployment on AWS/GCP/Azure.

Nice to Have

  • Experience in real-time or near-real-time video inference.
  • Experience in the SportsTech industry or a related field involving human motion analysis.
  • Experience communicating technical concepts to clients and stakeholders.
  • C++.
  • TensorFlow (reference only).
  • Decord.
  • NVIDIA DALI.
  • Experiment Tracking with W&B, MLflow, TensorBoard.

Technical Stack

  • Languages: Python, C++, Bash scripting
  • ML Frameworks: PyTorch, TensorFlow
  • Vision & Video: OpenCV, FFmpeg, MoviePy, Decord, NVIDIA DALI
  • Inference & Deployment: ONNXRuntime, Docker
  • Cloud: AWS, GCP, Azure
  • Experiment Tracking: W&B, MLflow, TensorBoard

DevSavant is fostering a culture of growth and well-being in a supportive, success-driven environment.

Required Skills
PythonC++PyTorchOpenCVONNXRuntimeTensorFlowComputer VisionDeep LearningFFmpegBash scriptingModel OptimizationVideo Processing
Starting a business in Thailand?

Company registration done right

Foreign ownership rules, licenses, tax registration — Thai business setup has many moving parts. SVBL guides you through every step with full legal compliance.

Company registration & structure
Foreign ownership solutions
License & tax registration
BOI promotion eligibility
Start your business
100% foreign ownership possible
About company
DevSavant

A trusted technology partner specializing in Software Development, Data Engineering, AI/Machine Learning, Cloud Solutions, Automation Testing, and UI/UX Design.

Visit website
Job Details
Category data
Posted 8 months ago