Responsibilities

Deploy and integrate pre-trained and fine-tuned ML / LLM models using OpenAI, Hugging Face, TensorFlow, PyTorch, or similar frameworks
Build scalable AI inference APIs using FastAPI, Flask, Node.js, or similar technologies
Implement retrieval-augmented generation (RAG) pipelines using vector databases such as Pinecone, Weaviate, Chroma, or FAISS
Optimize prompt engineering, embeddings, and AI workflows for performance, accuracy, and cost efficiency
Build responsive front-end applications using React, Next.js, Vue, or similar frameworks
Develop back-end services and APIs connecting AI systems to business workflows and user-facing applications
Design scalable architectures for chatbots, AI assistants, analytics dashboards, search systems, and workflow automation tools
Ensure applications are intuitive, secure, responsive, and production-ready
Build ETL/ELT pipelines for ingesting, cleaning, transforming, and processing structured and unstructured datasets
Automate data preprocessing, versioning, labeling, and pipeline orchestration using Airflow, Prefect, Dagster, or similar tools
Store and manage datasets within cloud warehouses such as Snowflake, BigQuery, or Redshift
Maintain reliable data flows supporting training, inference, analytics, and AI operations
Containerize AI services using Docker and deploy workloads to Kubernetes or cloud-native environments
Build and maintain CI/CD pipelines for AI model updates and application releases
Monitor inference latency, application performance, costs, and model drift using MLflow, Weights & Biases, Prometheus, or custom dashboards
Support scalable and reliable cloud infrastructure on AWS, GCP, or Azure
Ensure AI systems comply with GDPR, HIPAA, SOC 2, or relevant privacy/security standards
Implement authentication, access control, rate limiting, and secure API practices
Protect user data and AI workflows using modern security standards and best practices
Collaborate with product managers, designers, and data scientists to prioritize impactful AI features
Translate prototypes into production-grade systems with scalable architecture and maintainable code
Participate in sprint planning, architecture discussions, code reviews, and technical documentation
Maintain clear documentation to support reproducibility, onboarding, and long-term maintainability

Requirements

3+ years of professional software engineering experience with AI/ML exposure
Strong proficiency in Python and JavaScript/TypeScript
Experience with AI/ML frameworks such as PyTorch, TensorFlow, LangChain, or Hugging Face
Experience deploying AI or ML models into production systems
Strong front-end experience with React, Next.js, or Vue
Strong SQL skills and experience with cloud data warehouses
Familiarity with REST APIs, microservices, and distributed systems
Experience with Docker, CI/CD workflows, and cloud infrastructure

Nice to Have

Experience building and scaling AI-powered SaaS applications
Strong understanding of embeddings, vector databases, and RAG architectures
Experience with LLM fine-tuning, evaluation, and prompt optimization
Familiarity with MLOps tools such as MLflow, Kubeflow, Vertex AI, SageMaker, or Weights & Biases
Experience with serverless architectures and cost-optimized inference systems
Background in SaaS, automation platforms, analytics systems, or AI-driven products

Additional Information

Working Hours: U.S. client business hours (with flexibility for deployments, experimentation cycles, and sprint schedules)

Pavago is hiring a Full-Stack AI Engineer

Responsibilities

Requirements

Nice to Have

Additional Information