Taipei, Taiwan On-site

AIFT is hiring a Principal Site Reliability Engineer

Requirements

  • 8+ years technical experience in software engineering, network engineering, or systems administration
  • 6+ years of experience running large scale cloud services.
  • 2+ years of SRE team leadership role.
  • Fluent in English at a business level or higher.
  • Capable of planning infrastructure upgrades and optimizations.
  • Skilled in budget planning and ensuring cloud expenses remain within the allocated budget.
  • Skilled in OKR planning and ensuring the key results meet with company objectives.
  • Advanced knowledge of monitoring solutions like Prometheus, Grafana, ELK (Elasticsearch, Logstash, Kibana).
  • Experience in the complete software development life cycle (SDLC).
  • In-depth understanding of network concepts, particularly with a focus on security.
  • Hands-on experience implementing GitLab CI/CD processes.
  • Proficiency in automation platforms like Ansible and Terraform.
  • Knowledge of orchestration tools like Kubernetes.
  • Familiarity with container technologies like Docker.
  • Experience with Git source code version control systems.
  • Experience with AI pair programming like OpenAI.
  • Proficiency in programming languages such as Bash, Python, or Go.
  • Team player and good interpersonal skills.
Required Skills
Grafana
About company
AIFT
AIFT builds AI security solutions, specifically focusing on GenAI Security Guardrails (Blue Team) and Automated Vulnerability Assessment (Red Team) to protect against GenAI threats such as prompt injection.
All jobs at AIFT Visit website
Job Details
Category infrastructure
Posted 12 days ago