Responsibilities
- Design and maintain a developer portal (Backstage.io or similar) as the central hub for resource management
- Build abstractions and APIs that enable developers to provision resources without manual intervention
- Implement self-service workflows for environment creation, configurations, and permissions
- Create reusable templates and blueprints for services, repositories, and pipelines
- Design, implement, and optimize highly automated CI/CD pipelines
- Reduce build and deployment times through intelligent caching, parallelization, and optimizations
- Implement GitOps and continuous deployment with automated rollback capabilities
- Automate testing (unit, integration, e2e) in pipelines with clear reporting
- Create advanced deployment strategies (blue-green, canary, feature flags)
- Design and implement ephemeral/preview environment solutions for each PR/branch
- Automate the complete lifecycle: creation, configuration, and cleanup
- Optimize costs through auto-scaling, scheduling, and garbage collection of unused resources
- Integrate ephemeral environments with code review and testing workflows
- Implement intelligent alerting systems with noise reduction and event correlation
- Configure dashboards and SLI/SLO metrics for critical services
- Establish automated runbooks and auto-remediation for common incidents
- Integrate observability (logs, metrics, traces) into the developer portal
- Maintain and evolve infrastructure as code (Terraform, CloudFormation, etc.)
- Implement automated security controls (policy as code, security scanning)
- Manage secrets, configurations, and access securely and with full auditability
- Apply least privilege and zero-trust principles across all systems
- Explore and implement AI tools for resource optimization and failure prediction
- Automate operational tasks using ML (anomaly detection, capacity planning, incident classification)
- Evaluate and adopt emerging AI Ops tools
Requirements
- 5+ years of experience in DevOps/SRE/Platform Engineering
- Mastery of cloud providers (preferably AWS)
- Solid experience with Kubernetes and microservices architectures
- Expertise in CI/CD tools (GitHub Actions, GitLab CI, Jenkins, ArgoCD)
- Proficiency in Infrastructure as Code (Terraform, Pulumi, CloudFormation)
- Experience with containers (Docker, Kubernetes, ECS/EKS)
- Advanced scripting skills (Python, Bash, Go)
- Knowledge of observability tools (Prometheus, Grafana, ELK, Datadog, New Relic)
Nice to Have
- Experience with Backstage.io or similar developer portal platforms
- Experience in FinTech organizations or highly regulated environments
- Familiarity with AI Ops tools (AIOps platforms, ML-based monitoring)
- Cloud certifications (AWS Solutions Architect, CKA, etc.)
- Experience with service mesh (Istio, Linkerd)
- Compliance and security knowledge (PCI-DSS, SOC2)
Work Arrangement
Hybrid
Additional Information
- Multicultural team with daily exposure to Portuguese, Spanish, and English (our corporate language)
- Annual learning budget and internal accelerated development paths
- High-ownership environment: we move fast, learn fast, and raise the bar — together
- Smart, ambitious teammates — low ego, high impact
- Flexible vacation and hybrid work model focused on results


