As a Senior Site Reliability Engineer, you will play a pivotal role in ensuring the reliability, scalability, and performance of our cloud-native systems. You'll bridge the gap between development and operations by designing resilient infrastructure, automating deployment workflows, and driving observability across services. Your work will directly impact system uptime, incident response efficiency, and the overall developer experience. You'll collaborate with engineering teams to enforce best practices in infrastructure as code, security, and continuous delivery, while also mentoring peers and shaping long-term technical strategy.
Responsibilities
- Design and implement robust monitoring, logging, and alerting systems using Google Cloud Platform tools and open-source solutions like Prometheus and Grafana.
- Automate the provisioning and management of complex cloud infrastructure using Terraform to enforce consistent Infrastructure as Code practices.
- Enhance and maintain CI/CD pipelines to enable fast, reliable software delivery without compromising system stability.
- Work closely with backend and mobile teams to improve deployment processes, system performance, and application observability.
- Strengthen security and compliance by integrating best practices into cloud configurations and deployment workflows.
- Identify infrastructure inefficiencies and lead initiatives to optimize GCP services such as Cloud Run, BigQuery, and networking components.
- Develop and maintain clear technical documentation for architectures, deployment procedures, and troubleshooting guides.
Requirements
- Minimum of three years of experience in senior DevOps or Site Reliability Engineering roles with a focus on infrastructure modernization.
- Demonstrated expertise in Google Cloud Platform, including hands-on work with Cloud Run, BigQuery, Cloud Storage, and networking services.
- Strong proficiency in Terraform for managing large-scale cloud environments through Infrastructure as Code.
- Experience with container technologies, particularly Docker, and managing containerized applications in production.
- Practical knowledge of observability tools such as GCP Operations Suite, Prometheus, and Grafana for monitoring and diagnostics.
- Proficient in scripting and automation using Go, Python, or TypeScript/JavaScript to build custom tooling and workflows.
- Experience maintaining and optimizing CI/CD systems using platforms like GitHub Actions, GitLab, or ArgoCD.
- Comfortable working in agile, autonomous environments where problem-solving and initiative are highly valued.
- Solid understanding of cloud security principles and experience applying them in real-world deployments.
Tech Stack
Google Cloud Platform, Cloud Run, BigQuery, Cloud Storage, Networking, Terraform, Docker, Prometheus, Grafana, Cloud Monitoring, Cloud Logging
Benefits
- Comprehensive health insurance coverage to support employee well-being.
- Flexible working hours to accommodate individual productivity and life balance.
- Open holiday policy allowing employees to take time off as needed.
- Company-wide profit distribution program rewarding team success.
- Annual company trip, sports groups, and social activities to foster connections and enjoyment.
- Support for professional growth through training programs and conference attendance with personalized learning plans.
- Child care vouchers to assist working parents.
- Choice of laptop and peripherals tailored to personal preferences and needs.
- Unlimited-use mobile hotspot provided in Portugal for work or personal use.
- Access to office snacks and refreshments at locations in Porto, Aveiro, and Coimbra.
- Partnerships with local businesses offering discounts and benefits to employees.
- Hybrid work model with office spaces in multiple cities and remote options available.
Work Arrangement
Hybrid
Team
You will join a distributed engineering organization that values collaboration, transparency, and technical excellence. The team operates in autonomous squads with shared platforms and tooling, enabling fast iteration while maintaining high reliability standards. You'll work alongside experienced engineers across backend, frontend, and data disciplines, contributing to a strong DevOps culture where ownership and continuous improvement are core values.
Additional Information
- We value continuous learning and provide dedicated time and budget for attending conferences and taking courses.
- Our engineering teams follow agile methodologies with a focus on sustainable pace and high-quality delivery.
- We maintain a blameless postmortem culture to learn from incidents and improve system resilience.
- Internal tech talks and knowledge-sharing sessions are held regularly to foster cross-team collaboration.
- We use modern development practices including trunk-based branching, automated testing, and canary deployments.
- The company supports remote work for employees across Portugal with periodic in-person gatherings.
- We are committed to building inclusive teams and fostering an environment where diverse perspectives thrive.
- All engineers are expected to contribute to on-call rotations with proper support and rotation fairness.
- We prioritize documentation and maintain a well-organized internal knowledge base accessible to all teams.


