Remote (USA) Remote (Country) Employment $180,000 - $230,000 a year

Arcadia is hiring a Principal Site Reliability Engineer

Responsibilities

  • Develop and implement strategies to enhance system reliability and performance.
  • Collaborate with cross-functional teams to ensure system stability and scalability.
  • Design and maintain monitoring and alerting systems to detect and resolve issues proactively.
  • Lead incident response and post-incident reviews to improve system resilience.
  • Create and maintain documentation for system architecture, processes, and procedures.
  • Mentor and guide junior engineers in best practices for site reliability engineering.
  • Participate in on-call rotations to ensure 24/7 system availability.
  • Conduct regular system audits and assessments to identify and mitigate risks.
  • Implement and manage disaster recovery and business continuity plans.
  • Work with development teams to integrate reliability best practices into the software development lifecycle.
  • Analyze system performance metrics and make data-driven decisions to optimize performance.
  • Develop and maintain automated deployment and infrastructure management tools.
  • Ensure compliance with industry standards and best practices for system reliability and security.
  • Collaborate with vendors and third-party service providers to ensure system reliability.
  • Stay up-to-date with emerging technologies and trends in site reliability engineering.
  • Participate in the design and implementation of new systems and infrastructure.
  • Provide technical leadership and expertise to support the organization's reliability goals.
  • Develop and implement strategies to reduce system downtime and improve availability.
  • Conduct regular training sessions and workshops to enhance the team's skills and knowledge.
  • Work with stakeholders to understand their needs and ensure system reliability meets their expectations.
  • Develop and implement strategies to improve system scalability and performance.
  • Collaborate with the security team to ensure system reliability and security are aligned.
  • Participate in the development and implementation of the organization's reliability roadmap.
  • Conduct regular reviews and assessments of the organization's reliability practices and procedures.
  • Develop and implement strategies to improve system resilience and fault tolerance.

Nice to Have

  • Experience with Kubernetes and Docker.
  • Knowledge of Prometheus and Grafana.
  • Experience with Terraform and Ansible.
  • Proficient in using ELK Stack (Elasticsearch, Logstash, Kibana).
  • Experience with AWS, Azure, or Google Cloud Platform.
  • Knowledge of CI/CD pipelines and tools.
  • Experience with monitoring and alerting tools such as Nagios or Zabbix.
  • Proficient in using JIRA and Confluence.
  • Experience with infrastructure as code (IaC) tools such as Puppet or Chef.
  • Knowledge of network security protocols and best practices.
  • Experience with container orchestration tools such as Kubernetes or Docker Swarm.
  • Proficient in using Git and GitHub.
  • Experience with log management and analysis tools such as Splunk or ELK Stack.
  • Knowledge of performance tuning and optimization techniques for databases.
  • Experience with capacity planning and resource management tools.
  • Proficient in using data analysis and visualization tools such as Tableau or Power BI.
  • Experience with compliance and regulatory requirements for data security and privacy.
  • Knowledge of vendor management and third-party service provider best practices.
  • Experience with agile methodologies and practices such as Scrum or Kanban.
  • Proficient in using project management and collaboration tools such as Asana or Trello.
  • Experience with disaster recovery and business continuity planning tools.
  • Knowledge of system and network security best practices for cloud environments.
  • Experience with infrastructure as code (IaC) tools such as CloudFormation or Pulumi.

Compensation

Competitive salary and benefits package

Work Arrangement

On-site with flexible hours

Team

Work closely with cross-functional teams including development, operations, and security.

About Us

  • We are a leading provider of innovative solutions in the healthcare industry.
  • Our mission is to transform healthcare through technology and data-driven insights.
  • We are committed to improving patient outcomes and enhancing the quality of care.
  • Our team is dedicated to delivering exceptional service and support to our clients.
  • We foster a culture of innovation, collaboration, and continuous learning.
  • We value diversity, inclusion, and equality in the workplace.
  • Our company is recognized for its commitment to sustainability and social responsibility.
  • We offer a dynamic and challenging work environment with opportunities for growth and development.
  • Our team is passionate about making a positive impact on the healthcare industry.
  • We are proud to be a leader in the healthcare technology sector.

Our Benefits

  • Comprehensive health, dental, and vision insurance plans.
  • 401(k) retirement savings plan with company match.
  • Generous paid time off, including vacation, sick leave, and holidays.
  • Employee assistance program for personal and professional support.
  • Tuition reimbursement for continuing education and professional development.
  • Flexible work arrangements, including remote work options.
  • On-site fitness center and wellness programs.
  • Employee referral bonus program.
  • Professional development and training opportunities.
  • Performance-based bonuses and incentives.
  • Company-sponsored events and team-building activities.
  • Employee recognition and reward programs.
  • Access to a variety of employee resource groups.
  • Paid parental leave for new parents.
  • Health and wellness initiatives, including on-site health screenings and flu shots.
  • Employee stock purchase plan.
  • Volunteer time off for community service and charitable activities.
  • On-site cafeteria and dining options.
  • Free parking and public transportation subsidies.
  • Relocation assistance for eligible employees.
  • Pet insurance and pet-friendly workplace policies.

Visa sponsorship available for qualified candidates

Invoicing holding you back?

Focus on work, not paperwork

Stop worrying about invoicing, taxes, and compliance. Glopay handles the business setup, you handle the client work. Get paid faster and look professional.

Auto-generated compliant invoices
Built-in expense management
Income reports for tax season
95% of earnings stay with you
Try Glopay free
No credit card needed
About company
Arcadia
Arcadia is a technology company dedicated to breaking the fossil fuel monopoly by providing access to clean energy data and services. We connect consumers and businesses to renewable energy sources through our platform.
All jobs at Arcadia Visit website
Job Details
Department Engineering
Category infrastructure
Posted a month ago