About the Role
The AI Infrastructure Engineer will be responsible for designing, implementing, and maintaining GPU infrastructure to support AI workloads. This role involves collaborating with cross-functional teams to ensure optimal performance and scalability of AI systems.
Responsibilities
- Design and implement GPU infrastructure solutions.
- Collaborate with cross-functional teams to integrate AI models into production environments.
- Ensure the scalability and reliability of AI infrastructure.
- Monitor and optimize GPU performance.
- Troubleshoot and resolve infrastructure issues.
- Implement security measures to protect AI infrastructure.
- Stay updated with the latest GPU technologies and industry trends.
- Document infrastructure designs and processes.
- Provide technical support and guidance to team members.
- Participate in on-call rotations to ensure 24/7 support.
- Conduct regular performance reviews and capacity planning.
- Implement automated deployment and monitoring tools.
- Ensure compliance with industry standards and best practices.
- Collaborate with data scientists and engineers to understand AI requirements.
- Develop and maintain infrastructure as code (IaC) solutions.
- Implement disaster recovery and business continuity plans.
- Conduct regular security audits and vulnerability assessments.
- Provide training and mentorship to junior team members.
- Participate in infrastructure design reviews and code reviews.
- Implement and manage containerization and orchestration tools.
- Collaborate with cloud providers to optimize GPU usage.
- Develop and maintain monitoring and alerting systems.
- Implement and manage CI/CD pipelines for AI infrastructure.
Nice to Have
- Experience with AI/ML model deployment.
- Knowledge of machine learning operations (MLOps).
- Experience with large-scale AI infrastructure projects.
- Proficiency in multiple programming languages.
- Experience with GPU-specific frameworks and libraries.
- Knowledge of AI infrastructure security best practices.
- Experience with hybrid cloud environments.
- Proficiency in infrastructure automation tools.
- Experience with AI infrastructure monitoring and alerting.
- Knowledge of AI infrastructure scalability and performance optimization.
Compensation
Competitive salary
Work Arrangement
Remote
Team
AI Infrastructure team
About Us
- We are a cutting-edge technology company specializing in AI and machine learning solutions.
- Our mission is to deliver innovative AI technologies that drive business success.
- We value collaboration, innovation, and continuous learning.
- Our team is composed of experts in AI, machine learning, and infrastructure engineering.
- We offer a dynamic and inclusive work environment.
- We provide opportunities for professional growth and development.
- Our company is committed to ethical AI practices and responsible innovation.
- We foster a culture of continuous improvement and excellence.
- We are dedicated to delivering high-quality AI solutions to our clients.
- Our company values diversity, equity, and inclusion.
Our Benefits
- Competitive salary and benefits package.
- Flexible work arrangements and remote work options.
- Comprehensive health and wellness programs.
- Professional development and training opportunities.
- Generous vacation and time-off policies.
- Employee assistance programs and support services.
- Performance-based bonuses and incentives.
- Opportunities for career advancement and growth.
- Collaborative and inclusive work environment.
- Access to cutting-edge technology and tools.
How to Apply
- Submit your resume and cover letter through our online application portal.
- Include relevant experience and skills in your application.
- Highlight your achievements and contributions in AI infrastructure engineering.
- Provide examples of your problem-solving and troubleshooting abilities.
- Demonstrate your knowledge of GPU technologies and architectures.
- Showcase your experience with cloud platforms and containerization tools.
- Include any relevant certifications or training.
- Provide references from previous employers or colleagues.
- Prepare for technical interviews and assessments.
- Be ready to discuss your experience with AI infrastructure projects.
Not provided