About the Role
The role involves building and optimizing cloud-based data infrastructure to support intelligent, agentic AI applications, ensuring high performance, reliability, and scalability.
Responsibilities
- Design and implement data pipelines on Microsoft Azure
- Develop scalable storage solutions for AI workloads
- Integrate machine learning models into production data flows
- Optimize data processing for low-latency decision systems
- Ensure data consistency across distributed environments
- Collaborate with AI researchers and software developers
- Monitor system performance and troubleshoot issues
- Maintain data security and compliance standards
- Automate deployment of data infrastructure
- Support real-time data streaming and processing
- Build reusable components for data transformation
- Document architecture and data workflows
- Improve data quality and validation processes
- Work with large-scale unstructured data sets
- Implement monitoring and alerting for data systems
- Contribute to system design discussions
- Ensure high availability of data services
- Manage metadata and data lineage tracking
- Support CI/CD pipelines for data code
- Evaluate new Azure data services and tools
- Assist in capacity planning for data growth
- Participate in code and design reviews
- Troubleshoot production data pipeline failures
- Enhance data observability practices
- Support disaster recovery planning for data systems
Nice to Have
- Master’s degree in a technical field
- Experience with AI or ML platforms
- Knowledge of agent-based systems
- Familiarity with MLOps practices
- Experience with large-scale data migration
- Contributions to open-source data projects
- Certification in Azure data technologies
- Background in autonomous systems
- Experience with graph databases
- Knowledge of natural language processing pipelines
Compensation
Competitive salary and benefits package
Work Arrangement
Remote with flexible hours
Team
Cross-functional team focused on AI-driven data solutions
About the Agentic AI Platform
The platform enables AI agents to process, reason, and act on data with minimal human intervention. It relies on robust, low-latency data infrastructure to support dynamic workflows and decision-making cycles.
Technology Stack
Azure Data Factory, Azure Databricks, Azure Blob Storage, Azure Synapse, Event Hubs, Cosmos DB, Python, Terraform, MLflow, Kubernetes
Available for qualified candidates