The Mid-level AI Cross Capability SRE is responsible for maintaining the reliability, scalability, and performance of AI systems that span multiple capabilities, such as computer vision, natural language processing, and data analytics. This role involves managing the integration of diverse AI solutions, ensuring seamless collaboration between systems, and addressing operational challenges across cross-functional AI deployments. You will work closely with engineering, data science, and DevOps teams to deliver robust, high-performing AI systems.
Typical Responsibilities
ensuring the reliability, scalability, and performance of AI systems across multiple platforms and capabilities while emphasizing reusability of components
collaborate with cross-functional teams to optimize infrastructure, monitor system health, troubleshoot issues, and ensure seamless integration of AI models into various business functions.
Candidate Requirements
Technical Skills
Proficiency in programming languages like Python, Java, or Go.
Experience with cloud platforms (AWS, Azure, GCP) and AI-specific services across them.
Hands-on expertise in containerization (Docker) and orchestration (Kubernetes).
Familiarity with monitoring tools like Prometheus, Grafana, and ELK Stack for multi-service environments.
Knowledge of MLOps tools (e.g., MLflow, Kubeflow, SageMaker) and AI pipeline orchestration.
Strong understanding of AI concepts, including NLP, computer vision, and data analytics.
Experience with distributed systems and data processing frameworks (e.g., Apache Spark, Kafka).
Education and Experience
Bachelor degree in Computer Science, Data Science, or related fields.
Three years of experience in SRE, DevOps, or AI/ML operations roles, with exposure to multi-capability AI systems.
Soft Skills
Strong analytical and problem-solving skills, with attention to detail.
Excellent collaboration and communication skills to work across diverse teams.
Ability to adapt to a fast-paced, dynamic work environment.
Nice-to-Have
Knowledge of advanced AI frameworks and libraries, such as TensorFlow, PyTorch, or Hugging Face Transformers.
Experience with data engineering tools and techniques, including ETL pipelines.
Certifications in cloud platforms or relevant AI/ML technologies.
Exposure to AI governance and compliance best practices.
MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.