Description

Join the GFT team at Royal Bank of Canada as a Lead ML Ops Engineer to build and manage CI/CD pipelines for Machine Learning and agentic AI solutions on OpenShift, utilizing GitHub Actions and Airflow. This role involves partnering with machine learning and data engineers to deliver business value through agile practices, ensuring strong data governance and security. Key responsibilities include designing scalable pipelines, supporting data ingestion and feature engineering, integrating models into APIs and services, participating in architecture reviews, setting operational standards, and maintaining comprehensive documentation. The engineer will also be responsible for monitoring, incident triage and resolution, and optimizing the reliability, performance, and cost of GenAI solutions, enabling teams to safely deploy new technologies.

What We're Looking For

Build and operate CI/CD for ML and GenAI on OpenShift using GitHub Actions and Airflow.
Deploy, scale, and manage Large Language Models (LLMs) and agentic services on OpenShift Container Platform (OCP) and cloud platforms (AWS/Azure).
Architect and deliver solutions and Proof-of-Concepts (PoCs) for cutting-edge Generative AI (GenAI), including RAG, tool-use, and agent orchestration.
Own the entire model lifecycle operations: versioning, registries, feature stores, vector databases, and GPU environments.
Build and maintain Model Context Protocol (MCP) integrations (servers/clients and tool adapters) to enable LLM agents to securely invoke internal APIs, data sources, and actions on OpenShift.
Define release and rollback procedures to ensure reliable maintenance of deployed models.
Collaborate with data scientists, ML engineers, and data engineers to integrate models/APIs into microservices and data pipelines.
Drive the platform roadmap, define standards, and maintain documentation, empowering teams through tooling, templates, and best practices.
Hands-on MLOps/LLMOps experience with containers, Kubernetes, and OpenShift.
Experience with CI/CD via GitHub Actions; orchestration with Airflow is a good-to-have.
Production experience with LLMs and agentic AI: RAG, vector search, prompt management, and serving patterns.
Hands-on experience with agentic AI and understanding of MCP, including designing agent tool-use flows and deploying MCP servers/tools.
Expert proficiency in Python and SQL.
Cloud experience (AWS or Azure), including designing scalable and secure architectures in collaboration with data architecture and ML Engineers.
3+ years of hands-on experience.
Proven production experience running ML/GenAI with monitoring and performance tuning.
Excellent communication and cross-team leadership skills.
Ability to articulate complex technical concepts to leadership and key stakeholders.
Ability to foster and drive key innovations within the extended team using emerging technology.

Ideal Candidate

Bachelor's Degree in Computer Science, Engineering, or a related field (Master's or PhD preferred).
3+ years of hands-on experience.

Minimum Education

Bachelor's Degree (Master's/PhD preferred)

Hard Skills

Big Data Management
Data Mining
Data Science
Deep Learning
Machine Learning (ML)
Predictive Analytics
Programming Languages
MLOps
LLMOps
Containers
Kubernetes
OpenShift
CI/CD
GitHub Actions
Airflow
LLMs
Agentic AI
RAG (Retrieval-Augmented Generation)
Vector Search
Prompt Management
Python
SQL
AWS
Azure
Snowflake (nice to have)
SageMaker (nice to have)
MLflow (nice to have)
Feast (nice to have)

Soft Skills

Excellent Communication
Cross-team Leadership
Agile Practices
Collaboration
Innovation
Problem-solving

Work Hours

37.5 hours/week

Benefits

Comprehensive Total Rewards Program (bonuses, flexible benefits, competitive compensation, commissions, stock where applicable)
Leaders who support your development through coaching and managing opportunities
Ability to make a difference and lasting impact
Work in a dynamic, collaborative, progressive, and high-performing team
World-class training program in financial services
Opportunities to do challenging work
Opportunities to take on progressively greater accountabilities
Opportunities to build close relationships with clients
Access to a variety of job opportunities across business and geographies

About the Company

Royal Bank of Canada

Royal Bank of Canada is a global financial institution with a purpose-driven, principles-led approach to delivering leading performance. As Canada's largest bank, it provides personal and commercial banking, wealth management, and capital markets services to over 17 million clients worldwide.

Purpose-driven
Inclusive
Innovative
Collaborative
Professional
