Full-Time
On-Site

Platform & Reliability Engineering/SRE

Description

The role involves designing, implementing, and optimizing core platform services, APIs, and automation frameworks to support software development, AIOps, and SRE operations. Key responsibilities include developing and maintaining infrastructure with Infrastructure as Code (IaC) tools such as Terraform, and architecting cloud-native solutions in GCP and on-prem OpenShift for reliability, scalability, and cost-efficiency. The position also requires implementing and enhancing CI/CD pipelines with tools such as GitHub Actions, GitLab CI, Jenkins, or ArgoCD, and establishing monitoring, logging, and alerting strategies using technologies such as Prometheus, Grafana, OpenTelemetry, and NewRelic. Embedding security best practices into platform solutions and supporting AIOps 24/7 in production through SRE, with enhanced automation for proactive incident resolution, are also critical aspects of the job.

What We're Looking For

Design, implement, and optimize core platform services, APIs, and automation frameworks to support software development, AIOps, and SRE operations.
Develop and maintain infrastructure using tools such as Terraform.
Architect and optimize cloud-native solutions in GCP and on-prem OpenShift, ensuring reliability, scalability, and cost-efficiency.
Implement and enhance CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, or ArgoCD to improve software delivery.
Ensure standardized DORA observability across prioritized development programs using Gathr as the platform.
Establish monitoring, logging, and alerting strategies using Prometheus, Grafana, OpenTelemetry, NewRelic, or similar technologies.
Embed security best practices in platform solutions, including identity management, secrets management, and policy enforcement.
Support AIOps 24/7 in production through SRE and enhance automation capabilities for proactive incident resolution.
7+ years of professional software development experience, with at least 3 years focused on platform engineering, DevOps, or SRE.
Proficiency in at least one programming language such as Python, Go, Java, or Rust.
Hands-on experience with cloud platforms (GCP and on-prem OpenShift) and cloud-native technologies such as Kubernetes.
Strong knowledge of Infrastructure as Code (Terraform).
Experience with containerization technologies (Docker, Kubernetes, Helm).
Expertise in CI/CD tooling and best practices for software deployment and release automation.
Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, OpenTelemetry, NewRelic, ELK stack).
Strong problem-solving skills and a track record of delivering scalable and maintainable solutions.
Excellent communication and collaboration skills, with experience working in an agile environment.

Ideal Candidate

7+ years of professional software development experience, with at least 3 years focused on platform engineering, DevOps, or SRE.
Proficiency in at least one programming language such as Python, Go, Java, or Rust.
Hands-on experience with cloud platforms (GCP and on-prem OpenShift) and cloud-native technologies such as Kubernetes.
Strong knowledge of Infrastructure as Code (Terraform).
Experience with containerization technologies (Docker, Kubernetes, Helm).
Expertise in CI/CD tooling and best practices for software deployment and release automation.
Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, OpenTelemetry, NewRelic, ELK stack).
Experience with service meshes and API gateways (Nice to Have).
Knowledge of SRE principles and reliability engineering (Nice to Have).
Experience with FinOps and cost optimization in cloud environments (Nice to Have).
Exposure to policy-as-code frameworks (Nice to Have).

Hard Skills

Python
Go
Java
Rust
Terraform
GCP
OpenShift
Kubernetes
Docker
Helm
GitHub Actions
GitLab CI
Jenkins
ArgoCD
Prometheus
Grafana
OpenTelemetry
NewRelic
ELK stack
Service Meshes
API Gateways
SRE principles
Reliability Engineering
FinOps
Cost Optimization
Policy-as-code frameworks
DevOps

Soft Skills

Problem-solving
Communication
Collaboration
Agile environment experience

Benefits

Ownership/CGI Partners
Teamwork
Respect
Belonging
Opportunities to reach full potential
Develop innovative solutions
Build relationships with teammates and clients
Access to global capabilities
Opportunities to scale ideas
Embrace new opportunities
Benefit from expansive industry and technology expertise
Career shaping
Supported by leaders who care about health and well-being
Opportunities to deepen skills and broaden horizons

About the Company

CGI Inc.

CGI Inc. is one of the world's largest IT and business consulting firms, providing a comprehensive range of services including strategic consulting, systems integration, managed IT, and business process services. Founded in 1976 and headquartered in Montreal, the company operates across more than 40 countries, helping clients across various industries accelerate their digital transformation and achieve measurable business outcomes.

Ownership-driven
Collaborative
Ethical
Global
Client-centric