Site Reliability Engineer Sr
Job Description
Job title: Site Reliability Engineer Sr
Company: Ceridian
Job description: Location: Work is what you do, not where you go. For this role, we are open to remote work and can hire anywhere in the United States or CanadaAbout the OpportunitySenior Site Reliability Engineers bridge the gap between development and operations organizations and facilitate effective collaboration which leads to faster feature delivery and elevated service quality overall.Join the pioneering Site Reliability Engineering team at Ceridian, where we lead the charge in ensuring our state-of-the-art products set new benchmarks in scalability, availability, and reliability. We embrace the SRE engagement model to deliver exceptional performance. As a member of our team, you’ll help build and maintain a suite of internal tools that proactively alert, report, and autonomously remediate Ceridian’s environments, ensuring seamless service. We’re committed to proactive solutions, crafting robust processes, empowering our talented developers with the latest technology, and engineering innovative remedies to prevent issues from recurring. If you’re passionate about pushing the boundaries in site reliability engineering, join us at Ceridian and become part of a team that thrives on challenges and celebrates continuous improvement. Elevate your career with us and be a part of a transformative journey in the world of SRE.What you’ll get to do
- Learn about Ceridian’s cloud infrastructure and the applications that run on them to build a full mental model of how the Dayforce ecosystem works.
- Onboard new features into the SRE operating model.
- Seek out, propose and execute on projects to improve Dayforce’s reliability, SRE processes and reduce day-to-day toil.
- Participate and begin to lead in incidents, investigate root cause, and remediate Dayforce environment issues.
- Create runbooks and reusable runbook components.
- Contribute to the inner source SRE repository.
- Develop trusted relationships with all parts of Ceridian’s business.
- PagerDuty On-Call rotations as required
Skills and Experience we Value
- Self-starter and passionate individual willing to learn new concepts and technologies as well as contributing to the SRE powered ecosystem.
- 3-6 years’ experience as an SRE, System Administrator, Network Engineer, Database Administrator or Software Engineer
- Ability to identify and resolve performance bottlenecks.
- Experience with Cloud Platforms (Azure Preferred)
- Experience with APM and Observability platforms and how to leverage and tune these tools.
- Experience with at least one object-oriented programing language (C# and Java preferred).
- Experience with at least one scripting language (Python and PowerShell preferred).
- Experience with at least one database engine and querying language (MSSQL / TSQL and Postgres / PLSQL preferred).
- Excellent communication and collaboration skills are mandatory.
- Building and contributing to Auto-Remediation tooling is an asset.
- Proven ability to consistently deliver solutions on time.
- Knowledge of Terraform is considered an asset but not required.
- Knowledge of containerization and Kubernetes is considered an asset.
- Knowledge of Chaos Testing principles is considered an asset.
#LI-Remote
Expected salary:
Location: Canada
Job date: Sat, 21 Sep 2024 23:36:52 GMT
Apply for the job now!