The SF Climate Week 2025 calendar is now open! 🎉 Register for 200+ events now before they fill up.
Back

Senior DevOps Engineer

6 days ago
Full time role
Remote · Mexico City, CDMX, MX... more

Wood Mackenzie is the global data and analytics business for the renewables, energy, and natural resources industries. Enhanced by technology. Enriched by human intelligence. In an ever-changing world, companies and governments need reliable and actionable insight to lead the transition to a sustainable future. That’s why we cover the entire supply chain with unparalleled breadth and depth, backed by over 50 years’ experience. Our team of over 2,400 experts, operating across 30 global locations, are enabling customers’ decisions through real-time analytics, consultancy, events and thought leadership. Together, we deliver the insight they need to separate risk from opportunity and make confident decisions when it matters most.

WoodMac.com

Wood Mackenzie Brand Video

Wood Mackenzie Values

  • Inclusive – we succeed together
  • Trusting – we choose to trust each other
  • Customer committed – we put customers at the heart of our decisions
  • Future Focused – we accelerate change
  • Curious – we turn knowledge into action

Senior Site Reliability Engineer

Job Description

Wood Mackenzie has an exciting opportunity for a Senior Site Reliability Engineer (SRE) to join a dynamic global business to help drive change and innovation. We are looking for a skilled senior SRE professional to help us manage and support our products and services within the enterprise.

Role Purpose 

The principal responsibility of this role is to provide operational expertise within the SRE team and work with the software engineering teams to design, build, release and maintain new and existing applications, resources, and platforms. This encompasses:

  • Working in partnership with the business and the technology teams, bringing awareness and insight of the different operational constraints / opportunities for projects targeting cloud-based or on-premises deployment.
  • Design, implementation, and maintenance of cloud and on-prem resources and environments.
  • Fostering a culture of radical candor in cross-functional groups, strictly following SRE best practices within a devops organization.
  • Ownership of advanced continuous integration/delivery toolset or the processes, resources, and platforms that use those tools.
  • Proactive approach to ensuring service availability as well as detection and prevention of problems.
  • Articulate technical and business concepts to different audiences and influence technical decisions with solid metrics collection and proof of concepts.

Responsibilities:

  • Strategic Pipeline Leadership: Lead design and implementation of efficient and scalable delivery pipelines.  Work closely with cross-functional teams, including developers, QA, and product managers, to design, develop, implement, and maintain advanced delivery pipelines for efficiency and scalability using tools like Jenkins, TeamCity, Octopus Deploy, and GitHub Actions.
  • Operational Strategy: Input into development and implementation of comprehensive operational strategies through identifying operational constraints and opportunities such as auto-scaling, container orchestration, and system resiliency.   
  • Innovative Solutions: Lead the implementation of advanced solutions to predict and mitigate potential issues.  Continuous analysis of data to identify gaps, trends, and areas for improvement.
  • Tooling Strategy: Spearhead the creation and maintenance of state-of-the-art tooling solutions, including awareness of new industry trends and how to implement them.
  • Continuous Improvement Champion: Lead continuous improvement initiatives.
  • Mentorship and Coaching: Provide mentorship to Levels 1 and 2 engineers.
  • Incident Command: Act as an active primary incident lead during critical (P1 and P2) incidents.
  • Documentation and Process Development: Ensure comprehensive documentation exists, knowledge shares happen regularly, and review of SRE processes/standards.
  • Autonomous Operation:   Proven ability to work independently with minimal oversight for the skills and requirements listed below.
  • On-Call Rotation: Participate in the 24/7 on-call rotation, providing advanced support, responding to system alerts, and incidents to ensure continuous system availability and performance.

Qualifications

We understand every organization is different and professionals have their own unique history and experience, so we don’t expect to find a 100% match of candidate competencies in respect of the tech stack we use in Wood Mackenzie. We list our preferred technologies, but if you have transferrable knowledge and you are willing to learn what you do not know, we will consider your application.

Skill Requirements:

  • Experience: Minimum of 4+ years in SRE/DevOps roles.
  • Agile Leadership: Extensive experience leading projects in an agile manner.
  • Expert Cloud Skills: Deep AWS expertise (Solutions Architect Associate or equivalent knowledge), substantial Azure experience (Microsoft Certified: Azure Administrator Associate AZ-104 or equivalent knowledge).
  • Linux Mastery: Advanced Linux system administration proficiency (Linux+, RHSA, or equivalent knowledge).
  • Automation and Innovation: Proven automation expertise (e.g. Ansible, Salt Stack, Rundeck, and Jenkins), including tool design, configuration, and implementation.
  • Mentorship and Growth: Mentor SRE Is and SRE IIs.
  • Strategic Execution: Ability to execute strategic operational initiatives.
  • Troubleshooting: Utilize tools such dig, tcpdump, grep, nslookup, vim, less, cat, etc.

Additional Preferred Skills:

  • Expert Cloud Management: Deep expertise in managing large-scale cloud environments.
  • Advanced Container Orchestration: Advanced knowledge of Docker and Kubernetes (CKA or equivalent knowledge).
  • Sophisticated Scripting and Development: Advanced scripting (Python, Bash, or Powershell) and development skills (e.g. C#, python applications, javascript frameworks, or php).
  • Infrastructure as Code Mastery: Mastery of Terraform, CloudFormation, CDK, or Pulumi.
  • CI/CD Pipeline Leadership: Expertise in designing, creating, implementing, and maintaining CI/CD pipelines.
  • Comprehensive Monitoring and Logging: Advanced experience with designing, building, searching (troubleshooting), and maintaining tools like Prometheus, Nagios, Grafana, ELK Stack, Splunk, App Insights, and CloudWatch.  
  • In-depth Networking Skills: Functional networking knowledge around concepts such as AWS VPCs, Azure vNets, AWS Direct Connect, Azure Express Route, Routing Tables, Network Security Groups, Route 53 Resolvers and Cloud Load Balancers.
  • Security Leadership: Strong understanding of security best practices and compliance frameworks (e.g. SOX, SOC II, NIST, CIS) with demonstrated ability to work within them.
  • Database Management: Experience with SQL and NoSQL databases.
  • Enterprise SaaS applications: Management experience with SaaS applications such as Okta, Jira, and Confluence.
  • Leadership and Mentorship: Cross-functional leadership of projects and ability to mentor less experienced team members.
  • Strategic Vision Execution: Assisting with operational initiative execution.
  • Collaboration Tools: Experience with Git, GitHub, and documentation platforms like Confluence.

Equal Opportunities

We are an equal opportunities employer. This means we are committed to recruiting the best people regardless of their race, colour, religion, age, sex, national origin, disability or protected veteran status. You can find out more about your rights under the law at www.eeoc.gov 

If you are applying for a role and have a physical or mental disability, we will support you with your application or through the hiring process.  

Subscribe