This job is no longer available

Google DeepMind

Head of Evaluations

New York, NY, US

RemoteFull time roleDirector / Executive

11 months ago

About the Job

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Snapshot 

As Head of Evaluations in the Responsible Development and Innovation (ReDI) team, you’ll be responsible for driving our approach to evaluations of Google DeepMind’s most groundbreaking models and overseeing and expanding the evaluation portfolio ahead of new model launches.   

You will work with teams at Google DeepMind, and internal and external partners, to ensure that our work is conducted in line with responsibility and safety best practices, helping Google DeepMind to progress towards its mission.

About us

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and responsibility are the highest priority.

The role

As Head of Evaluations in the ReDI team, you’ll oversee a team of specialists and be a critical part of the ReDI leadership team, using your expertise to deliver impactful work through direct collaboration on groundbreaking research projects and to help develop the broader governance ecosystem at Google DeepMind. You’ll be a critical input to informing Google DeepMind and Google leadership about the responsibility and safety of our models and to model development teams as they build new models. This role will work with teams from across Google DeepMind and Google, as well as external partners. 

Key responsibilities

  • Lead and manage a team of specialists, fostering a collaborative and high-performing environment, providing mentorship and guidance to team members. Actively invest in their professional development through regular feedback, coaching, and opportunities for growth.
  • Build and manage a roadmap of evaluations development that examine responsibility and safety questions for Google DeepMind’s most groundbreaking models across internal assurance evaluations, external evaluations and policy-aligned evaluations for model teams.
  • Proactively engage with industry-wide developments in this space to promote best practices in Google DeepMind.
  • Identify areas where new testing approaches are required, working with Google DeepMind research and engineering teams to: 
    • Drive the development of new evaluations for upcoming model releases; 
    • Oversee automated evaluations that support model development evaluations and assurance; and
    • Engage external organisations to provide insights on the responsibility and safety aspects of Google DeepMind models. 
  • Work closely with other areas of ReDI to develop a deep understanding of key policy areas which guide and inform the evaluations being run. 
  • Continuously identify ways to scale evaluations work and drive efficiencies, including identifying right sized approaches to manage evaluations for new modalities and capabilities through to more established models. 
  • Coordinate across relevant teams across the organization, such as those working on autonomy and cybersecurity, to create a comprehensive overview of the safety profile of Google DeepMind models. 
  • Proactively improve the responsibility and safety of Google DeepMind’s models by sharing and communicating insights from evaluations with model development teams, senior stakeholders and decision makers and broader Google teams.
  • Work closely with Google DeepMind and Google teams to ensure external commitments are upheld. 

About you

In order to set you up for success as a Head of Evaluations at Google DeepMind, we look for the following skills and experience: 

  • Demonstrated prior experience designing and implementing audits or evaluations of cutting edge AI systems. 
  • Deep understanding of AI technologies and machine learning principles. 
  • Expertise in data science, statistics, algorithmic auditing, or other relevant fields
  • Demonstrable expertise in identifying solutions to scale evaluations work, driving efficiencies to manage new modalities scaled through to established models
  • Demonstrated experience in managing a high performing interdisciplinary team in a fast paced environment.
  • Demonstrated ability to lead cross-functional teams, foster collaboration, and influence outcomes.
  • Experience working with ethics and safety topics associated with AI development in a technology company such as child safety, privacy, representational harms and discrimination, misinformation, or other areas of content or model risks.
  • Proven ability to engage with and influence a range of internal stakeholders from researchers and engineers through to senior leadership and external partners, from academia through to suppliers. 
  • Excellent communication skills, both written and verbal, with the ability to effectively communicate complex technical concepts to a wide range of audiences.

Preferred Experience:

  • Master's degree or PhD (or equivalent experience) in a relevant field, such as philosophy, ethics, computer science, or public policy
  • Prior experience participating in or leading red teaming exercises for AI models.  
  • Familiarity with cybersecurity principles and practices relevant to AI model safety and security.
  • Product management expertise or other similar experience.

Deadline to apply: EOD Wednesday 7th August 2024. 

The US base salary range for this full-time position is between $168000 - $266000 + bonus + equity + benefits. 

About the Company

Google DeepMind Logo

Google DeepMind

London, UK

1001-5000

<p>&nbsp;</p> <p><span style="font-weight: 400;">Artificial Intelligence could be one of humanity&rsquo;s most useful inventions. At Google DeepMind, we&rsquo;re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.</span></p>

Similar Jobs

Overstory Logo

Head of Data/ML

Head of Data/ML

  • Overstory
  • Hybrid, Remote
  • Full time role

Real-time satellite data to reduce wildfire risk and improve grid reliability, combating climate change.

3 months ago

Causal labs Logo

Member of Technical Staff - ML Infra

Member of Technical Staff - ML Infra

  • Causal labs
  • San Francisco, CA, US
  • Hybrid, Remote
  • Full time role

AI Physics Models to Predict & Control the Weather

6 days ago

Causal labs Logo

Member of Technical Staff - ML Research

Member of Technical Staff - ML Research

  • Causal labs
  • San Francisco, CA, US
  • Hybrid, Remote
  • Full time role

AI Physics Models to Predict & Control the Weather

6 days ago

Workiva Logo

Senior AI Enablement Engineer

Senior AI Enablement Engineer

  • Workiva
  • United States
  • Remote
  • Full time role

"Streamlining integrated ESG reporting for transparent climate impact and compliance."

About 2 months ago

Workiva Logo

Director of Data Science

Director of Data Science

  • Workiva
  • United States
  • Remote
  • Full time role

"Streamlining integrated ESG reporting for transparent climate impact and compliance."

About 1 month ago

Zoox Logo

Principal Software Engineer, Autonomy Evaluation

Principal Software Engineer, Autonomy Evaluation

  • Zoox
  • Foster City, CA, US
  • Hybrid, Remote
  • Full time role

Pioneering electric autonomous vehicles for low-carbon, congestion-free urban transportation.

27 days ago

Bedrock ocean exploration Logo

VP, Engineering / Software

VP, Engineering / Software

  • Bedrock ocean exploration
  • Remote
  • Full time role

Revolutionizing ocean exploration with advanced robotics for scalable, affordable, and eco-conscious subsea surveys.

27 days ago

Agerpoint Logo

Head of Product Engineering

Head of Product Engineering

  • Agerpoint
  • North Carolina, US, Durham, NC, US
  • Hybrid, Remote
  • Full time role

Enabling sustainable agriculture and forest conservation through AI-powered 3D plant modeling and carbon tracking.

23 days ago

Agerpoint Logo

Director, Spatial Data Science

Director, Spatial Data Science

  • Agerpoint
  • North Carolina, US, Durham, NC, US
  • Hybrid, Remote
  • Full time role

Enabling sustainable agriculture and forest conservation through AI-powered 3D plant modeling and carbon tracking.

22 days ago

Zoox Logo

Senior Staff Software Engineer, Autonomy Evaluation

Senior Staff Software Engineer, Autonomy Evaluation

  • Zoox
  • Foster City, CA, US
  • Hybrid
  • Full time role

Pioneering electric autonomous vehicles for low-carbon, congestion-free urban transportation.

19 days ago