This job is no longer available

First Street

High Performance Computing Manager

New York, NY, US

Hybrid, Full time role, Early Career, Mid Level

5 months ago

About the Job

Who we are: First Street is the standard for Climate Risk Financial Modeling. We use transparent and peer-reviewed methodologies to calculate the past, present, and future climate risk for every property in the world. We started working with the world’s leading climate scientists to create groundbreaking, climate-adjusted, property-specific models over 8 years ago and haven’t stopped.


Our mission: We exist to connect climate and financial risk. 


Our data: We create physics-based, deterministic models of flooding, wildfire and hurricanes, and advanced statistical models of extreme heat, air quality, drought, hail, severe convective storms, winter storms, and more. All of this data is used to create property-level financial risk metrics and macroeconomic variables to quantify the impacts of climate, property by property.


Our customers: We empower governments at the highest levels to make smart regulations, businesses to avoid bad investments, and everyday Americans to understand their personal risk from climate change. We are relied on every day by:


  • Agencies ranging from the U.S. Department of the Treasury to Fannie Mae

  • The world's biggest banks such as Bank of America and Wells Fargo 

  • Institutional investors like Nuveen and Blackstone 

  • Millions of everyday users on Zillow, Redfin, Realtor.com, Homes.com, and more 


We believe: With the right data, we can identify the problems, avoid bad investments, and implement solutions. This is why we have invested tens of millions of dollars into our science, data, people, and products and have raised tens of millions more to move even faster.


Come join us and use your talents to change the world.

Position Overview: The High Performance Computing Manager will administer and optimize research, development, and production workloads on our on-premises Linux cluster, and will manage computational workloads across other platforms, including AWS and other cloud services. The role involves maintaining the Linux-based compute environment; installing and maintaining compute libraries and software packages; using Docker and related technologies; deploying and managing compute jobs with Slurm; developing and maintaining Bash and Python scripts; and keeping our GitHub repositories running efficiently for collaborative development.

Key Responsibilities:

  • Cluster Administration: Administer and maintain an on-premises Linux cluster running Ubuntu, including system updates, performance tuning, and troubleshooting.

  • Cloud Compute Management: Deploy, manage, and optimize compute jobs on AWS and other cloud platforms, ensuring seamless integration with existing workflows.

  • Job Management: Utilize Slurm for job scheduling and resource management, optimizing job queues and ensuring efficient use of computational resources.

  • Scripting and Automation: Develop and maintain Bash and Python scripts to automate tasks, streamline workflows, and enhance computational efficiency.

  • Repository Maintenance: Oversee and manage GitHub repositories, including version control, branching strategies, and collaborative code development.

  • Collaboration: Work closely with scientists, researchers and developers to understand computational needs, provide technical support, and ensure that computational resources align with project requirements.

  • Documentation: Maintain comprehensive documentation for system configurations, processes, and best practices.

  • IT: Provide internal and infrastructure IT support

    • Support and maintain our internal systems (including Vanta, Jamf, Intune, Google, Tailscale, etc.)

    • Implement solutions to help improve our systems

    • Provide onsite support in the NYC office (M/W/Th)

Qualifications:

  • Education: Bachelor’s degree in Computer Science, Environmental Sciences, Applied Mathematics, or a related field. Advanced degrees or relevant certifications are a plus.

  • Experience: Proven experience managing Linux clusters and commercial cloud computing platforms. Hands-on experience with Slurm job scheduling and with Bash and Python scripting is essential.

  • Technical Skills:

    • Proficiency in administering Linux-based systems, specifically Ubuntu.

    • Experience with cloud computing platforms such as AWS, Azure, and/or Google Cloud.

    • Strong knowledge of Slurm for job scheduling and resource management.

    • Proficiency in Linux utilities and in Bash and Python scripting for automation and workflow optimization.

    • Experience managing GitHub repositories, including version control and collaboration tools.

  • Soft Skills:

    • Strong problem-solving skills and attention to detail.

    • Excellent communication skills and the ability to work collaboratively with interdisciplinary teams.

    • Ability to manage multiple tasks and projects simultaneously in a dynamic environment.

  • Nice-to-have skills:

    • Experience with massively parallel, cloud-based High Performance Computing.

    • Knowledge of very large datasets and of HDF/netCDF, Zarr, Xarray, and similar technologies.

    • Experience with running large physics-based models, including weather forecasting (e.g. WRF) and hydrology (e.g. HEC-RAS) applications.

How we work: 

  • Impact: We only focus on things that move the needle 

  • Drive: We are driven by the role we play in connecting climate and financial risk 

  • Ownership: This is our company and we act accordingly

  • Urgency: We move quickly because the world depends on it 

  • Resilience: We have a growth mindset in all that we do


What we offer: 

  • Competitive salary commensurate with experience 

  • Ownership interest in the company via Employee Stock Option Plan 

  • Hybrid Schedule with in-office work days on Monday, Wednesday and Thursday 

  • 15 vacation days along with 8 statutory company holidays, 5 days for winter break office closure, and 10 sick days 

  • Health benefits covered at 100% for employees, with a significant company contribution toward family plans

  • Vision and dental benefits with partial employee contribution

  • 12 weeks of paid parental leave 

  • Access to One Medical, Teladoc, Health Advocate, Kindbody, and Talkspace

  • Company 401(k) program

  • Commuter benefits 

  • Life Insurance

  • Tech startup environment 

  • Weekly team meals and an office stocked with coffee and snacks 

  • Working on the world’s biggest issue with other passionate professionals 


We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

About the Company

First Street

New York, NY, USA

51-100

Real estate, the bedrock of the American economy, is worth over 45 trillion dollars. Climate change has started to erode that foundation.

In an effort to keep up with the increasing risk of natural disasters, insurance companies are raising premiums at a rapid rate. In some places, properties are uninsurable.

When insurance premiums go up, property values go down.

It's not just insurance; critical infrastructure is failing nationwide because it was built without considering the impact of climate change.
