Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications. Crusoe is redefining AI cloud infrastructure, with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance. Our data centers are optimized for AI workloads and are powered by clean, renewable energy.
Be part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure.
About This Role:
This role is critical to ensuring industry-leading reliability and uptime for our cloud platform, directly impacting our ability to deliver innovative solutions to our customers. You'll be involved in exciting projects, from supporting the burn-in/stress testing of new hardware to troubleshooting complex server issues and collaborating with vendors. The ideal candidate is a highly skilled and experienced technician with a deep understanding of server hardware, a passion for problem-solving, and a commitment to maintaining peak performance in a fast-paced environment. This is a full-time position.
What You’ll Be Working On:
Troubleshooting & Repair: Diagnose and resolve hardware failures in complex GPU-based servers (both air and liquid-cooled), ensuring minimal downtime.
Hardware Testing & Qualification: Collaborate with the Infrastructure Systems team to support burn-in/stress testing of new hardware and resolve any issues that arise. Support the qualification of new hardware.
Vendor Management: Open and manage support tickets with hardware vendors, serve as the datacenter liaison for vendor support personnel, and maintain a hardware issue tracker.
Inventory Management: Maintain an accurate spares inventory and replenish stock as needed to ensure quick repairs.
Deployment Support: Assist the Cloud Deployments team with racking and cabling servers, contributing to the efficient expansion of our infrastructure.
Documentation & Communication: Maintain detailed records of hardware issues and resolutions, and communicate effectively with internal teams and vendors.
Physical Demands: Work in a physically challenging environment (sound/vibration/thermal) and be able to lift 50 lbs.
On-Call Support: Provide occasional after-hours support to address critical issues.
What You’ll Bring to the Team:
Server Hardware Expertise: Possess significant experience diagnosing and repairing complex GPU-based servers (both air and liquid-cooled).
Technical Proficiency: Demonstrate a deep understanding of server hardware, BMC-based manageability, BIOS settings, and firmware deployment.
Datacenter Experience: Have four or more years of hands-on experience working in a datacenter environment.
Networking Knowledge: Familiarity with Infiniband switches and network topology.
Linux Skills: Basic Linux system administration expertise.
Problem-Solving Abilities: Excellent analytical and problem-solving skills to effectively troubleshoot hardware issues.
Communication Skills: Strong organizational, time management, and communication skills.
Education: Associates Degree or equivalent experience in an IT-related field.
Bonus Points:
Experience with other high-performance computing (HPC) technologies.
Relevant certifications (e.g., CompTIA Server+, CCNA).
Experience with scripting languages (e.g., Python, Bash).
Knowledge of datacenter infrastructure management (DCIM) tools.
Experience working in a fast-growing startup environment.
Familiarity with various cooling systems used in data centers.
Experience with liquid cooling systems.
Benefits:
Crusoe offers a comprehensive benefits package designed to support well-being and financial security. This includes full social security coverage, contributions to provident, trade union, and pension funds, with options for additional pensions. Employees also have optional access to Global Life Insurance and private health insurance. Crusoe provides generous leave policies, including maternity, paternity, parental, and sick leave, ensuring you have the support you need at every stage of life.
Compensation:
Compensation will be paid as salary or hourly. Compensation to be determined by the applicant’s education, experience, knowledge, skills, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.