Crusoe is on a mission
to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens — to power the world’s most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.
We’re in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We’re solving that — with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.
We’re looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved — people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.
If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.
About This Role:
Crusoe is seeking a visionary Senior Staff Engineer to join our Model LifeCycle team, where you will architect the backbone of our managed AI application platform. In this high-impact role, you will lead the development of a comprehensive ecosystem for the entire model development lifecycle, specifically optimized for Large Language Models (LLMs) and advanced Machine Learning workflows. By building these core systems from first principles, you will empower developers to harness Crusoe’s sustainable high-performance computing power to build the next generation of AI-driven applications.
As a technical leader, you will experience significant 0 → 1 ownership, designing and implementing mission-critical abstractions and APIs that define how models are trained, managed, and deployed at scale. This is a full-time position for a foundational engineer who is passionate about blending deep AI expertise with robust systems engineering to solve the industry’s most challenging infrastructure hurdles.
What You’ll Be Working On:
- Model Fine-Tuning Orchestration: Design and manage sophisticated fine-tuning systems for large foundation models, incorporating SFT, PEFT, LoRA, and adapters while ensuring multi-node orchestration, checkpointing, and cost-efficient scaling.
- End-to-End Training Pipelines: Implement and maintain robust training rimes for LLMs, including distillation and reinforcement learning pipelines such as preference optimization (PPO/DPO) and reward modeling.
- Agent & Execution Infrastructure: Build and scale the underlying infrastructure required for reliable agent execution and complex model-driven workflows.
- Lifecycle Management: Develop comprehensive systems for dataset, model, and experiment management, ensuring rigorous versioning, lineage, and reproducible fine-tuning at an enterprise scale.
- Strategic Architectural Leadership: Influence long-term decisions regarding training runtimes, scheduling, and storage, shaping the core abstractions that will define Crusoe’s platform.
- Cross-Functional Collaboration: Partner closely with product, business, and platform teams to translate complex technical requirements into intuitive, high-performance system APIs.
- Ecosystem Engagement: Actively contribute to and engage with the open-source LLM community to ensure Crusoe remains at the forefront of AI infrastructure innovation.
What You’ll Bring to the Team:
- Advanced Technical Foundation: An advanced degree (Masters or PhD) in Computer Science, Engineering, or a related technical field.
- Deep Industry Experience: 8–12+ years of professional experience driving high-impact engineering projects, with a significant portion dedicated specifically to the AI/ML space.
- Cloud Infrastructure Expertise: Expert-level proficiency in leveraging cloud-based services, including elastic compute, object storage, virtual networking, and managed databases to build scalable systems.
- Generative AI Mastery: Deep technical experience in Generative AI, specifically focusing on the infrastructure requirements for LLM training and large-scale inference.
- Rapid Project Delivery: A proven track record of architecting and delivering 0 → 1 projects under tight deadlines while maintaining high engineering standards.
- Collaborative Leadership: Strong interpersonal skills with a proactive approach to autonomy, mentorship, and cross-functional problem-solving.
Bonus Points:
- Production Language Proficiency: Advanced skills in Golang or Python specifically for building large-scale, production-ready services.
- Open-Source Contributions: Active contributions to prominent AI projects such as vLLM, DeepSpeed, or similar high-performance frameworks.
- Hardware Optimization: Experience with GPU performance tuning, CUDA kernels, or specialized inference framework optimizations.
- Framework Expertise: Deep hands-on experience with PyTorch and specialized libraries for LLM training and fine-tuning.
- Aspirational Mindset: A visible passion for solving “impossible” technical problems and a desire to build cutting-edge products that redefine the AI landscape.
Benefits:
- Competitive compensation
- Restricted Stock Units
- Paid time off & paid holidays
- Comprehensive health, dental & vision insurance
- Employer contributions to HSA account
- Paid parental leave
- Paid life insurance, short-term and long-term disability
- Professional development & tuition reimbursement
- Mental health & wellness support
- Commuter benefits (parking & transit)
- Cell phone stipend
- 401(k) Retirement plan with company match up to 4% of salary
- Volunteer time off
Compensation Range
Compensation will be paid in the range of up to $237,600 – $288,000 + Bonus. Restricted Stock Units are included in all offers. Compensation to be determined by the applicants knowledge, education, and abilities, as well as internal equity and alignment with market data.
Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/ orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.