Senior Software Engineer, Managed AI
Company: Crusoe Energy Systems LLC
Location: San Francisco
Posted on: April 14, 2025
Job Description:
Crusoe is building the World's Favorite AI-first Cloud
infrastructure company. We're pioneering vertically integrated,
purpose-built AI infrastructure solutions trusted by Fortune 500
companies to power their most advanced AI applications. Crusoe is
redefining AI cloud infrastructure, with a mission to align the
future of computing with the future of the climate. Our AI platform
is recognized as the "gold standard" for reliability and
performance. Our data centers are optimized for AI workloads and
are powered by clean, renewable energy.Be part of the AI revolution
with sustainable technology at Crusoe. Here, you'll drive
meaningful innovation, make a tangible impact, and join a team
that's setting the pace for responsible, transformative cloud
infrastructure.About This Role:The Crusoe Cloud Managed AI team
seeks an ambitious and experienced Senior Software Engineer to join
their team. You'll have a pivotal role in shaping the architecture
and scalability of our next-generation AI inference platform. You
will lead the design and implementation of core systems for our AI
services, including resilient fault-tolerant queues, model
catalogs, and scheduling mechanisms optimized for cost and
performance. This role gives you the opportunity to build and scale
infrastructure capable of handling millions of API requests per
second across thousands of customers.From day one, you'll own
critical subsystems for managed AI inference, helping to serve
large language models (LLMs) to a global audience. As part of a
dynamic, fast-growing team, you'll collaborate cross-functionally,
influence the long-term vision of the platform, and contribute to
cutting-edge AI technologies. This is a unique opportunity to build
a high-performance AI product that will be central to Crusoe's
business growth.What You'll Be Working On:
- Design and develop core infrastructure for the Managed AI
platform:
- Lead the design and implementation of resilient fault-tolerant
queues, model catalogs, and scheduling mechanisms.
- Build and optimize systems for high-volume AI inference,
handling millions of API requests per second.
- Develop and maintain core services for AI model deployment and
management.
- Collaborate with cross-functional teams:
- Work closely with product managers, data scientists, and other
engineers to define and deliver AI services.
- Integrate with other Crusoe Cloud services to provide a
seamless user experience.
- Collaborate with research and development teams to explore and
implement new AI technologies.
- Ensure high availability and performance:
- Design and implement systems with high availability, low
latency, and fault tolerance.
- Optimize performance across all stages of the AI inference
pipeline, including model loading, execution, and response
handling.
- Continuously monitor and improve the performance and
reliability of AI services.What You'll Bring to the Team:
- Strong Engineering Foundations:
- Advanced degree in Computer Science, Engineering, or a related
field.
- Proven experience in distributed systems design and
implementation.
- Expertise in using cloud-based services, such as elastic
compute, object storage, virtual private networks, managed
databases, etc.
- Experience with container runtimes (e.g., Kubernetes) and
microservices architectures.
- Experience using REST APIs and common communication protocols,
such as gRPC.
- Demonstrated experience in the software development cycle and
familiarity with CI/CD tools.
- AI/ML Expertise:
- Experience in Generative AI (Large Language Models,
Multimodal).
- Technical Skills:
- Proficiency in Golang or Python for large-scale,
production-level services. (Preferred)
- Familiarity with AI infrastructure, including training,
inference, and ETL pipelines. (Preferred)
- Contributions to open-source AI projects such as VLLM or
similar frameworks. (Preferred)
- Performance optimizations on GPU systems and inference
frameworks. (Preferred)
- Soft Skills:
- Proven track record of delivering early-stage projects under
tight deadlines.
- Strong communication and collaboration skills.
- Proactive and results-oriented with the ability to work
independently and as part of a team.
- Passion for building high-quality, scalable, and impactful
products.Bonus Points:
- Experience with AI/ML frameworks such as TensorFlow, PyTorch,
or Hugging Face Transformers.
- Knowledge of machine learning algorithms and data
structures.
- Experience with data streaming and real-time processing
technologies.
- Contributions to open-source projects outside of AI.Benefits:
- Hybrid work schedule
- Industry competitive pay
- Restricted Stock Units in a fast growing, well-funded
technology company
- Health insurance package options that include HDHP and PPO,
vision, and dental for you and your dependents
- Employer contributions to HSA accounts
- Paid Parental Leave
- Paid life insurance, short-term and long-term disability
- Teladoc
- 401(k) with a 100% match up to 4% of salary
- Generous paid time off and holiday schedule
- Cell phone reimbursement
- Tuition reimbursement
- Subscription to the Calm app
- MetLife Legal
- Company paid commuter benefit; $50 per pay
periodCompensation:Compensation will be paid in the range of
$183,000 - $210,000 base salary. Restricted Stock Units are
included in all offers. Compensation to be determined by the
applicants knowledge, education, and abilities, as well as internal
equity and alignment with market data.Crusoe is an Equal
Opportunity Employer. Employment decisions are made without regard
to race, color, religion, disability, genetic information,
pregnancy, citizenship, marital status, sex/gender, sexual
preference/ orientation, gender identity, age, veteran status,
national origin, or any other status protected by law or
regulation.
#J-18808-Ljbffr
Keywords: Crusoe Energy Systems LLC, San Francisco , Senior Software Engineer, Managed AI, IT / Software / Systems , San Francisco, California
Didn't find what you're looking for? Search again!
Loading more jobs...