GPU programming Expert (San Francisco)
Company: Mistral AI
Location: Palo Alto
Posted on: October 16, 2024
Job Description:
Mistral AI is hiring an expert in the role of serving and
training large language models at high speed on GPUs. The role is
based in San Francisco.The role will involve:
- Writing low-level code to take full advantage of high-end GPUs
(H100) and maximize their capacity.
- Rethinking various parts of the generative model architecture
to make them more suitable for efficient inference.
- Integrating low-level efficient code into a high-level MLOps
framework.The successful candidate will have:
- High technical competence in writing custom CUDA kernels and
pushing GPUs to their limits, along with high expertise in the
distributed computation infrastructure of current generation GPU
clusters.
- Overall understanding of the field of generative AI, with
knowledge or interest in fine-tuning and using language models for
applications.About Mistral:
- At Mistral AI, we are a tight-knit, nimble team dedicated to
bringing our cutting-edge AI technology to the world.
- Our mission is to make AI ubiquitous and open.
- Developers are using our API via la Plateforme to build
incredible AI-first applications powered by our models that can
understand and generate natural language text and code.
- We are multilingual at our core. We released le Chat as a
demonstrator of our models.
- We are creative, low-ego, team-spirited, and have been
passionate about AI for years.
- We hire people who thrive in competitive environments because
they find them more fun to work in.
- We hire passionate women and men from all over the world.
- Our teams are distributed between France, the UK, and the
USA.
#J-18808-Ljbffr
Keywords: Mistral AI, San Francisco , GPU programming Expert (San Francisco), Other , Palo Alto, California
Didn't find what you're looking for? Search again!
Loading more jobs...