At Mistral AI, we are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world. Our mission is to make AI ubiquitous and open
We are creative, low-ego, team-spirited, and have been passionate about AI for years
We hire people who thrive in competitive environments, because they find them more fun to work in
We hire passionate women and men from all over the world
Our teams are distributed between France, UK and USA Mistral AI is hiring an expert in the role of pre-training and fine-tuning large language models
Key Responsibilities
Modifying pre-trained large language models to make them able to interact with humans
Equipping large language models with the ability of calling external tools
Aligning large language models based on feedback obtained during their deployment, or going through an ad-hoc annotation process
Designing ad-hoc annotation processes themselves
Participating to the pre-training effort.
Qualifications & Profile
High scientific understanding of the field of generative AI. This means a broad knowledge of the field of AI, and specific knowledge or interest in fine-tuning and using language models for applications
High technical engineering competence. This means being able to design complex software and make them usable in production
Be able to navigate the full MLOps technical stack, with a focus on architecture development and model evaluation and usage
Occasionally be able to do front-end development, and have to use complex HPC infrastructure with full autonomy
Hands-on experience with AI frameworks and tools (e.g., TensorFlow, PyTorch, Jax)
Have experience working with large distributed systems