Are you applying to the internship?
Job Description
About the Role
This role is for a Machine Learning Intern at Cohere, a company committed to scaling intelligence to serve humanity by training and deploying frontier models for AI developers and enterprises. The intern will ship state-of-the-art models to production, design and implement novel research ideas, and build elegant training/deployment pipelines, joining at a pivotal moment to shape what the company builds.
What You’ll Do
• Design, train and improve upon cutting-edge models.
• Help us develop new techniques to train and serve models safer, better, and faster.
• Train extremely large-scale models on massive datasets.
• Explore continual and active learning strategies for streaming data.
• Learn from experienced senior machine learning technical staff.
• Work closely with product teams to develop solutions.
You’re a Good Fit If You
• Are a student currently enrolled in a post-secondary program, available for a full-time 3-6 month internship, co-op, or research work term.
• Have proficiency in Python and related ML frameworks such as Tensorflow, TF-Serving, JAX, and XLA/MLIR.
• Have experience using large-scale distributed training strategies.
• Have familiarity with autoregressive sequence models, such as Transformers.
• Have strong communication and problem-solving skills.
• Have a demonstrated passion for applied NLP models and products.
Bonus Qualifications
• Experience writing kernels for GPUs using CUDA.
• Experience training on TPUs.
• Papers at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).