Are you applying to the internship?
Job Description
About the Role
The Speech team’s mission is to empower content interaction and creation using speech & audio related technologies, focusing on cutting-edge R&D in areas like speech & audio, music processing, and multimodal deep learning. This role involves improving the performance and efficiency of large-scale AI models across training, inference, and deployment, providing hands-on experience in high-performance ML systems to enhance user experience across ByteDance products.
What You’ll Do
• Support research and engineering efforts to optimize deep learning models for speed, memory, and scalability.
• Contribute to benchmarking and profiling tools to identify performance bottlenecks.
• Collaborate with engineers to integrate optimized models into production pipelines.
You’re a Good Fit If You
• Currently pursuing a Bachelor’s or Master’s degree in Computer Science or a related field.
• Must be able to commit to a 12-week full-time work period during Fall 2026
• Strong programming skills in Python and familiarity with deep learning frameworks (PyTorch, TensorFlow, JAX).
• Knowledge of fundamental ML concepts and algorithms.
• Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.
Bonus Qualifications
• Experience with AI model training/inference/deployment is a plus.
• Familiarity with GPU programming (CUDA, Triton, or similar) is a plus.
• Strong problem-solving skills and eagerness to learn.
Role Highlights & Compensation
• The hourly rate range for this position in the selected city is $42.75- $42.75.
• Interns have day one access to health insurance, life insurance, wellbeing benefits and more.
• Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in first half of year, 40 if hired in second half of year).
• Interns who are not working 100% remote may also be eligible for housing allowance.