Job Description
Site Reliability Engineer Intern (Compute Platform) – 2026 Summer (BS/MS) | TikTok
The Tone
This is a 12-week internship at TikTok, with flexible start dates available in Summer 2026. TikTok is a leading destination for short-form mobile video, dedicated to inspiring creativity and bringing joy to its global audience. This role is essential for ensuring the reliability of the company’s critical Big Data services and products, and it helps shape the future of a newly established Compute Platform SRE team. Interns will gain industry exposure and hands-on experience, applying their knowledge in real-world scenarios to build a strong foundation for professional growth.
The TL;DR
• Role: Internship
• Type: Temporary (12 weeks)
• Location: Not specified
• Pay: $42.75 hourly
• Team: Compute Platform SRE team, supporting Big Data services and products across the company.
• Mission: Ensure the reliability of TikTok’s major data warehouse products, services, and query engines.
• Tech Stack: ClickHouse, Spark, Presto, Doris, Hadoop, Kubernetes, Python, Shell, Java, Go
What You’ll Actually Do
• Reliability: Ensure the reliability of TikTok’s major data warehouse products, services, and query engines, including ClickHouse, Spark, Presto, and Doris.
• SLA Compliance: Uphold Service Level Agreements (SLAs) for ByteDance’s Data Platform services and promptly respond to any system outages or issues.
• Performance Optimization: Analyze service performance and reliability patterns to identify bottlenecks, implement proactive measures, and work with development teams to optimize application performance.
• Incident Management: Lead efforts to troubleshoot and resolve service incidents and to conduct postmortems, coordinating with cross-functional teams to mitigate service-impacting events.
• Infrastructure Automation: Automate infrastructure provisioning, scaling, and management processes to reduce manual interventions and improve overall service quality.
The Must-Haves
• Background: Currently pursuing an Undergraduate or Master’s degree in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
• Experience: Familiarity with open-source or commercial technologies such as ClickHouse, Hadoop, Doris, Spark, Presto, and Kubernetes.
• Skills: In-depth understanding of Linux, computer networking, and databases; proficient in common SRE/DevOps open-source toolsets, system monitoring tools, and container orchestration platforms like Kubernetes; strong coding skills in at least one scripting or programming language (e.g., Python, Shell, Java, Go).
• Bonus: Excellent problem-solving skills and the ability to think critically under pressure; a strong customer-first mindset with a sense of ownership and collaborative ability; graduating December 2026 onwards with the intent to return to a degree program after the completion of the internship.