Are you applying to the internship?
Job Description
Research Scientist Intern (Compute Platform – Database Acceleration) | TikTok
The Tone:
This is an internship at TikTok, with a focus on deep technical research and development within the company’s Data Platform. TikTok is a leading global destination for short-form mobile video, inspiring creativity and bringing joy through its innovative product. This role is crucial for advancing TikTok’s core data infrastructure, specifically within the Compute Platform’s DB Boost team, which is dedicated to building next-generation database kernels for cloud-native data warehouses. Interns will contribute directly to products and research, influencing future plans and emerging technologies that underpin TikTok’s data-driven decision-making processes.
The TL;DR
• Role: Internship
• Location: In-person, Los Angeles, CA
• Pay: $36–$45 hourly
• Team: Data Platform – Compute Platform – DB Boost team
• Mission: To design and develop core features for cloud-native distributed analytical database systems, aiming for best-in-class query latency and hardware utilization, supporting data-driven decisions at TikTok.
• Tech Stack: C++, Rust, Velox, DataFusion, DuckDB, Polars, Clickhouse, Spark, Presto
What You’ll Actually Do
• Design: Design and develop kernel modules or core features for cloud-native distributed analytical database systems.
• Research: Conduct research for cutting-edge technology evolvement in database and data management areas and identify potential future directions.
• Problem Solving: Solve unique and complex problems on large-scale, high-performance cloud infrastructure.
• Collaboration: Partner with different teams on building backend support for cloud native database service.
The Must-Haves
• Background: Candidates must currently be pursuing or have recently obtained a PhD in Computer Science, a related STEM field, or a quantitative discipline. This role is suitable for final year PhD students or recent graduates with a foundational background in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
• Experience: Required experience includes proficiency with at least one system programming language, such as C++ or Rust. Preferred experience includes deep working or research background in OLAP databases and data management systems. This encompasses areas like database kernel development, query processing and optimization, storage system design, hardware acceleration techniques, and cloud-native infrastructure.
• Skills: Core technical skills include system-level programming (C++, Rust), a strong understanding of database internals, distributed systems, and performance optimization for large-scale data platforms. The ability to conduct independent research and apply findings to practical system development is also key.
• Bonus: Ideal candidates are graduating December 2026 or later, with the intention of returning to their degree program after the internship. Experience with open-source and industry database and data management systems like Velox, DataFusion, DuckDB, Polars, Clickhouse, Spark, or Presto is a plus. A passion for contributing to open-source communities, combined with high levels of creativity and quick problem-solving abilities, is highly valued.