Research Intern – AI Interpretability

Engineering

San Francisco, CA

June 28, 2026

Internship Full Time

Apply Now

Are you applying to the internship?

Job Description

Research Intern: Interpretability & Reliability (Summer 2027) | CTGT

The Tone:
This is a full-time internship at CTGT, located in San Francisco. CTGT builds a deterministic governance layer for AI, enabling important global institutions to deploy AI workflows with confidence. This role is crucial because frontier AI models, while often right, are occasionally confidently wrong in ways that make them useless for high-stakes environments. This intern will help close that critical gap by owning a hard research problem to make systems provably reliable.

The TL;DR
• Role: Internship
• Type: Full-time
• Location: In-person, San Francisco, CA

• Team: Research function, sitting directly with engineers building the Policy Engine.
• Mission: Own one hard research problem inside the interpretability and reliability program from end to end.
• Tech Stack: Python, Rust, Node/TypeScript, React, PostgreSQL, vector, graph databases, Docker, Kubernetes, Terraform, PyTorch.

What You’ll Actually Do
• Implement: Stress-test methods for feature extraction and runtime intervention, ensuring they work repeatably across model families.
• Design: Create evaluations that bound error, focusing on calibration under imbalanced data and verifiable task regimes.
• Research: Read relevant literature, determine what matters, reproduce findings, and push past existing theories.
• Collaborate: Work with engineering to transform research findings into capabilities for the Policy Engine, shipping into audited, high-stakes environments.
• Present: Defend your reasoning and present progress every week in research reviews.

The Must-Haves
• Background: Pursuing a degree (Bachelor’s through PhD) in computer science, mathematics, the sciences, or a similarly unforgiving quantitative field; Student level.
• Experience: Experience writing real code for real computational systems; demonstrated fluency with PyTorch and the modern ML stack, or a track record proving quick learning ability.
• Skills: Strong mathematical foundations including linear algebra, probability, optimization, and information theory; ability to read a paper, discern relevant information, and implement findings; self-directed with the capacity to make real progress independently.
• Bonus: Drawn to interpretability, model internals, and developing systems that are provably reliable rather than just “usually fine.”

Date Posted

June 28, 2026
Location

San Francisco, CA
Expiration date

--
Gender

Neutral
Qualification

Bachelor Degree
Career Level

Student

AI Resume Builder

LinkedIn Optimizer

AI Cover Letter Trending

AI Mock Interview Trending

EzApply Chrome Extension

AI Pitch Generator New

Research Intern – AI Interpretability

Are you applying to the internship?

Job Description

Related Jobs

PhD Research Intern – Low-Power Sensing & ML

PhD Intern – GPU Infrastructure Management

Structural Engineering Intern

PhD Intern – LLM Security Research

Products

For Candidates

Company

Welcome to Internexxus

Reset Password

Welcome to Internexxus

Research Intern – AI Interpretability

Are you applying to the internship?

Job Description

Share this post

Related Jobs

Login to Internexxus

Reset Password

Create a free Internexxus account

Products

For Candidates

Company