Are you applying to the internship?
Job Description
About Safari AI
The Company’s vision is to Automate Action of the leading companies in the physical economy, from Entertainment, QSR’s, Retail and beyond. It’s using computer vision AI to Measure, Alert and provide AI-generated Recommendations at the operations of leading companies such as Merlin/Legoland, 7-11, Tanger Outlets, Manhattan Mini Storage, Charlotte Hornets, Calgary Flames, and more. In the near future, Safari AI will use this data and to suggest how its clients can optimize its SOPs to generate more revenue and create more valuable guest and staff experiences.
Safari AI is seed-funded by leading venture capital investors and expects to raise its Series A in 2025.
Leadership & Culture: Safari AI is co-founded by Ali Vahabzadeh & Kaiwen Yuan, two leaders who have meaningful exits under their belts and have managed large, high-performing Go To Market and Engineering teams. The company is headquartered in NYC where the GTM team is based and works from the office five days a week, while Engineering is distributed between North America and Brazil.
About The Role
We are seeking computer vision interns to join our team for 6-12 months, with the opportunity to become a full-time team member afterwards pending performance. Interns will work closely with our engineering team on deploying new locations and features for our clients. Interns will have an opportunity to work with cutting-edge AI technology and to be part of a dynamic and innovative team.
What You Will Be Working On Weekly Basis
• Implement computer vision solutions in the areas of object detection and tracking, scene segmentation, scene understanding, and depth estimation
• Build solutions that are robust to camera motion, occlusions, poorly exposed scenes
• Work with a growing team of infrastructure and machine learning engineers of various levels
• Define labeling ontologies and create training, validation, and testing sets across customer sites and weather conditions
• Onboard customers with machine learning solutions and calibrate models for satisfactory accuracy
Requirement for the Role
• Only students who can commit 8 months (2025/08-2026/04) will be screened
• Senior undergraduate or graduate school students in computer science or relevant field with exposure to classic and modern computer vision and machine learning techniques
• Excellent written and verbal communication skills
• Self-motivated, critical thinking and enthusiastic in solving real-world problems
• Python experience/expertise training, evaluating, and deploying models with one of more common deep learning framework such as PyTorch or Tensorflow
• Sufficient familiarity with traditional computer vision and machine learning techniques with good mathematics foundations and geometry knowledge
• Deep insight and experience on modern computer vision and machine learning techniques (e.g., object detection, multi-object tracking)
• Familiar with popular computer vision solution related frameworks, such as opencv, gstreamer, ffmpeg, deepstream, tensorrt, etc..
• Experience with Linux environment and targeting embedded deployment
• Experience with public cloud such as GCP, Azure or AWS
• Ability to design, implement, present, and operate independently without oversight
• Good business insight and exceptional analytical skills
Nice to Have’s
• Experiences with data streaming frameworks, such as flink, spark, beam, etc.
• Experiences in startup environment
• Experiences with embedded platforms such as NVIDIA Jetson, Intel Movidus, etc.
• Fluent in Java or Typescript
• Publication records on top ML/CV conferences / journals such as CVPR, NeurIPS and ICCV