Senior Software Engineer, AI Platform

April 2, 2026

Job Description

Are you a passionate and innovative Senior Software Engineer with a deep interest in artificial intelligence and scalable platforms? We are seeking a highly motivated individual to join our dynamic AI Platform team, where you will play a pivotal role in designing, developing, and maintaining the core infrastructure that powers our cutting-edge AI products and services. This is an incredible opportunity to contribute to a platform that impacts millions of users, working with state-of-the-art technologies and collaborating with brilliant minds.

About Our Team

Our AI Platform team is at the heart of innovation, building robust, high-performance, and scalable systems that enable our data scientists and machine learning engineers to rapidly develop, deploy, and monitor AI models. We foster a culture of technical excellence, continuous learning, and collaborative problem-solving, all while pushing the boundaries of what’s possible with AI.

The Opportunity

As a Senior Software Engineer on the AI Platform team, you will be instrumental in evolving our AI infrastructure. You will work on critical components such as distributed training systems, model serving engines, feature stores, data pipelines, and MLOps tools. Your expertise will directly influence the speed, reliability, and efficiency with which our AI models are brought to production and deliver value to our customers. This role demands a blend of strong software engineering fundamentals, distributed systems knowledge, and a keen understanding of the machine learning lifecycle.

Key Responsibilities

  • Design and Development: Lead the design, architecture, and implementation of scalable, reliable, and high-performance software for our core AI platform components, including distributed training frameworks, inference engines, and data management systems.
  • System Optimization: Identify and resolve performance bottlenecks, ensuring the efficient operation of the AI platform across various environments and workloads.
  • Collaboration: Work closely with data scientists, machine learning engineers, and other engineering teams to understand their needs, gather requirements, and deliver solutions that accelerate their workflows.
  • Code Quality and Best Practices: Champion best practices in software development, including code reviews, testing, documentation, and continuous integration/continuous deployment (CI/CD).
  • Technical Leadership: Mentor junior engineers, contribute to technical strategy, and drive the adoption of new technologies and methodologies within the team.
  • Troubleshooting and Support: Provide expert-level support for platform issues, debugging complex problems in production environments.
  • Innovation: Research and evaluate new technologies and tools to continuously improve our AI platform capabilities and maintain our competitive edge.

What You Bring

Required Skills & Experience:

  • Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
  • Experience: 5+ years of professional experience in software development, with a significant focus on building scalable backend systems or infrastructure.
  • Programming Proficiency: Expert-level proficiency in at least one modern programming language (e.g., Python, Java, Go, Scala). Python is highly preferred given its prevalence in AI/ML.
  • Distributed Systems: Strong understanding and practical experience with distributed systems concepts, microservices architectures, and cloud-native development (e.g., Kubernetes, Docker).
  • Cloud Platforms: Hands-on experience with major cloud providers (e.g., AWS, GCP, Azure) and their relevant services.
  • Data Technologies: Experience with large-scale data processing technologies (e.g., Spark, Flink, Kafka) and various database systems (SQL and NoSQL).
  • Problem Solving: Excellent analytical and problem-solving skills, with the ability to tackle complex technical challenges independently and collaboratively.
  • Communication: Strong verbal and written communication skills, with the ability to explain complex technical concepts to both technical and non-technical audiences.

Preferred Skills & Experience:

  • MLOps Knowledge: Familiarity with MLOps principles and tools (e.g., MLflow, Kubeflow, SageMaker, TFX).
  • Machine Learning Fundamentals: Basic understanding of machine learning concepts, algorithms, and common frameworks (e.g., TensorFlow, PyTorch).
  • Performance Engineering: Experience with performance tuning and optimization of large-scale systems.
  • Open Source Contributions: Prior contributions to open-source projects, especially in the AI/ML or infrastructure domain.

Why Join Our Team?

  • Impactful Work: Contribute to a platform that is critical to the success of cutting-edge AI products, directly influencing user experience and business outcomes.
  • Innovation: Work at the forefront of AI and distributed systems technology, with opportunities to research and implement novel solutions.
  • Growth Opportunities: Learn continuously in an environment that offers professional development, mentorship, and career advancement.
  • Collaborative Culture: Join a supportive and inclusive team that values knowledge sharing, creativity, and mutual respect.
  • Comprehensive Benefits: Enjoy a competitive salary, generous health benefits, paid time off, and other perks designed to support your well-being.

Ready to Apply?

If you are excited by the prospect of building the future of AI infrastructure and possess the skills and passion we are looking for, we encourage you to apply! Please submit your resume and a cover letter outlining your relevant experience and why you believe you’d be a great fit for our team. We look forward to reviewing your application!