Machine Learning Engineer

November 21, 2024

Are you applying to the internship?

Job Description

About the Company:

Cisco ThousandEyes is a Digital Experience Assurance platform that helps organizations ensure flawless digital experiences across all networks, even those they don’t directly manage. Leveraging AI and a comprehensive collection of data from cloud, internet, and enterprise networks, ThousandEyes allows IT teams to proactively identify, diagnose, and resolve issues before they impact end-users. The platform is deeply integrated with the broader Cisco technology portfolio, enabling large-scale deployments and providing AI-driven insights across networking, security, collaboration, and observability solutions.

About the Job: Machine Learning Engineer for the Alerts Team

This role places you at the forefront of AI/ML technology and real-time data processing. As a Machine Learning Engineer, you will focus on developing and refining anomaly detection algorithms for a highly scalable stream processing platform. This position combines the challenges of managing massive datasets with the innovation of applied machine learning to deliver valuable insights to customers.

Responsibilities:

Collaborate with a team of engineers to design, implement, and maintain large-scale AI/ML pipelines for real-time anomaly detection.
Develop and tune machine learning models, including Deep Learning models, Machine Learning (AI/ML) Models, and Large Language Models, to detect anomalies across billions of events.
Design and implement sophisticated anomaly detection algorithms (e.g., Isolation Forests, LSTM-based models, Variational Autoencoders) tailored to the unique characteristics of the data streams.
Create robust evaluation frameworks and metrics to rigorously assess the performance of these algorithms.
Implement and optimize stream processing solutions using technologies like Flink and Kafka.
Work with exceptionally diverse and large-scale data, pushing the boundaries of real-time anomaly detection.

Qualifications:

3-5 years of software development experience and a minimum of 2 internships with direct experience in building and evaluating ML models and delivering large-scale ML products.
MS or PhD in a relevant field (e.g., Computer Science, Machine Learning, Data Science).
Proficiency in crafting machine learning models, including neural networks (transformer models, Large Language Models), decision trees, and other traditional machine learning techniques. Ability to translate concepts into working solutions.
Fluency in machine learning frameworks such as SKLearn, XGBoost, PyTorch, or TensorFlow. Ability to utilize code strategically for innovative solutions.
Proficiency in Python and ability to translate abstract machine learning concepts into robust, efficient, and scalable code.
Strong Computer Science fundamentals and object-oriented design skills.
Experience building large-scale data processing systems.
Experience working in a fast-paced development environment.
Strong team collaboration and communication skills.