Data Scientist

August 14, 2024
$179,900 / year

Description

About Deloitte

Deloitte is a global professional services network focused on audit, consulting, financial advisory, risk advisory, and tax services. It is known for a diverse, equitable, and inclusive culture that empowers employees to make an impact, and it is committed to advancing sustainability, equity, and trust through its work.

Description: Data Scientist

Position Summary:

This position is within the GPS GSi group at Deloitte and involves developing and maintaining data science and business intelligence solutions. You will be responsible for creating and deploying machine learning, deep learning, and generative AI initiatives to help enhance decision-making processes for Enabling Area professionals. This role requires expertise in data science, the ability to work independently, and strong problem-solving skills.

Responsibilities:

• Data Science Product Development: Participate in and/or lead the development of data science products that meet client needs, transforming those needs into quantifiable data science solutions.
• Independent Problem-Solving: Independently carry out tasks, using critical thinking and problem-solving skills to devise effective solutions for unique challenges.
• Model Design and Deployment: Design, train, and deploy machine learning and deep learning models to AWS, Databricks, and Dataiku platforms.
• LLM Development and Consultation: Develop, design, and/or advise on Large Language Model (LLM) solutions for enterprise-wide documentation (e.g., Retrieval-Augmented Generation (RAG), Continued Pre-training (CPT), Supervised Fine-tuning (SFT)).
• MLOps Implementation: Utilize Machine Learning Operations (MLOps) pipelines, including knowledge of containerization (Docker) and CI/CD for training and deploying models.
• Code Development: Write clean, heavily commented code that others can easily understand, share, and collaborate on.
• Project Documentation: Maintain structured documentation of project development stages, including the use of GitHub and/or Jira for version control and project management.
• Communication and Expertise: Demonstrate strong communication skills, with the ability to provide expertise and break down complex analytical solutions for clients.
• Industry Knowledge: Remain current with the latest industry trends and developments in data science and/or related fields, with the ability to learn new skills and knowledge to advance the skillset of the Data Science team.
• Quality Assurance: Apply thorough attention to detail, and carefully review data science solutions for accuracy and quality.

Qualifications:

Required:

• Bachelor’s Degree in Statistics, Mathematics, Computer Science, Engineering or other analytical field
• 6+ years of experience with data science, including a deep knowledge and mastery of Python, machine learning & deep learning, and associated data science packages (e.g., sklearn, TensorFlow, PyTorch). Demonstrated experience leading data science projects from inception to deployment.
• Strong knowledge of LLMs and RAG.
• Familiarity with AWS, Databricks, and/or Dataiku platforms.
• Working knowledge of MLOps, including familiarity with containerization (e.g., Docker).
• Excellent troubleshooting skills and the ability to work independently.
• Strong organizational skills, including clear documentation of projects and the ability to write clean code.
• Familiarity with agile project methodology and/or project development lifecycle.
• Experience with GitHub for version control.
• Excellent communication and presentation skills, with the ability to explain complex data science concepts to non-technical audiences.
• Ability to complete work in an acceptable timeframe and manage a variety of detailed tasks and responsibilities simultaneously and with accuracy to meet deadlines, goals, and objectives and satisfy internal and external customer needs related to the job.
• Must be legally authorized to work in the United States without the need for employer sponsorship, now or at any time in the future.

Desired:

• Master’s Degree in Statistics, Mathematics, Computer Science, Engineering or other analytical field, OR comparable direct work experience.
• Significant experience with MLOps and associated serving frameworks (e.g., Flask, FastAPI) and orchestration pipelines (e.g., SageMaker Pipelines, Step Functions, Metaflow).
• Significant experience working with open-source LLMs (e.g., serving via TGI / vLLM, performing CPT and/or SFT).
• Experience using various AWS Services (e.g., Textract, Transcribe, Lambda).
• Proficiency in basic front-end web development (e.g., Streamlit).
• Knowledge of Object-Oriented Programming (OOP) concepts.

Compensation:

• A reasonable estimate of the current range is $97,600.00-$179,900.00.
• You may also be eligible to participate in a discretionary annual incentive program.
