Roche

Senior Data Scientist

Hyderabad Full time

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections,  where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.

The Position

Position title: Senior Data Scientist
Reporting to: Senior Manager – Data Science
Job Location: Hyderabad / Chennai

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. 

That’s what makes us Roche.

Roche has established the Global Analytics and Technology Center of Excellence (GATE) to drive analytics- and technology-driven solutions by partnering with Roche affiliates across the globe. GATE enables data-led decision-making and innovation across healthcare and biotech operations.

As a Senior Data Scientist, you will lead and deliver high-impact data science projects that enable strategic decision-making within Roche’s healthcare and commercial ecosystem. Collaborating closely with cross-functional teams, your primary focus will be on Healthcare and Pharma Data Science and Marketing Mix Modeling (MMM) leveraging advanced statistical and machine-learning techniques to generate actionable insights and optimize business outcomes.

Exposure to GenAI, LLM, NLP, and ML Ops will be considered a strong advantage, supporting Roche’s future-ready analytics and AI roadmap.

Your Opportunity:

Project Leadership and Collaboration (Primary Focus):

  • Lead and manage end-to-end data science projects from problem definition to model deployment ensuring alignment with business goals and timelines

  • Partner with cross-functional teams to define business requirements and deliver tailored analytics solutions

  • Develop and maintain key stakeholder relationships to ensure effective communication and collaboration throughout project lifecycles

  • Present analytical findings and strategic recommendations to business stakeholders, influencing data-driven decision making across multiple levels of the organization

Advanced Analytics (Primary Focus): 

  • Collect, clean, and prepare large and complex healthcare-related datasets (product performance, patient data, operational metrics, etc.) for analysis

  • Develop and implement statistical and machine learning models (e.g., multivariate regression, time-series analysis, XGBoost, clustering, classification, causal inference) to address complex business problems and uncover meaningful insights

  • Utilize advanced data analytics techniques to explore and identify patterns, trends, and root causes, applying methodologies such as clustering, classification, and causal inference

Marketing and Experimental Analytics (Primary Focus):

  • Build Econometric/market mix models (MMM), multi-touch attribution models (MTA), optimize marketing spend and come-up with implementable recommendations and strategy/plan

  • Lead the development and implementation of advanced Media Mix Models to inform and optimize marketing spend across multiple channels (e.g., TV, digital, print, radio)

  • Design and execute complex statistical analyses to evaluate the effectiveness of marketing strategies and optimize resource allocation

  • Apply experimental design and A/B testing methodologies to validate and measure marketing and operational initiatives

GenAI, NLP, and Machine Learning Operations (Secondary / Emerging Focus): 

  • Develop and implement GenAI models and tools to solve business problems

  • Deploy machine learning models in production environments, ensuring robust ML Ops practices for model monitoring, maintenance, and scaling

  • Collaborate with IT and DevOps teams to streamline the integration of ML models into existing systems and workflows

  • Enhance agent performance through experimentation with LLMs, prompt tuning, and advanced reasoning workflows

Communication, Mentorship, and Governance (Primary Focus):

  • Translate complex data insights into clear and actionable business strategies that address stakeholder needs and expectations

  • Mentor and guide junior data scientists, providing technical expertise and fostering an environment of continuous learning and improvement

  • Promote best practices in coding, data handling, and project management within the data science team, ensuring high-quality deliverables

  • Ensure adherence to Roche’s ethical AI standards and data privacy regulations

Who you are:

  • You hold a bachelor's degree in Technology or a relevant discipline, with a preference for Computer Science, Software, Statistics, Data Science, AI, Machine Learning, Data Engineering and related fields. Preferably, you have a Master's degree

  • Certifications in AI/ML, Data Science, or related technologies would be a plus

  • You have 5-8 years of hands-on experience in data science, with proven experience in leading data science projects within the pharma/biotech/healthcare domain

  • Strong proficiency in Python and SQL, with experience in data wrangling, feature engineering, and analytical model development

  • Experience working in cloud-based environments (AWS preferred), with practical knowledge of GitHub and cloud computing workflows for data science projects

  • Hands-on experience building models using algorithms and techniques such as multivariate regression, time series analysis, XGBoost, clustering, classification, OLS regression, Naïve Bayes, linear and time-decay attribution models, Markov chains, and Shapley value methods

  • Experience in Multi channel Marketing Mix Modeling (MMM), or related fields, with a track record of delivering impactful results

  • At least 4 years of strong experience with machine learning libraries and frameworks such as TensorFlow, PyTorch, scikit-learn, Keras, spaCy, NLTK, NumPy, pandas, and Spark

  • Basic understanding of pharmaceutical datasets (e.g., IQVIA, SHA, Patients data) and familiarity with US healthcare markets would be a plus

  • Strong analytical and problem-solving skills with a data-driven mindset

Good to Have: 

  • Experience with modern deep learning architectures for NLP, including hands-on experience with transformers (e.g., Hugging Face, SBert, GPT-2/GPT-3) and the capability to build ML/DL pipelines for training/tuning models, including transfer learning

  • Proficiency with LLM APIs (e. g. , OpenAI, Claude, Gemini) and agent frameworks such as AutoGen, LangGraph, AgentBuilder, or CrewAI

  • Experience with prompt engineering or intelligent workflow automation

  • Bash/shell scripting is a plus, Experience with Docker, API development is advantageous


Note: This job description is intended as a general guideline for the responsibilities and qualifications required for this position. It is not an exhaustive list, and responsibilities may evolve and change based on business needs.

Who we are

A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.


Let’s build a healthier future, together.

Roche is an Equal Opportunity Employer.