About the job
Our Hubs are a crucial part of how we innovate, improving performance across every Sanofi department and providing a springboard for the amazing work we do. Build a career and you can be part of transforming our business while helping to change millions of lives. Ready? As a Clinical Modeler - Data Engineer within our Clinical Modeling and Evidence Integration team in Hyderabad, you will develop and use advanced tools to query different data sources (clinical trials, landscape data, etc.) and create standardized, analysis-ready datasets and data pipelines to support predictive clinical modeling.
We are an innovative global healthcare company with one purpose: to chase the miracles of science to improve people's lives. We're also a company where you can flourish and grow your career, with countless opportunities to explore, make connections with people, and stretch the limits of what you thought was possible. Ready to get started?
Main responsibilities:
Work with various data sources: internal and external clinical studies, study summary results data (ClinicalTrials.gov), and real-world data (RWD).
Implement literature search and tools (e.g., NLP, digitalization) for data curation, information extraction, etc.
Implement data quality checks.
Collaborate with cross-functional teams to understand complex data, analytics requirements and objectives.
Assist, and potentially work under the direction of, clinical modelers and ML data scientists.
About you
Experience:
Knowledge of or experience with accessing data platforms (potentially through APIs) and handling complex statistical or AI/ML models or large datasets, using parallel computing and cloud computing platforms.
Excellent Python and/or R programming skills are required.
Experience using version control systems (e.g., Git, GitHub).
Soft and technical skills:
Good interpersonal and communication skills.
Good understanding of statistical concepts. Some knowledge of clinical development is a plus.
Education: MS in statistics, computer science, data science, mathematics, or another related discipline.
Languages: Python, R, SQL, JSON. SAS is a plus.