Jinjoo Shim

Jinjoo Shim

Digital Health Data Scientist

ETH Zurich

About me

I’m a Digital Health Data Scientist. With 9+ years of work experience and solid academic background, my passion lies in exploring innovative digital health applications and biomarker research at the intersection of medicine, AI/ML, statistics, data science, and ubiquitous computing.

I’m currently pursuing a PhD in Applied Machine Learning in the Center for Digital Health Interventions at ETH Zurich.

Before ETH Zurich, I gained extensive expertise in diverse data modalities and analytical techniques in healthcare and medical research as I served as a Senior Data Scientist at Roche specializing in personalized healthcare, a Research Scientist at PRECISIONheor, and a Data Analyst/Biostatistician at Columbia University Medical Center. I obtained a Master’s degree in Biostatistics from Columbia University in the City of New York.

My PhD research focuses on digital biomarkers for aging and healthspan using wearables and digitalizing inflammatory biomarkers for systemic inflammation.

Interests
  • AI/ML in Healthcare
  • Digital Health/Digital Medicine
  • Digital Age/Healthspan Biomarker
  • Digital Inflammatory Biomarker
  • Precision Medicine
  • Pharmacoepidemiology
  • Multimodal, Real-world Data Analysis
Education
  • PhD in Applied Machine Learning, 2024

    ETH Zurich

  • MS in Biostatistics, 2012

    Columbia University in the City of New York

  • BS in International Health, 2010

    University of Alabama at Birmingham

Experience

 
 
 
 
 
Centre for Digital Health Interventions, ETH Zurich
Ph.D. Candidate and Doctoral Researcher
January 2022 – Present Zurich, Switzerland
 
 
 
 
 
Roche, Personalized Health Care Analytics
Senior Data Scientist
July 2018 – December 2021 Basel, Switzerland
Applying advanced statistical models and analytical methods to real-world data (e.g. Flatrion EHR, Flatiron-FMI clinical genomics database, national-level health insurance claims), I executed business-critical and R&D projects to advance personalized health care and access among cancer patients. Research areas include global access, health authority requests, comparative effectiveness, patient-oriented outcomes, genomics-enabled personalized healthcare, pharmacoepidemiology, and R package development.
 
 
 
 
 
PrecisionHEOR
Research Scientist
April 2014 – May 2018 San Francisco, USA
Provided analytical and statistical leadership for study design, data analysis, and manuscript preparation for health economics research evaluating cost-effectiveness, social value of treatment, drug utilization, efficacy, quality of care, and patient/provider characteristics.
 
 
 
 
 
Columbia University Medical Center
Data Analyst/Biostatistician
July 2012 – February 2014 New York, USA
Collaborated in cancer epidemiology research for studying cancer treatment, prevention, survivorship, late-effects of cancer therapy, health outcomes, and health disparities.
 
 
 
 
 
Columbia University Medical Center
Research Assistant
September 2011 – May 2022 New York, USA
Investigate the effect of social networks on adolescents and their friends’ disordered eating and muscle-enhancing behavior using a large survey from 2000+ youths.
 
 
 
 
 
Columbia University Department of Biostatistics
Teaching Assistant
September 2011 – January 2022 New York, USA

Research Areas

1.Digital biomarker for longevity & Healthspan

1.Digital biomarker for longevity & Healthspan

Applying AI/ML and statistical methods for predicting biological age and healthspan using wearable data

2.Explainable AI in biological age estimation

2.Explainable AI in biological age estimation

Applying SHAP (SHapley Additive exPlanations) method to identify factors influencing biological age using wearable data

3.Digital phenotyping to demonstrate the impact of circadian aging using wearable data

3.Digital phenotyping to demonstrate the impact of circadian aging using wearable data

Enhancing digital phenotyping through the application of AI/ML and statistical models to demonstrate using wearable, time-series data

4.Model-driven and data-driven disease classification

4.Model-driven and data-driven disease classification

Using AI/ML (CNN, LSTM, RF, DT, etc.) and statistical models (cosinor, SSA) to longitudinal, time-series physical activity data to classify different diseases

5.Digitization of Inflammatory Biomarkers for Systemic Inflammation

5.Digitization of Inflammatory Biomarkers for Systemic Inflammation

Development of digital, multi-modal, and non-invasive inflammatory biomarkers to improve early diagnosis and patient-centered disease monitoring for systemic inflammation

6.Precision medicine using multi-modal, real-world data

6.Precision medicine using multi-modal, real-world data

Addressing the unmet needs and advancing personalized therapies of cancer patients utilizing multi-modal, real-world data

Contact