Ari Zachary Klein, PhD

faculty photo
Research Assistant Professor of Biostatistics and Epidemiology
Senior Data Analyst, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA
Department: Biostatistics and Epidemiology

Contact information
Department of Biostatistics, Epidemiology, and Informatics
Perelman School of Medicine
University of Pennsylvania
3600 Civic Center Blvd., 5E-313
Philadelphia, PA 19104
Education:
BA (Creative Writing/Philosophy)
Carnegie Mellon University, Pittsburgh, PA, 2008.
MA (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2010.
PhD (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2016.
Permanent link
 
> Perelman School of Medicine   > Faculty   > Details

Selected Publications

Klein AZ, Kunatharaju S, Golder S, Levine LD, Figueiredo JC, Gonzalez-Hernandez G.: Association Between COVID-19 During Pregnancy and Preterm Birth by Trimester of Infection: Retrospective Cohort Study Using Large-Scale Social Media Data. J Med Internet Res 27: e66097, Jul 2025 Notes: doi: 10.2196/66097.

Klein AZ, Weissenbacher D, O'Connor K, Elyaderani A, Amaro IF, Onishi T, Golder S, Spiegel K, Scotch M, Gonzalez-Hernandez G.: Detection of patient metadata in published articles for genomic epidemiology using machine learning and large language models. medRxiv Page: 25326298, Apr 2025 Notes: doi: 10.1101/2025.04.25.25326298.

Klein AZ, Banda JM, Guo Y, Schmidt AL, Xu D, Flores Amaro I, Rodriguez-Esteban R, Sarker A, Gonzalez-Hernandez G.: Overview of the 8th Social Media Mining for Health Applications (#SMM4H) shared tasks at the AMIA 2023 Annual Symposium. J Am Med Inform Assoc 31(4): 991-996, Apr 2024 Notes: doi: 10.1093/jamia/ocae010.

Lee CR, Aysola J, Chen X, Addisu E, Klein A, Weissenbacher D, Gonzalez-Hernandez G, Weissman GE.: Race and Ethnicity and Clinician Linguistic Expressions of Doubt in Hospital Admission Notes. JAMA Netw Open 7(10): e2438550, Oct 2024 Notes: doi: 10.1001/jamanetworkopen.2024.38550.

Sarker A, Klein AZ, Mee J, Harik P, Gonzalez-Hernandez G.: An interpretable natural language processing system for written medical examination assessment. J Biomed Inform 98: 103268, Oct 2019 Notes: DOI: 10.1016/j.jbi.2019.103268.

Klein AZ, Magge A, Gonzalez-Hernandez G.: ReportAGE: Automatically extracting the exact age of Twitter users based on self-reports in tweets. PLoS One 17(1): e0262087, Jan 2022 Notes: doi: 10.1371/journal.pone.0262087. eCollection 2022.

Klein AZ, Magge A, O'Connor K, Flores Amaro JI, Weissenbacher D, Gonzalez Hernandez G.: Toward Using Twitter for Tracking COVID-19: A Natural Language Processing Pipeline and Exploratory Data Set. J Med Internet Res 23(1): e25314, Jan 2021 Notes: doi: 10.2196/25314.

Golder, S., Chiuve, S., Weissenbacher, D., Klein, A., O’Connor, K., Bland, M., Malin, M., Bhattacharya, M., Scarazzini, L.J., & Gonzalez-Hernandez, G.: Pharmacoepidemiologic evaluation of birth defects from health-related postings in social media during pregnancy. Drug Saf 42(3): 389-400, Mar 2019 Notes: DOI: 10.1007/s40264-018-0731-6.

Klein AZ, Sarker A, Weissenbacher D, Gonzalez-Hernandez G.: Towards scaling Twitter for digital epidemiology of birth defects. NPJ Digit Med 2: 96, Oct 2019 Notes: doi: 10.1038/s41746-019-0170-5.

back to top
Last updated: 03/19/2026
The Trustees of the University of Pennsylvania