Ari Zachary Klein, PhD
Research Assistant Professor of Biostatistics and Epidemiology
Senior Data Analyst, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA
Department: Biostatistics and Epidemiology
Contact information
Department of Biostatistics, Epidemiology, and Informatics
Perelman School of Medicine
University of Pennsylvania
3600 Civic Center Blvd., 5E-313
Philadelphia, PA 19104
Education:
BA (Creative Writing/Philosophy)
Carnegie Mellon University, Pittsburgh, PA, 2008.
MA (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2010.
PhD (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2016.
Selected Publications
Klein AZ, Kunatharaju S, Golder S, Levine LD, Figueiredo JC, Gonzalez-Hernandez G.: Association Between COVID-19 During Pregnancy and Preterm Birth by Trimester of Infection: Retrospective Cohort Study Using Large-Scale Social Media Data. J Med Internet Res 27: e66097, Jul 2025 Notes: doi: 10.2196/66097.
Klein AZ, Weissenbacher D, O'Connor K, Elyaderani A, Amaro IF, Onishi T, Golder S, Spiegel K, Scotch M, Gonzalez-Hernandez G.: Detection of patient metadata in published articles for genomic epidemiology using machine learning and large language models. medRxiv Page: 25326298, Apr 2025 Notes: doi: 10.1101/2025.04.25.25326298.
Klein AZ, Banda JM, Guo Y, Schmidt AL, Xu D, Flores Amaro I, Rodriguez-Esteban R, Sarker A, Gonzalez-Hernandez G.: Overview of the 8th Social Media Mining for Health Applications (#SMM4H) shared tasks at the AMIA 2023 Annual Symposium. J Am Med Inform Assoc 31(4): 991-996, Apr 2024 Notes: doi: 10.1093/jamia/ocae010.
Lee CR, Aysola J, Chen X, Addisu E, Klein A, Weissenbacher D, Gonzalez-Hernandez G, Weissman GE.: Race and Ethnicity and Clinician Linguistic Expressions of Doubt in Hospital Admission Notes. JAMA Netw Open 7(10): e2438550, Oct 2024 Notes: doi: 10.1001/jamanetworkopen.2024.38550.
Sarker A, Klein AZ, Mee J, Harik P, Gonzalez-Hernandez G.: An interpretable natural language processing system for written medical examination assessment. J Biomed Inform 98: 103268, Oct 2019 Notes: doi: 10.1016/j.jbi.2019.103268.
Klein AZ, Magge A, Gonzalez-Hernandez G.: ReportAGE: Automatically extracting the exact age of Twitter users based on self-reports in tweets. PLoS One 17(1): e0262087, Jan 2022 Notes: doi: 10.1371/journal.pone.0262087. eCollection 2022.
Klein AZ, Magge A, O'Connor K, Flores Amaro JI, Weissenbacher D, Gonzalez Hernandez G.: Toward Using Twitter for Tracking COVID-19: A Natural Language Processing Pipeline and Exploratory Data Set. J Med Internet Res 23(1): e25314, Jan 2021 Notes: doi: 10.2196/25314.
Golder S, Chiuve S, Weissenbacher D, Klein A, O'Connor K, Bland M, Malin M, Bhattacharya M, Scarazzini LJ, Gonzalez-Hernandez G.: Pharmacoepidemiologic evaluation of birth defects from health-related postings in social media during pregnancy. Drug Saf 42(3): 389-400, Mar 2019 Notes: doi: 10.1007/s40264-018-0731-6.
Klein AZ, Sarker A, Weissenbacher D, Gonzalez-Hernandez G.: Towards scaling Twitter for digital epidemiology of birth defects. NPJ Digit Med 2: 96, Oct 2019 Notes: doi: 10.1038/s41746-019-0170-5.