2e 2 5d 16
19
1b
34

Ari Zachary Klein, PhD

88 faculty photo 63
Research Assistant Professor of Biostatistics and Epidemiology
28
b4
Senior Data Analyst, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA
11
3 75
Department: Biostatistics and Epidemiology
4 1 b
1d
46 Contact information
68
Department of Biostatistics, Epidemiology, and Informatics
23 Perelman School of Medicine
22 University of Pennsylvania
42 3600 Civic Center Blvd., 5E-313
Philadelphia, PA 19104
26
35
f
4 3 3 3 2 4 b 1f
13 Education:
21 7 BA 28 (Creative Writing/Philosophy) c
43 Carnegie Mellon University, Pittsburgh, PA, 2008.
21 7 MA 15 (Rhetoric) c
43 Carnegie Mellon University, Pittsburgh, PA, 2010.
21 8 PhD 15 (Rhetoric) c
43 Carnegie Mellon University, Pittsburgh, PA, 2016.
c
3 3 3 3 8d Permanent link
2 29
 
1d
25
21
b6 > Perelman School of Medicine   > Faculty   > Details a
1e 1d
2b 29
23

Selected Publications

17a Klein AZ, Kunatharaju S, Golder S, Levine LD, Figueiredo JC, Gonzalez-Hernandez G.: Association Between COVID-19 During Pregnancy and Preterm Birth by Trimester of Infection: Retrospective Cohort Study Using Large-Scale Social Media Data. J Med Internet Res 27: e66097, Jul 2025 Notes: doi: 10.2196/66097.

190 Klein AZ, Weissenbacher D, O'Connor K, Elyaderani A, Amaro IF, Onishi T, Golder S, Spiegel K, Scotch M, Gonzalez-Hernandez G.: Detection of patient metadata in published articles for genomic epidemiology using machine learning and large language models. medRxiv Page: 25326298, Apr 2025 Notes: doi: 10.1101/2025.04.25.25326298.

186 Klein AZ, Banda JM, Guo Y, Schmidt AL, Xu D, Flores Amaro I, Rodriguez-Esteban R, Sarker A, Gonzalez-Hernandez G.: Overview of the 8th Social Media Mining for Health Applications (#SMM4H) shared tasks at the AMIA 2023 Annual Symposium. J Am Med Inform Assoc 31(4): 991-996, Apr 2024 Notes: doi: 10.1093/jamia/ocae010.

161 Lee CR, Aysola J, Chen X, Addisu E, Klein A, Weissenbacher D, Gonzalez-Hernandez G, Weissman GE.: Race and Ethnicity and Clinician Linguistic Expressions of Doubt in Hospital Admission Notes. JAMA Netw Open 7(10): e2438550, Oct 2024 Notes: doi: 10.1001/jamanetworkopen.2024.38550.

12f Sarker A, Klein AZ, Mee J, Harik P, Gonzalez-Hernandez G.: An interpretable natural language processing system for written medical examination assessment. J Biomed Inform 98: 103268, Oct 2019 Notes: DOI: 10.1016/j.jbi.2019.103268.

135 Klein AZ, Magge A, Gonzalez-Hernandez G.: ReportAGE: Automatically extracting the exact age of Twitter users based on self-reports in tweets. PLoS One 17(1): e0262087, Jan 2022 Notes: doi: 10.1371/journal.pone.0262087. eCollection 2022.

153 Klein AZ, Magge A, O'Connor K, Flores Amaro JI, Weissenbacher D, Gonzalez Hernandez G.: Toward Using Twitter for Tracking COVID-19: A Natural Language Processing Pipeline and Exploratory Data Set. J Med Internet Res 23(1): e25314, Jan 2021 Notes: doi: 10.2196/25314.

19c Golder, S., Chiuve, S., Weissenbacher, D., Klein, A., O’Connor, K., Bland, M., Malin, M., Bhattacharya, M., Scarazzini, L.J., & Gonzalez-Hernandez, G.: Pharmacoepidemiologic evaluation of birth defects from health-related postings in social media during pregnancy. Drug Saf 42(3): 389-400, Mar 2019 Notes: DOI: 10.1007/s40264-018-0731-6.

10c Klein AZ, Sarker A, Weissenbacher D, Gonzalez-Hernandez G.: Towards scaling Twitter for digital epidemiology of birth defects. NPJ Digit Med 2: 96, Oct 2019 Notes: doi: 10.1038/s41746-019-0170-5.

2c
7 1d
2c back to top
26 Last updated: 03/19/2026
34 The Trustees of the University of Pennsylvania c
1f
27
24
 
1d
18
1 49 2 2 18