Ari Zachary Klein, PhD

Research Assistant Professor of Biostatistics and Epidemiology
Senior Data Analyst, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA
Department: Biostatistics and Epidemiology

Contact information
Blockley Hall, 4th Fl.
423 Guardian Dr.
Philadelphia, PA 19104
Education:
BA (Creative Writing/Philosophy)
Carnegie Mellon University, Pittsburgh, PA, 2008.
MA (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2010.
PhD (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2016.

Selected Publications

Xu D, Lopez-Garcia G, O’Connor K, Holston H, Klein AZ, Flores Amaro I, Scotch M, Gonzalez-Hernandez G.: Mining social media data for influenza vaccine effectiveness using a large language model and chain-of-thought prompting. Proceedings of the American Medical Informatics Association Annual Symposium 2026 Notes: Accepted for publication.

Klein AZ, Spiegel K, Bauermeister JA, Gonzalez-Hernandez G.: Health-Related Concerns of Anti-LGBTQ+ Legislation: Thematic Analysis Using Social Media Data. JMIR Infodemiology 5: e68956, Sep 2025 Notes: doi: 10.2196/68956.

Klein AZ, Kunatharaju S, Golder S, Levine LD, Figueiredo JC, Gonzalez-Hernandez G.: Association Between COVID-19 During Pregnancy and Preterm Birth by Trimester of Infection: Retrospective Cohort Study Using Large-Scale Social Media Data. J Med Internet Res 27: e66097, Jul 2025 Notes: doi: 10.2196/66097.

Klein, A.Z., Dasgupta, T., Gryboski, L., Jana, S., Khademi, S., Lopez-Garcia, G., Mazzotti, D., Onishi, T., Powell, J., Raithel, L., Rajwal, S., Roller, R., Sarker, A., Sinha, M., Thomas, P., Tutubalina, E., Xu, D., Zweigenbaum, P., & Gonzalez-Hernandez, G.: Overview of the 10th Social Media Mining for Health (#SMM4H) and Health Real-World Data (HeaRD) shared tasks at ICWSM 2025. Proceedings of the 10th Social Media Mining for Health (#SMM4H) and Health Real-World Data (HeaRD) Workshop and Shared Tasks, AAAI, Jun 2025 Notes: DOI: 10.36190/2025.55.

Feng, Y., Hou, B., Klein, A., O’Connor, K., Chen, J., Mondragón, A., Yang, S., Gonzalez-Hernandez, G., & Shen, L.: Analyzing dementia caregivers’ experiences on Twitter: A term-weighted topic modeling approach. Proceedings of the American Medical Informatics Association Annual Symposium 2024: 407-416, May 2025.

Klein AZ, Weissenbacher D, O'Connor K, Elyaderani A, Amaro IF, Onishi T, Golder S, Spiegel K, Scotch M, Gonzalez-Hernandez G.: Detection of patient metadata in published articles for genomic epidemiology using machine learning and large language models. medRxiv Page: 25326298, Apr 2025 Notes: doi: 10.1101/2025.04.25.25326298.

Xu D, García GL, O'Connor K, Holston H, Klein AZ, Amaro IF, Scotch M, Gonzalez-Hernandez G.: Mining Social Media Data for Influenza Vaccine Effectiveness Using a Large Language Model and Chain-of-Thought Prompting. medRxiv Mar 2025 Notes: doi: 10.1101/2025.03.26.25324701.

Thanawala SU, Klein A, Raval K, Amaro JIF, Beveridge CA, Muir AB, Falk GW, Gonzalez-Hernandez G, Lynch KL.: Exploring X: barriers to care for eosinophilic esophagitis. Dis Esophagus 38(1): doae043, Jan 2025 Notes: doi: 10.1093/dote/doae043.

Feng, Y., Hou, B., Klein, A., O’Connor, K., Chen, J., Mondragón, A., Yang, S., Gonzalez-Hernandez, G., & Shen, L.: Exploring semantic topics in dementia caregiver tweets. Alzheimers Dement 20(Suppl4): e093035, Jan 2025 Notes: doi: 10.1002/alz.093035.

He, W., Hou, B., Zheng, A., Feng, Y., Klein, A., O’Connor, K., Yang, S., Shang, T., Demiris, G., Gonzalez-Hernandez, G., & Shen, L.: Advanced topic modeling with large language models: Analyzing social media content from dementia caregivers. Innov Aging 2025 Notes: Accepted for publication.

Last updated: 01/13/2026
The Trustees of the University of Pennsylvania