Ari Zachary Klein, PhD

faculty photo
Senior Data Analyst, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA

Contact information
Blockley Hall, 4th Fl.
423 Guardian Dr.
Philadelphia, PA 19104
Education:
BA (Creative Writing/Philosophy)
Carnegie Mellon University, Pittsburgh, PA, 2008.
MA (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2010.
PhD (Rhetoric)
Carnegie Mellon University, Pittsburgh, PA, 2016.
Permanent link
 
> Perelman School of Medicine   > Faculty   > Details

Selected Publications

Xu D, Lopez-Garcia G, O’Connor K, Holston H, Klein AZ, Flores Amaro I, Scotch M, Gonzalez-Hernandez G.: Mining social media data for influenza vaccine effectiveness using a large language model and chain-of-thought prompting. Proceedings of the American Medical Informatics Association Annual Symposium 2026 Notes: Accepted for publication.

Klein AZ, Spiegel K, Bauermeister JA, Gonzalez-Hernandez G.: Health-Related Concerns of Anti-LGBTQ+ Legislation: Thematic Analysis Using Social Media Data. JMIR Infodemiology 5: e68956, Sep 2025 Notes: doi: 10.2196/68956.

Klein AZ, Kunatharaju S, Golder S, Levine LD, Figueiredo JC, Gonzalez-Hernandez G.: Association Between COVID-19 During Pregnancy and Preterm Birth by Trimester of Infection: Retrospective Cohort Study Using Large-Scale Social Media Data. J Med Internet Res 27: e66097, Jul 2025 Notes: doi: 10.2196/66097.

Klein, A.Z., Dasgupta, T., Gryboski, L., Jana, S., Khademi, S., Lopez-Garcia, G., Mazzotti, D., Onishi, T., Powell, J., Raithel, L., Rajwal, S., Roller, R., Sarker, A., Sinha, M., Thomas, P., Tutubalina, E., Xu, D., Zweigenbaum, P., & Gonzalez-Hernandez, G.: Overview of the 10th Social Media Mining for Health (#SMM4H) and Health Real-World Data (HeaRD) shared tasks at ICWSM 2025. Proceedings of the 10th Social Media Mining for Health (#SMM4H) and Health Real-World Data (HeaRD) Workshop and Shared Tasks, AAAI AAAI, Jun 2025 Notes: DOI: 10.36190/2025.55.

Feng, Y., Hou, B., Klein, A., O’Connor, K., Chen, J., Mondragón, A., Yang, S., Gonzalez-Hernandez, G., & Shen, L.: Analyzing dementia caregivers’ experiences on Twitter: A term-weighted topic modeling approach Proceedings of the American Medical Informatics Association Annual Symposium 2024: 407-416, May 2025.

Klein AZ, Weissenbacher D, O'Connor K, Elyaderani A, Amaro IF, Onishi T, Golder S, Spiegel K, Scotch M, Gonzalez-Hernandez G.: Detection of patient metadata in published articles for genomic epidemiology using machine learning and large language models. medRxiv Page: 25326298, Apr 2025 Notes: doi: 10.1101/2025.04.25.25326298.

Xu D, García GL, O'Connor K, Holston H, Klein AZ, Amaro IF, Scotch M, Gonzalez-Hernandez G.: Mining Social Media Data for Influenza Vaccine Effectiveness Using a Large Language Model and Chain-of-Thought Prompting. medRxiv Mar 2025 Notes: doi: 10.1101/2025.03.26.25324701.

Thanawala SU, Klein A, Raval K, Amaro JIF, Beveridge CA, Muir AB, Falk GW, Gonzalez-Hernandez G, Lynch KL.: Exploring X: barriers to care for eosinophilic esophagitis. Dis Esophagus 38(1): doae043, Jan 2025 Notes: doi: 10.1093/dote/doae043.

Feng, Y., Hou, B., Klein, A., O’Connor, K., Chen, J., Mondragón, A., Yang, S., Gonzalez-Hernandez, G., & Shen, L.: Exploring semantic topics in dementia caregiver tweets. Alzheimers Dement 20(Suppl4): e093035, Jan 2025 Notes: doi: 10.1002/alz.093035.

He, W., Hou, B., Zheng, A., Feng, Y., Klein, A., O’Connor, K., Yang, S., Shang, T., Demiris, G., Gonzalez-Hernandez, G., & Shen, L. : Advanced topic modeling with large language models: Analyzing social media content from dementia caregivers. Innov Aging 2025 Notes: Accepted for publication.

back to top
Last updated: 01/13/2026
The Trustees of the University of Pennsylvania