Nobin Sarwar

Graduate Research Assistant @ UMBC

sarwar.png

I am a second-year Computer Science Ph.D. student in the Language Understanding Lab at the University of Maryland, Baltimore County, advised by Prof. Francis Ferraro.

My current research studies post-training optimization and system-level methods for LLMs and multimodal foundation models, with a focus on agentic reasoning and controllable adaptation. Ongoing work spans

  • Reasoning reliability and verification: retrieval-grounded structured inference and agentic retrieve-verify workflows for scientific claim and feasibility assessment, hallucination mitigation in multimodal QA (FilterRAG)

  • Privacy-preserving and controllable model adaptation: federated fine-tuning of LLMs with Differential Privacy (FedMentor, FedMentalCare), targeted multimodal unlearning (Multimodal Unlearning Survey)

Previously, I earned my MS in Computer Science from the University of Texas Rio Grande Valley, where I worked on privacy-preserving Federated Learning for biometrics and Differential Privacy, contributing to projects that received NSF funding.

Research News (View All)

May 14, 2026 ๐Ÿฅ Our paper Croissant Baker is now available on arXiv, with code released on GitHub.
Apr 06, 2026 ๐ŸŽ‰ Our Multimodal Unlearning Survey has been accepted as Findings of ACL 2026.
Feb 28, 2026 ๐Ÿš€ Released the repository and project page for our survey paper on Multimodal Unlearning.
Feb 01, 2026 ๐Ÿ›๏ธ Awesome Academic NLP Research Labs Worldwide released โ€” a curated list of academic NLP research labs worldwide.
Jan 21, 2026 ๐Ÿ“ข Our paper Multimodal Unlearning Survey has been online.

Publications Spotlight

Full publication list on Google Scholar โ†’

  1. Preprintโ€™26
    croissant_baker.png
    Croissant Baker: Metadata Generation for Discoverable, Governable, and Reusable ML Datasets
    Rafi Al Attrach, Rajna Fani, Sebastian Lobentanzer, Joan Giner-Miguelez, Debanshu Das, Varuni H. K., Nobin Sarwar, and 13 more authors
    arXiv preprint arXiv:2605.15079
  2. Coming Soon
    croissant_miner.png
    CroissantMiner: Automated Extraction and Validation of Croissant Metadata for ML Datasets
    Berke Arda, Mubashara Akhtar, Ahmetcan Yavuz, Paul Gerry, Sebastian Lobentanzer, Nobin Sarwar, Joan Giner-Miguelez, and 3 more authors
    Coming soon
  3. ACLโ€™26
    multimodal_unlearning_survey.jpg
    Multimodal Unlearning Across Vision, Language, Video, and Audio: Survey of Methods, Datasets, and Benchmarks
    In Findings of ACL
  4. NeurIPS-Wโ€™25
    fedmentor.png
    FedMentor: Domain-Aware Differential Privacy for Heterogeneous Federated LLMs in Mental Health
    Nobin Sarwar and Shubhashis Roy Dipta
    In GenAI4Health Workshop, NeurIPS
  5. ICCV-Wโ€™25
    filterrag.png
    FilterRAG: Zero-Shot Informed Retrieval-Augmented Generation to Mitigate Hallucinations in VQA
    Nobin Sarwar
    In T2FM Workshop, ICCV