Publications

Full publication list on Google Scholar

2026

  1. Coming Soon
    CroissantMiner: Automated Extraction and Validation of Croissant Metadata for ML Datasets
    Berke Arda, Mubashara Akhtar, Ahmetcan Yavuz, Paul Gerry, Sebastian Lobentanzer, Nobin Sarwar, Joan Giner-Miguelez, and 3 more authors
    Coming soon
  2. Preprint’26
    Croissant Baker: Metadata Generation for Discoverable, Governable, and Reusable ML Datasets
    Rafi Al Attrach, Rajna Fani, Sebastian Lobentanzer, Joan Giner-Miguelez, Debanshu Das, Varuni H. K., Nobin Sarwar, and 13 more authors
    arXiv preprint arXiv:2605.15079
  3. ACL’26
    Multimodal Unlearning Across Vision, Language, Video, and Audio: Survey of Methods, Datasets, and Benchmarks
    In Findings of ACL

2025

  1. NeurIPS-W’25
    FedMentor: Domain-Aware Differential Privacy for Heterogeneous Federated LLMs in Mental Health
    Nobin Sarwar and Shubhashis Roy Dipta
    In GenAI4Health Workshop, NeurIPS
  2. ICCV-W’25
    FilterRAG: Zero-Shot Informed Retrieval-Augmented Generation to Mitigate Hallucinations in VQA
    Nobin Sarwar
    In T2FM Workshop, ICCV