Shayekh Bin Islam

my.jpg

I am broadly interested in natural language processing and machine learning. My research involves complex reasoning, language agents, preference optimization and parameter-efficient systems for real-world problems using deep neural networks.

I have been collaborating with Cohere Labs (Prev. Cohere For AI), Allen Institute for AI, and Salesforce Research which led to publications in ACL’25 (Main, first-author), EMNLP’24 (Findings, first-author), ICLR’25 (Spotlight, co-author) and a submission in NeurIPS’25. During the Fatima Fellowship 2023 program, I was fortunate to work with Mohamed El Banani from the University of Michigan and World Labs. I completed my Bachelor’s in Computer Science at Bangladesh University of Engineering and Technology (BUET).

News

May 15, 2025 M-RewardBench is accepted by ACL 2025 Main! 🥳📄✨
Apr 09, 2025 Kaleidoscope is now available on arXiv.
Feb 12, 2025 INCLUDE is accepted as a ✨Spotlight✨ at ICLR 2025.
Nov 11, 2024 Attending EMNLP 2024 at Miami, Florida 🌴 Presenting Open-RAG.
Oct 24, 2024 Will be serving as a reviewer in COLING 2025.

Selected Publications

  1. EMNLP
    rag2024.jpg
    Open-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models
    Shayekh Bin Islam*, Md Asib Rahman*, K S M Tozammel Hossain, Enamul Hoque, Shafiq Joty, and Md Rizwan Parvez
    EMNLP Findings, Nov 2024
  2. ACL 2025
    mrewardbench.png
    M-RewardBench: Evaluating Reward Models in Multilingual Settings
    Srishti Gureja*, Lester James V Miranda*Shayekh Bin Islam*, Rishabh Maheshwary*, Drishti Sharma, Gusti Winata, Nathan Lambert, Sebastian Ruder, Sara Hooker, and Marzieh Fadaee
    arXiv preprint arXiv:2410.15522, Nov 2024
  3. In Submission
    kaleidoscope.png
    Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation
    Israfel Salazar*, Manuel Fernández Burda*Shayekh Bin Islam*, Arshia Soltani Moakhar*, Shivalika Singh*, Fabian Farestam*, Angelika Romanou*, Danylo Boiko, Dipika Khullar, Mike Zhang, and 35 more authors
    arXiv preprint arXiv:2504.07072, Nov 2025
  4. ICLR
    include.png
    INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
    Angelika Romanou, Negar Foroutan, Anna Sotnikova, Zeming Chen, Sree Harsha Nelaturu, Shivalika Singh, Rishabh Maheshwary, Micol Altomare, Mohamed A. Haggag, Snegha A, and 49 more authors
    ICLR, Nov 2025
  5. Preprint
    mmeval.png
    MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
    Guijin Son, Dongkeun Yoon, Juyoung Suk, Javier Aula-Blasco, Mano Aslan, Vu Trong Kim, Shayekh Bin Islam, Jaume Prats-Cristià, Lucı́a Tormo-Bañuelos, and Seungone Kim
    arXiv preprint arXiv:2410.17578, Nov 2024