cv
Education
- 2021 – 2025 Ph.D.
  Örebro University & WASP, Sweden
  - Thesis: Neurosymbolic decision-making with LLMs.
  - Supervisors: Luc De Raedt, Pedro Zuidberg Dos Martires.
- 2017 – 2019 M.Tech in Artificial Intelligence
  Indian Institute of Science, Bangalore
  - Thesis: Active learning in sequence tagging.
  - GPA: 8.10/10. Supervisor: Ambedkar Dukkipati.
- 2013 – 2017 B.Tech in Electrical Engineering
  Birsa Institute of Technology, India
  - GPA: 8.03/10.
Research & Professional Experience
- Jul 2025 – Present Postdoctoral Researcher
  FAIR (Meta AI), London, UK
  - AI Research Agents.
- Aug 2024 – Feb 2025 Research Science Intern
  FAIR (Meta AI), London, UK
  - SAM 3; AI Research Agents.
- Jul 2022 – Dec 2022 Research Science Intern
  Meta Reality Labs Research, Redmond, USA
  - Vision and language-based task tracking.
- Apr 2020 – Sep 2020 Data Scientist
  Amazon Alexa-AI, Bangalore, India
  - NLU metrics for Alexa.
- Jun 2019 – Mar 2020 Research Associate
  Statistics & Machine Learning Group, IISc Bangalore
  - Multi-agent reinforcement learning.
Selected Publications
- AIRA2: Overcoming Bottlenecks in AI Research Agents
K Hambardzumyan*, N Baldwin*, E Toledo*, R Hazra*, M Kuchnik*, M Josifoski* et al. (*: equal contribution)
NeurIPS 2026 (under review) [pdf]
- Training AI Co-Scientists Using Rubric Rewards
S Goel, R Hazra, D Jayalath, T Willi, P Jain, WF Shen, I Leontiadis, F Barbieri, Y Bachrach, J Geiping, C Whitehouse
ICML 2026 [pdf] [dataset]
- COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game
A Sygkounas, R Hazra, A Persson, PZD Martires, A Loutfi
GECCO 2026 [pdf]
- SAM 3: Segment Anything with Concepts
N Carion et al.
ICLR 2026 [pdf] [code] [website]
- AI Research Agents for ML: Search, Exploration, and Generalization in MLE-bench
E Toledo, K Hambardzumyan, M Josifoski, R Hazra et al.
NeurIPS 2025 (Spotlight) [pdf] [code]
- LexiCon: Planning under Temporal Constraints in Natural Language
P Mantenoglou, R Hazra, PZD Martires, L De Raedt
NeurIPS 2025 (DB Track) [pdf] [code]
- Have Large Language Models Learned to Reason? A Characterization via 3-SAT
R Hazra, G Venturato, PZD Martires, L De Raedt
COLM 2025 [pdf] [code]
- REvolve: Reward Evolution with Large Language Models using Human Feedback
R Hazra*, A Sygkounas*, A Persson, A Loutfi, PZD Martires
ICLR 2025 [web] [pdf] [code]
- SayCanPay: Heuristic Planning with LLMs using Learnable Domain Knowledge
R Hazra, PZD Martires, L De Raedt
AAAI 2024 [web] [pdf] [code]
- EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
R Hazra, B Chen, A Rai, N Kamra, R Desai
ICCV 2023 [web] [pdf] [code]
- Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach
R Hazra, L De Raedt
ECML PKDD 2023 [pdf]
- Active² Learning: Reducing Redundancies in Active Learning for Sequence Tagging
R Hazra, P Dutta, S Gupta, MA Qaathir, A Dukkipati
NAACL-HLT 2021 [pdf] [code]
- Networked Multi-Agent Reinforcement Learning with Emergent Communication
S Gupta*, R Hazra*, A Dukkipati
AAMAS 2020 [pdf]
- Workshop papers (ViGIL @ NAACL-HLT 2021, IEEE RO-MAN 2024, HRI 2025 LBR) omitted for brevity.
Mentorship
- 2024 – 2025 Master Thesis Supervision
  Jens V Rüppel, TU Chemnitz
  - Co-supervisor: Tim Schreiter
Academic Service
- Program Committee & Session Chair: AAMAS 2022
- Reviewer: NeurIPS 2022 (Top Reviewer), 2023, 2024 (Top Reviewer), 2025 | ICML 2023–25 | ICLR 2024–25 | KR 2024 | EACL 2023 | TMLR
Community
- PRAYAAS India (2013–2016): Taught mathematics to underprivileged children.
- Tarumitra (2011–2013): Student President; led plantation drives and awareness programs.