cv | Rishi Hazra

Education

2021 – 2025
Ph.D.

Örebro University & WASP, Sweden
- Thesis: Neurosymbolic decision-making with LLMs.
- Supervisors: Luc De Raedt, Pedro Zuidberg Dos Martires.
2017 – 2019
M.Tech in Artificial Intelligence

Indian Institute of Science, Bangalore
- Thesis: Active learning in sequence tagging.
- GPA: 8.10/10. Supervisor: Ambedkar Dukkipati.
2013 – 2017
B.Tech in Electrical Engineering

Birsa Institute of Technology, India
- GPA: 8.03/10.

Research & Professional Experience

Jul 2025 – Present
Postdoctoral Researcher

FAIR (Meta AI), London, UK
- AI Research Agents.
Aug 2024 – Feb 2025
Research Science Intern

FAIR (Meta AI), London, UK
- SAM 3; AI Research Agents.
Jul 2022 – Dec 2022
Research Science Intern

Meta Reality Labs Research, Redmond, USA
- Vision and language-based task tracking.
Apr 2020 – Sep 2020
Data Scientist

Amazon Alexa-AI, Bangalore, India
- NLU metrics for Alexa.
Jun 2019 – Mar 2020
Research Associate

Statistics & Machine Learning Group, IISc Bangalore
- Multi-agent reinforcement learning.

Selected Publications

AIRA₂: Overcoming Bottlenecks in AI Research Agents
K Hambardzumyan*, N Baldwin*, E Toledo*, R Hazra*, M Kuchnik*, M Josifosky* et al. (*: equal contribution)
NeurIPS 2026 (under review) [pdf]
Training AI Co-Scientists Using Rubric Rewards
S Goel, R Hazra, D Jayalath, T Willi, P Jain, WF Shen, I Leontiadis, F Barbieri, Y Bachrach, J Geiping, C Whitehouse
ICML 2026 [pdf] [dataset]
COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game
A Sygkounas, R Hazra, A Persson, PZD Martires, A Loutfi
GECCO 2026 [pdf]
SAM 3: Segment Anything with Concepts
N Carion et al.
ICLR 2026 [pdf] [code] [website]
AI Research Agents for ML: Search, Exploration, and Generalization in MLE-bench
E Toledo, K Hambardzumyan, M Josifoski, R Hazra et al.
NeurIPS 2025 (Spotlight) [pdf] [code]
LexiCon: Planning under Temporal Constraints in Natural Language
P Mantenoglou, R Hazra, PZD Martires, L De Raedt
NeurIPS 2025 (DB Track) [pdf] [code]
Have Large Language Models Learned to Reason? A Characterization via 3-SAT
R Hazra, G Venturato, PZD Martires, L De Raedt
COLM 2025 [pdf] [code]
REvolve: Reward Evolution with Large Language Models using Human Feedback
R Hazra*, A Sygkounas*, A Persson, A Loutfi, PZD Martires
ICLR 2025 [web] [pdf] [code]
SayCanPay: Heuristic Planning with LLMs using Learnable Domain Knowledge
R Hazra, PZD Martires, L De Raedt
AAAI 2024 [web] [pdf] [code]
EgoTV: Egocentric Task Verification from Natural Language Task Descriptions
R Hazra, B Chen, A Rai, N Kamra, R Desai
ICCV 2023 [web] [pdf] [code]
Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach
R Hazra, L De Raedt
ECML PKDD 2023 [pdf]
Active² Learning: Reducing Redundancies in Active Learning for Sequence Tagging
R Hazra, P Dutta, S Gupta, MA Qaathir, A Dukkipati
NAACL-HLT 2021 [pdf] [code]
Networked Multi-Agent Reinforcement Learning with Emergent Communication
S Gupta*, R Hazra*, A Dukkipati
AAMAS 2020 [pdf]
Workshop papers (ViGIL @ NAACL-HLT 2021, IEEE RO-MAN 2024, HRI 2025 LBR) omitted for brevity.

Mentorship

2024 – 2025
Master Thesis Supervision

Jens V Rüppel, TU Chemnitz
- Co-supervisor: Tim Schreiter

Academic Service

Program Committee & Session Chair: AAMAS 2022
Reviewer: NeurIPS 2022 (Top), 2023, 2024 (Top), 2025 | ICML 2023–25 | ICLR 2024–25 | KR 2024 | EACL 2023 | TMLR

Community

PRAYAAS India (2013–2016): Taught mathematics to underprivileged children.
Tarumitra (2011–2013): Student President; led plantation drives and awareness programs.

Education

Ph.D.

M.Tech in Artificial Intelligence

B.Tech in Electrical Engineering

Research & Professional Experience

Postdoctoral Researcher

Research Science Intern

Research Science Intern

Data Scientist

Research Associate

Selected Publications

Mentorship

Master Thesis Supervision

Academic Service

Community