Abstract
Implicit Human-in-the-Loop Reinforcement Learning (HITL-RL) is a methodology that integrates passive human feedback into autonomous agent training while minimizing human workload. However, existing methods often rely on active instruction, requiring participants to teach an agent through unnatural expressions or gestures. We introduce NEURO-LOOP, an implicit feedback framework that utilizes the intrinsic human reward system to drive human-agent interaction. This work demonstrates the feasibility of a critical first step in the NEURO-LOOP framework: mapping brain signals to agent performance. Using functional near-infrared spectroscopy (fNIRS), we design a dataset to enable future research using passive Brain-Computer Interfaces for Human-in-the-Loop Reinforcement Learning. Participants are instructed to observe or guide a reinforcement learning agent in its environment while signals from the prefrontal cortex are collected. Using classical machine learning techniques, we conclude that a relationship exists between fNIRS data and agent performance. Finally, we highlight the potential that neural interfaces may offer to future applications of human-agent interaction, assistive AI, and adaptive autonomous systems.