io.github.IgorGanapolsky/rlhf-feedback-loop
编码与调试by igorganapolsky
RLHF feedback loop for AI agents. Capture signals, promote memories, block mistakes, export DPO.
by igorganapolsky
RLHF feedback loop for AI agents. Capture signals, promote memories, block mistakes, export DPO.