1.
Lambert P, Eriksen M. Reward Modeling from Human Feedback Improves Controllability in Large Generative Models. IJAETR. 2026;2(1):13-19. doi:10.54097/z5t42855