Lambert, Pierre, and Maja Eriksen. “Reward Modeling from Human Feedback Improves Controllability in Large Generative Models”. International Journal of Advanced Engineering and Technology Research 2, no. 1 (May 12, 2026): 13–19. Accessed May 29, 2026. https://ijaetr.org/index.php/ojs/article/view/85.