Skip to content

VIRAL: Vision-grounded Integration for Reward design And Learning

▶️ Watch the Demo Video

Click to watch the VIRAL demo on YouTube

Click the image above to watch our demo video on YouTube.


🚀 Overview

VIRAL (Vision-grounded Integration for Reward design And Learning) provides a new approach to reward engineering, inspired by DREFUN-V. This project investigates how VideoLLMs (Video Large Language Models) can be used to better align reward functions with task objectives in RL settings.

VIRAL Framework Overview

For more details, see our paper (PDF) or ArXiv preprint.


📁 Repository Contents


📖 Get Started

For installation instructions, usage examples, and detailed documentation, please visit our project website.


📚 Learn More


🙏 Acknowledgements

This project was developed as part of the course on Large Language Models at UCBL1, under the guidance of Bruno YUN.


Questions or suggestions?
Feel free to open an issue.