Skip to content

Note¶

Paper Reading¶

Reinforcement Learning
Language Modeling
- Woodpecker: Hallucination Correction for Multimodal Large Language Models

Book Reading¶

Reinforcement Learning: An Introduction (Richard S. Sutton and Andrew G. Barto)

Last update: November 2, 2023