--- base_model: - OpenGVLab/InternVL2_5-8B datasets: - Cierra0506/MM-K12 language: - en - zh license: mit pipeline_tag: image-text-to-text library_name: transformers --- # MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision [\[📂 GitHub\]](https://github.com/ModalMinds/MM-PRM) [\[📜 Paper\]](https://arxiv.org/abs/2505.13427) ## Citation If you find this project useful in your research, please consider citing: ```BibTeX @article{du2025mmprm, title={MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision}, author={Lingxiao Du and Fanqing Meng and Zongkai Liu and Zhixiang Zhou and Ping Luo and Qiaosheng Zhang and Wenqi Shao}, year={2025}, journal={arXiv preprint arXiv:2505.13427}, } ```