BSC-LT
/

salamandra-7b-vision

@@ -169,21 +169,23 @@ Diverse thematic data were included to enhance the model's capabilities in subta
 As there is a lack of multimodal multilingual evaluation data, we haven't performed a thorough multilingual evaluation yet (coming soon). The English evaluations are shown in the table below:
-|     Task     |          Subtask        |          Metric         |   Value   |
-|--------------|-------------------------|-------------------------|-----------|
-| ai2d         |                         | exact_match             |    0.7451 |
-| mme          | cognition_score         | mme_cognition_score     |  246.4286 |
-|              | perception_score        | mme_perception_score    | 1371.8164 |
-| mmmu_val     |                         | accuracy                |    0.3689 |
-| mmstar       | average                 | accuracy                |    0.4865 |
-|              | coarse perception       | accuracy                |    0.7127 |
-|              | fine-grained perception | accuracy                |    0.3799 |
-|              | instance reasoning      | accuracy                |    0.5674 |
-|              | logical reasoning       | accuracy                |    0.4478 |
-|              | math                    | accuracy                |    0.4279 |
-|              | science & technology    | accuracy                |    0.3832 |
-| realworldqa  |                         | exact_match             |    0.5699 |
-|mmbench_en_dev|                         | exact_match             |    0.7113 |
 ---

 As there is a lack of multimodal multilingual evaluation data, we haven't performed a thorough multilingual evaluation yet (coming soon). The English evaluations are shown in the table below:
+|     Task       |          Subtask        |          Metric         |   Value   |
+|----------------|-------------------------|-------------------------|-----------|
+| ai2d           |                         | exact_match             |    0.7451 |
+| mme            | cognition_score         | mme_cognition_score     |  246.4286 |
+|                | perception_score        | mme_perception_score    | 1371.8164 |
+| mmmu_val       |                         | accuracy                |    0.3689 |
+| mmstar         | average                 | accuracy                |    0.4865 |
+|                | coarse perception       | accuracy                |    0.7127 |
+|                | fine-grained perception | accuracy                |    0.3799 |
+|                | instance reasoning      | accuracy                |    0.5674 |
+|                | logical reasoning       | accuracy                |    0.4478 |
+|                | math                    | accuracy                |    0.4279 |
+|                | science & technology    | accuracy                |    0.3832 |
+| realworldqa    |                         | exact_match             |    0.5699 |
+| mmbench_en_dev |                         | exact_match             |    0.7113 |
+| docvqa_val     |                         | anls                    |    0.6805 |
+| infovqa_val    |                         | anls                    |    0.4859 |
 ---