openbmb/MiniCPM-o-2_6

finalf0 committed 18 days ago

Commit 41a38b3 · 1 Parent(s): 2ea48c8

update readme

Files changed (1)

README.md +10 -13

@@ -73,10 +73,9 @@ MiniCPM-o 2.6 can be easily used in various ways: (1) [llama.cpp](https://github
     <img src="https://github.com/OpenBMB/MiniCPM-o/raw/main/assets/radar.jpg" width=90% />
 </div>
-<details>
-<summary>Click to view visual understanding results.</summary>
-**Image Understanding**
+#### Visual understanding results
+**Image Understanding:**
 <div align="center">
 <table style="margin: 0px auto;">
@@ -394,8 +393,10 @@ MiniCPM-o 2.6 can be easily used in various ways: (1) [llama.cpp](https://github
 Note: For proprietary models, we calculate token density based on the image encoding charging strategy defined in the official API documentation, which provides an upper-bound estimation.
-**Multi-image and Video Understanding**
+**Multi-image and Video Understanding:**
+<details>
+<summary>click to view</summary>
 <div align="center">
 <table style="margin: 0px auto;">
@@ -497,10 +498,9 @@ Note: For proprietary models, we calculate token density based on the image enco
 </details>
-<details>
-<summary>Click to view audio understanding and speech conversation results.</summary>
-**Audio Understanding**
+#### Audio understanding and speech conversation results.
+**Audio Understanding:**
 <div align="center">
 <table style="margin: 0px auto;">
@@ -624,7 +624,7 @@ Note: For proprietary models, we calculate token density based on the image enco
 </div>
 * We evaluate officially released checkpoints by ourselves.<br><br>
-**Speech Generation**
+**Speech Generation:**
 <div align="center">
 <table style="margin: 0px auto;">
@@ -790,12 +790,10 @@ All results are from AudioEvals, and the evaluation methods along with further d
 </table>
 </div>
-</details>
-<details>
-<summary>Click to view multimodal live streaming results.</summary>
-**Multimodal Live Streaming**: results on StreamingBench
+#### Multimodal live streaming results.
+**Multimodal Live Streaming:** results on StreamingBench
 <table style="margin: 0px auto;">
     <thead>
@@ -922,7 +920,6 @@ All results are from AudioEvals, and the evaluation methods along with further d
     </tbody>
 </table>
-</details>
 ### Examples <!-- omit in toc -->