# bkai-foundation-models/vietnamese-bi-encoder

This is a [sentence-transformers](https://www.SBERT.net) model: it maps sentences and paragraphs to a 768-dimensional dense vector space and can be used for tasks like clustering or semantic search.
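
Below is a minimal usage sketch (not part of the original card) showing how a sentence-transformers bi-encoder like this one is typically loaded and used for semantic search. The example sentences are illustrative, and PhoBERT-based models generally expect word-segmented Vietnamese input, which is omitted here for brevity.

```python
# Minimal usage sketch (assumption: standard sentence-transformers API;
# example sentences are illustrative and not word-segmented).
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("bkai-foundation-models/vietnamese-bi-encoder")

query = "Trường nào đào tạo công nghệ thông tin?"               # "Which school teaches IT?"
corpus = [
    "Đại học Bách khoa Hà Nội đào tạo công nghệ thông tin.",    # relevant
    "Hôm nay trời mưa rất to.",                                 # unrelated
]

# Each embedding is a 768-dimensional dense vector.
query_emb = model.encode(query, convert_to_tensor=True)
corpus_emb = model.encode(corpus, convert_to_tensor=True)

# Rank corpus sentences by cosine similarity to the query.
scores = util.cos_sim(query_emb, corpus_emb)[0]
for sentence, score in sorted(zip(corpus, scores.tolist()), key=lambda x: -x[1]):
    print(f"{score:.4f}  {sentence}")
```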

We train the model on a merged training dataset consisting of:

- MS MARCO (translated into Vietnamese)
- SQuAD v2.0 (translated into Vietnamese)
- 80% of the training set from the Legal Text Retrieval Zalo 2021 challenge

We use PhoBERT-base-v2 as the pre-trained backbone.

Here are the results on the remaining 20% of the training set from the Legal Text Retrieval Zalo 2021 challenge:

| Pretrained Model | Training Datasets | Acc@1 | Acc@10 | Acc@100 | Pre@10 | MRR@10 |
|------------------|-------------------|:-----:|:------:|:-------:|:------:|:------:|
| [Vietnamese-SBERT](https://huggingface.co/keepitreal/vietnamese-sbert) | - | 32.34 | 52.97 | 89.84 | 7.05 | 45.30 |
| | MS MARCO | 54.06 | 84.69 | 93.75 | 8.33 | 64.56 |
| PhoBERT-base-v2 | MS MARCO | 47.81 | 77.19 | 92.34 | 7.72 | 58.37 |
| | MS MARCO + SQuAD v2.0 + 80% Zalo | 73.28 | 93.59 | 98.85 | 9.36 | 80.73 |
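
For reference, Acc@k is the fraction of queries whose relevant article appears in the top-k retrieved results, and MRR@10 is the mean reciprocal rank of the first relevant result within the top 10. The sketch below is an illustrative assumption (not the authors' evaluation code) of how these two metrics can be computed from ranked retrieval output with one relevant document per query.

```python
# Illustrative sketch (assumption, not the original evaluation script):
# compute Acc@k and MRR@k given one relevant document id per query.
from typing import Dict, List

def acc_at_k(ranked: Dict[str, List[str]], relevant: Dict[str, str], k: int) -> float:
    """Fraction of queries whose relevant doc id appears in the top-k ranking."""
    hits = sum(1 for q, docs in ranked.items() if relevant[q] in docs[:k])
    return hits / len(ranked)

def mrr_at_k(ranked: Dict[str, List[str]], relevant: Dict[str, str], k: int = 10) -> float:
    """Mean reciprocal rank of the relevant doc id, counting only the top-k results."""
    total = 0.0
    for q, docs in ranked.items():
        topk = docs[:k]
        if relevant[q] in topk:
            total += 1.0 / (topk.index(relevant[q]) + 1)
    return total / len(ranked)

# Toy example: two queries, each with one relevant document.
ranked = {"q1": ["d3", "d7", "d1"], "q2": ["d9", "d2", "d4"]}
relevant = {"q1": "d1", "q2": "d9"}
print(acc_at_k(ranked, relevant, k=1))   # 0.5  (only q2 hits at rank 1)
print(mrr_at_k(ranked, relevant, k=10))  # (1/3 + 1/1) / 2 ≈ 0.667
```
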
<!--- Describe your model here -->