bkai-foundation-models/vietnamese-bi-encoder

sangdv committed on Sep 30, 2023

Commit dfb7bc5 · 1 Parent(s): bc7a1e5

Update README.md

Files changed (1)

README.md +3 -2

@@ -25,9 +25,10 @@ license: apache-2.0
 # bkai-foundation-models/vietnamese-bi-encoder
 This is a [sentence-transformers](https://www.SBERT.net) model: It maps sentences & paragraphs to a 768 dimensional dense vector space and can be used for tasks like clustering or semantic search.
 We train the model on a merged training dataset that consists of:
-  - MS Macro (translated in Vietnamese)
-  - SQuAD v2  (translated in Vietnamese)
+  - MS Macro (translated into Vietnamese)
+  - SQuAD v2  (translated into Vietnamese)
   - 80% of the training set from the Legal Text Retrieval Zalo 2021 challenge
 We use [phobert-base-v2](https://github.com/VinAIResearch/PhoBERT) as the pre-trained backbone.
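
The README text in this diff says the model maps sentences to a 768 dimensional dense vector space for semantic search. As a rough illustration of what that enables, the sketch below ranks documents against a query by cosine similarity. It uses hand-made toy vectors rather than real model output, and the helper names (`cosine_similarity`, `rank_documents`, `unit_vector`) are hypothetical, not part of this repository.

```python
import math

EMBEDDING_DIM = 768  # dimensionality stated in the README


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (index, score) pairs sorted by descending similarity."""
    scored = [(i, cosine_similarity(query_vec, d)) for i, d in enumerate(doc_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def unit_vector(axis):
    """Toy stand-in for an embedding: 1.0 on one axis, 0.0 elsewhere."""
    v = [0.0] * EMBEDDING_DIM
    v[axis] = 1.0
    return v


# Toy data (assumption): real vectors would come from the model's encoder.
query = unit_vector(0)
docs = [
    unit_vector(1),                            # orthogonal to the query
    unit_vector(0),                            # identical to the query
    [1.0, 1.0] + [0.0] * (EMBEDDING_DIM - 2),  # partially aligned
]

ranking = rank_documents(query, docs)
for index, score in ranking:
    print(f"doc {index}: {score:.4f}")
```

In practice the vectors would presumably be produced by loading the model with the sentence-transformers library, for example `SentenceTransformer("bkai-foundation-models/vietnamese-bi-encoder").encode(sentences)`. That call is the library's usual loading pattern and does not appear in this diff.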