## Usage

### Distilling a Model2Vec model

<details>
<summary> Distilling from a Sentence Transformer </summary>

The following example distills a model from the output embeddings of a Sentence Transformer. As mentioned above, this leads to a really small model that might be less performant.
```python
from model2vec.distill import distill

# The Sentence Transformer to distill; the same model name is used in the CLI example below
model_name = "BAAI/bge-base-en-v1.5"

# Distill the model, reducing the embeddings to 256 dimensions with PCA
m2v_model = distill(model_name=model_name, pca_dims=256)

# Save the distilled model
m2v_model.save_pretrained("m2v_model")
```
</details>

<details>
<summary> Distilling from a loaded model </summary>

If you already have a model loaded, or need to load a model in some special way, we also offer an interface to distill models in memory.

```python
from transformers import AutoModel, AutoTokenizer

from model2vec.distill import distill_from_model

# Load a model and tokenizer in whatever way you need
model_name = "BAAI/bge-base-en-v1.5"
model = AutoModel.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Distill the already-loaded model in memory
m2v_model = distill_from_model(model=model, tokenizer=tokenizer, pca_dims=256)

# Save the distilled model
m2v_model.save_pretrained("m2v_model")
```

</details>

<details>
<summary> Distilling with a custom vocabulary </summary>

If you pass a vocabulary, you get a set of static word embeddings, together with a custom tokenizer for exactly that vocabulary. This is comparable to how you would use GloVe or traditional word2vec, but doesn't actually require a corpus or data.
```python
from model2vec.distill import distill

# Load a vocabulary as a list of strings
vocabulary = ["word1", "word2", "word3"]

# Distill a Sentence Transformer model using the custom vocabulary
m2v_model = distill(model_name="BAAI/bge-base-en-v1.5", vocabulary=vocabulary)

# Save the model
m2v_model.save_pretrained("m2v_model")

# Or push it to the HuggingFace hub
m2v_model.push_to_hub("my_organization/my_model", token="<it's a secret to everybody>")
```

**Important note:** we assume the passed vocabulary is sorted by rank frequency, i.e., we don't care about the actual word frequencies, but we do assume that the most frequent word comes first and the least frequent word comes last. If you're not sure whether this is the case, set `apply_zipf` to `False`. This disables the frequency weighting, but will also make performance a little bit worse.
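
As a minimal sketch, assuming an unsorted word list (the words here are purely illustrative), disabling the weighting looks like this:

```python
from model2vec.distill import distill

# An unsorted vocabulary, so rank-frequency order can't be assumed
vocabulary = ["zebra", "apple", "the"]

# Disable the Zipf weighting since the vocabulary isn't sorted by frequency
m2v_model = distill(
    model_name="BAAI/bge-base-en-v1.5",
    vocabulary=vocabulary,
    apply_zipf=False,
)
```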

</details>

<details>
<summary> Distilling via CLI </summary>

We also provide a command line interface for distillation. Note that `vocab.txt` should be a file with one word per line.
```bash
python3 -m model2vec.distill --model-name BAAI/bge-base-en-v1.5 --vocabulary-path vocab.txt --device mps --save-path model2vec_model
```
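
If you need to build `vocab.txt` yourself, here is a minimal sketch (assuming a plain-text `corpus.txt`; both filenames are just placeholders) that writes one word per line, most frequent first, to match the rank-frequency assumption above:

```python
from collections import Counter

# Count whitespace-separated tokens in a plain-text corpus
with open("corpus.txt", encoding="utf-8") as f:
    counts = Counter(f.read().split())

# Write one word per line, most frequent first
with open("vocab.txt", "w", encoding="utf-8") as f:
    for word, _ in counts.most_common():
        f.write(word + "\n")
```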

</details>

### Inference with a Model2Vec model

<details>
<summary> Inference with a pretrained model </summary>

Inference works as follows. The example below uses one of our own models, but you can also load a local model, or any other Model2Vec model from the hub.
```python
from model2vec import StaticModel

# Load a model from the HuggingFace hub, in this case one of our own models
model = StaticModel.from_pretrained("minishlab/M2V_base_output")

# Make embeddings
embeddings = model.encode(["It's dangerous to go alone!", "It's a secret to everybody."])

# Make sequences of token embeddings
token_embeddings = model.encode_as_sequence(["It's dangerous to go alone!", "It's a secret to everybody."])
```
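
Loading a locally saved model works the same way; a short sketch, assuming the `m2v_model` directory produced by the distillation examples above:

```python
from model2vec import StaticModel

# Load the model from a local directory instead of the hub
model = StaticModel.from_pretrained("m2v_model")

embeddings = model.encode(["It's dangerous to go alone!"])
```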
</details>

### Evaluating a Model2Vec model

Our models can be evaluated using our [evaluation package](https://github.com/MinishLab/evaluation).