feat: add mpnet model family #405

Open · wants to merge 6 commits into main

Conversation

@@ -0,0 +1,458 @@
defmodule Bumblebee.Text.MPNet do
Author:

Note: this was copied from the Bert implementation. A few adjustments in the options, but that was about it.


test ":for_masked_language_modeling" do
assert {:ok, %{model: model, params: params, spec: spec}} =
Bumblebee.load_model({:hf, "hf-internal-testing/tiny-random-MPNetForMaskedLM"})

Comment on lines 23 to 28
assert_all_close(
  outputs.hidden_state[[.., 1..3, 1..3]],
  Nx.tensor([
    [[-0.2331, 1.7817, 1.1736], [-1.1001, 1.3922, -0.3391], [0.0408, 0.8677, -0.0779]]
  ])
)
Member:

We want to compare against the reference values from hf/transformers:

from transformers import MPNetModel
import torch

model = MPNetModel.from_pretrained("hf-internal-testing/tiny-random-MPNetModel")

inputs = {
  "input_ids": torch.tensor([[10, 20, 30, 40, 50, 60, 70, 80, 0, 0]]),
  "attention_mask": torch.tensor([[1, 1, 1, 1, 1, 1, 1, 1, 0, 0]])
}

outputs = model(**inputs)

print(outputs.last_hidden_state.shape)
print(outputs.last_hidden_state[:, 1:4, 1:4])

#=> torch.Size([1, 10, 64])
#=> tensor([[[ 0.0033, -0.2547,  0.4954],
#=>          [-1.5348, -1.5433,  0.4846],
#=>          [ 0.7795, -0.3995, -0.9499]]], grad_fn=<SliceBackward0>)
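
For reference, the Elixir side of that comparison might look roughly like this, reusing the assert_all_close helper and slicing style from the test above (a sketch; Axon.predict is assumed as the forward pass here):

inputs = %{
  "input_ids" => Nx.tensor([[10, 20, 30, 40, 50, 60, 70, 80, 0, 0]]),
  "attention_mask" => Nx.tensor([[1, 1, 1, 1, 1, 1, 1, 1, 0, 0]])
}

outputs = Axon.predict(model, params, inputs)

# Elixir ranges are inclusive, so 1..3 mirrors Python's 1:4 slice
assert_all_close(
  outputs.hidden_state[[.., 1..3, 1..3]],
  Nx.tensor([
    [[0.0033, -0.2547, 0.4954], [-1.5348, -1.5433, 0.4846], [0.7795, -0.3995, -0.9499]]
  ])
)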

I believe there are a few differences between MPNet and BERT, so we need to align the implementation accordingly. In particular, from a quick look some layer names differ, for example key -> k, value -> v, query -> q, so we need to update the layer mapping as well :)
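
Concretely, the rename would show up in the params_mapping/1 implementation, something along these lines (a sketch; the hf-side paths such as attention.attn.q are an assumption based on the renamed layers above, not verified against the checkpoint):

defimpl Bumblebee.HuggingFace.Transformers.Model, for: Bumblebee.Text.MPNet do
  def params_mapping(_spec) do
    %{
      # MPNet uses q/k/v where BERT uses query/key/value
      # (the attention.attn.* paths are assumed)
      "encoder.blocks.{n}.self_attention.query" => "encoder.layer.{n}.attention.attn.q",
      "encoder.blocks.{n}.self_attention.key" => "encoder.layer.{n}.attention.attn.k",
      "encoder.blocks.{n}.self_attention.value" => "encoder.layer.{n}.attention.attn.v"
      # ...the rest mirrors the Bert mapping
    }
  end
end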

Member:

Yes! And the Bert implementation is https://github.com/huggingface/transformers/blob/main/src/transformers/models/bert/modeling_bert.py, which may be helpful for spotting the differences.

Author:

OK, made another round of improvements! Thanks for the direction.

@@ -0,0 +1,458 @@
defmodule Bumblebee.Text.MPNet do
Member:

Nitpick: I think we want the name to be MpNet to align with our naming conventions. Basically, in acronyms we capitalize only the first letter, as in BERT -> Bert, RoBERTa -> Roberta, and in compound names we capitalize each word, as in ResNet, ConvNext. We do this because the reference names are often arbitrarily capitalized, and it's not ergonomic for library users to have to know the exact capitalization.
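
Concretely, that would just be the rename:

# Following the Bert/Roberta convention described above
defmodule Bumblebee.Text.MpNet do
  # ...implementation as in this PR, only the module name changes
end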
