French regression between Stanza 1.8.1 and 1.8.2 #1404
Comments
So, I'm not surprised there are FR changes over time. We created a "combined" FR model to be the default out of four mostly compatible treebanks. There is exactly one line with
Does this dependency look reasonable to you? At any rate, I can rebuild the FR models with the latest versions of the datasets, and perhaps it will improve performance somewhat.
Thanks for your quick reply. Yes Thanks, Bernard
Mmm, unfortunately, the models continue to call it (We could also start with parses for a couple sentences with that pair of words and correct the errors that show up.)
Hello, I am happy to contribute by providing a couple of corrected sentences. What would be the expected format, and which repository should they go to? I also noticed some other regressions after the rebuild with the latest versions of the git data. Is there any way to access the previous versions? Thanks
Well... yes, that's technically possible. They should be in the HuggingFace history for the FR models: https://huggingface.co/stanfordnlp/stanza-fr
That said, the idea behind making the newer models is that other things should work better with the updated data.
If you can come up with some example regression sentences, perhaps the best format would just be text sentences (cut down so they demonstrate the error but aren't 50 words long). I'll run them through our best models, and you can let me know where you spot the errors.
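For readers who want to look through that HuggingFace history programmatically, here is a minimal sketch. It assumes the `huggingface_hub` package is installed; the helper names `list_model_revisions` and `revision_url` are made up for this illustration, and the file path passed to `revision_url` is a placeholder, not a path confirmed in this thread.

```python
REPO_ID = "stanfordnlp/stanza-fr"  # repository linked in the comment above


def list_model_revisions(repo_id=REPO_ID):
    """Return (commit_id, created_at, title) for each commit in the model repo.

    Needs network access. huggingface_hub is imported lazily so that the
    URL helper below can be used without it.
    """
    from huggingface_hub import HfApi

    api = HfApi()
    return [(c.commit_id, c.created_at, c.title)
            for c in api.list_repo_commits(repo_id)]


def revision_url(revision, filename, repo_id=REPO_ID):
    """Build the download URL of one file at one specific revision.

    `filename` is whatever path the file has inside the repo; the caller
    has to look it up, this sketch does not guess it.
    """
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"
```

Calling `list_model_revisions()` should give the commit hashes to choose from. The actual model file names have to be read from the repository's file listing before one of them is passed to `revision_url`.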
Here are some example regression sentences:
Thanks, Bernard
If I put some of these into the "accurate" models with a Transformer, it already does some of these recommendations. I can post some here:
Everything looks good! Thank you
This is what it came up with for ...
but you were saying the Can you suggest one or two other sentences with
yes, sure. Here are a few sentences:
Each of the
I would say that the change
Alright, I put a candidate fake training file here: stanfordnlp/handparsed-treebank@0fac6a8 Any thoughts on these? I also sent them to a former colleague who's worked on French datasets before.
If you find any other regressions, please don't hesitate to send them our way. I can rerun the depparse training with these sentences and see if it helps.
Well... just training on those sentences isn't helping either model get the
Alright, I realized I had mistrained the models with the new dependencies. The new models seem to get
Describe the bug
Take the following sentence: Assurez-vous d'être à l'heure !
The word "vous" gets the wrong dependency relation with Stanza 1.8.2, but the correct one with Stanza 1.8.1.
Stanza 1.8.1:
Stanza 1.8.2:
To Reproduce
Steps to reproduce the behavior: see above
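A minimal sketch of how the comparison could be reproduced, assuming Stanza is installed and the French models have been downloaded (for example with `stanza.download("fr")`). The helper names `parse_rows` and `diff_rows` are made up for this illustration. The idea is to run `parse_rows` once in an environment with 1.8.1 and once with 1.8.2, save the rows, and then diff them.

```python
def parse_rows(text, lang="fr"):
    """Parse `text` and return one (id, text, head, deprel) tuple per word.

    stanza is imported lazily so that diff_rows can be used on saved
    output without the library being installed.
    """
    import stanza

    nlp = stanza.Pipeline(lang, processors="tokenize,mwt,pos,lemma,depparse")
    doc = nlp(text)
    return [(w.id, w.text, w.head, w.deprel)
            for sent in doc.sentences
            for w in sent.words]


def diff_rows(old_rows, new_rows):
    """Return (old, new) pairs for positions where the two analyses differ.

    Rows are compared position by position; if the tokenization differs
    in length, the extra rows of the longer list are ignored.
    """
    return [(o, n) for o, n in zip(old_rows, new_rows) if o != n]
```

For example, `parse_rows("Assurez-vous d'être à l'heure !")` can be run under each version and the two results passed to `diff_rows`; any pair it returns is a word whose head or relation changed between versions.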
Expected behavior
I would expect the same analysis regardless of the version.
Environment (please complete the following information):