We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
您好,W2NER模型如何load中文不连续实体呢?就像原文中的CADEC数据集一样的格式,以所有字符的下标数组作为实体位置标记:
{ "sentence": ["For", "all", "of", "you", "who", "now", "have", "extremely", "low", "LDL", "and", "a", "bad", "case", "of", "joint", "pain", "to", "the", "extent", "that", "it", "is", "very", "arthritic", "or", "having", "bad", "muscle", "cramps", "that", "you", "never", "got", "prior", "to", "the", "drug", ",", "it", "is", "from", "the", "statins", "."], "ner": [{ "index": [15, 16, 23, 24], "type": "ADR" }, { "index": [28, 29], "type": "ADR" }] }
The text was updated successfully, but these errors were encountered:
不好意思,最近在搞大模型的一些东西,没顾上看这块的内容,请问你这边已经搞定了吗?
Sorry, something went wrong.
我参考别人对W2NER源码的修改和注释改了一下,能在论文源码的基础上load和预测上面给出的不连续实体了,供您参考https://github.com/Bureaux-Tao/discontinuous-ner/blob/main/data_loader.py
好的,谢谢,后续我会参考下的您的代码看看
No branches or pull requests
您好,W2NER模型如何load中文不连续实体呢?就像原文中的CADEC数据集一样的格式,以所有字符的下标数组作为实体位置标记:
The text was updated successfully, but these errors were encountered: