lyutyuh / gazetteer-ner-acl19 Goto Github PK
View Code? Open in Web Editor NEWCode for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers
Home Page: https://www.aclweb.org/anthology/P19-1524
Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers
Home Page: https://www.aclweb.org/anthology/P19-1524
您好,请问一下我用不同的语料做训练,但是tokens.txt和预训练的embedding都一样,为什么vocab.get_vocab_size('tokens')返回的size会不一样呢
谢谢
https://github.com/lyutyuh/acl19_subtagger/blob/227d6b0389a6d01de7c43c229e293ffe09915b3d/modules/hscrf_layer_SoftDict.py#L229
这一行是不是应该注释掉啊,上面已经得到了BILOU_features,为什么又重新置0了呢
Looks like the server that hosts pre trained models is down. I'm getting Bad Gateway error at https://www.jeffliu.page
I am keep getting error - "CUDA out of memory", even though I have reduced batch size (to 2) and model size. Tried to use multiple GPUs, doesn't work. Any ideas?
Hi I am trying to train this on CPU and I am getting the below error.
result = self.forward(*input, **kwargs)
File "./models/HSCRF_SoftDict.py", line 228, in forward
] += self.end_token_embedding.cuda(util.get_device_of(spans))
RuntimeError: Device index must not be negative
I am not very much aware of torch. Let me know how to resolve it.
请问您有没有遇到过子分类器单独预测时候用model.tar.gz加载模型得到的参数和在代码中用.th文件加载的参数不一致的情况?
Hi, I am getting the following error while running the training script.
FileNotFoundError: [Errno 2] No such file or directory: 'https://www.jeffliu.page/files/state.th'
It seems that jeffliu.page website is down. Any idea how to resolve this issue
Hi,
could you link back from the main README to the https://github.com/microsoft/vert-papers repo from KC?
请问如何预训练一个subtagger呢,因为想把这个框架用在别的gazetteer上
is there any difference between this code and https://github.com/microsoft/vert-papers/tree/master/papers/SubTagger , which should I use for the ACL 19 paper? Thanks .
hi,我想请问一下,论文里另外两个模型HSCRF+concat和HSCRF+gazemb代码啥时候能上传啊?
HSCRF+concat主要用的哪些特征啊?
Unable to access the data on the page : "https://www.jeffliu.page/files/softdict.zip" . Probably the page is down.
Throws this error :
2020-10-16 01:15:42,574 - WARNING - urllib3.connectionpool - Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'NewConnectionError('<urllib3.connection.VerifiedHTTPSConnection object at 0x7f60d4afb240>: Failed to establish a new connection: [Errno 111] Connection refused',)': /files/DATA/conll2003/train.txt
能问一下这个算法Viterbi的动态规划是怎么写的吗,多加了一层循环不太会了,代码看的有点儿不懂
谢谢
按照soft_dictionary_span_classifier_HSCRF.py里的写法,是对gazetteers的条目进行了序列标注而不是像论文里写的是对条目进行分类呀
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.