Comments (1)
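Hello, and sorry for not noticing this issue sooner.
- The rhyme lookup in Yun.py has to be built from a large corpus. Because our data has not been released yet, the construction functions are not provided. yun.py has been updated: it now keeps only the lookup code, and we provide the prebuilt rhyme data, yun.pkl.
- Because some rhyme characters belong to more than one rhyme category, we currently disambiguate with simple corpus statistics. We first use bi-grams, since a two-character word is usually enough to fix the pronunciation (and hence the rhyme category); if the bi-gram is inconclusive, we fall back to tri-grams.
- The level-tone (ping) and oblique-tone (ze) dictionaries are built from Pingshui Yun: they contain all of the level-tone characters and all of the oblique-tone characters listed there. The three files FunctionWords.txt, fchar.txt, and goodwords.txt were compiled by us.
- We made a simple generalization and adopted four of the more common line patterns, which can accommodate some positions where either a level or an oblique tone is allowed. A few exceptional cases are not handled for now, to keep the problem simple.
- In our paper, it was obtained from the corpus used in the experiments. We currently provide a preprocessed version on Google Drive, which was produced from a larger corpus.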
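For readers who want to rebuild something similar on their own data, the bi-gram then tri-gram disambiguation described above could be sketched roughly as follows. This is not the code that produced yun.pkl; the function names, the `(line, category)` corpus format, and the tie-handling rule are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def build_counts(corpus):
    """Count rhyme categories per line-final bi-gram and tri-gram.

    `corpus` is assumed to be an iterable of (line, category) pairs in
    which the last character of `line` is the rhyme character and
    `category` is its known rhyme category (hypothetical format).
    """
    bigram = defaultdict(Counter)
    trigram = defaultdict(Counter)
    for line, category in corpus:
        if len(line) >= 2:
            bigram[line[-2:]][category] += 1
        if len(line) >= 3:
            trigram[line[-3:]][category] += 1
    return bigram, trigram


def unique_best(counts):
    """Return the most frequent category, or None if empty or tied."""
    ranked = counts.most_common(2)
    if not ranked:
        return None
    if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]


def disambiguate(line, candidates, bigram, trigram):
    """Pick one rhyme category for the last character of `line`.

    Tries the final bi-gram first and falls back to the final tri-gram
    when the bi-gram is unseen or tied. Returns None if neither decides.
    """
    if len(candidates) == 1:
        return candidates[0]
    for table, n in ((bigram, 2), (trigram, 3)):
        if len(line) < n:
            continue
        seen = table.get(line[-n:], {})
        counts = Counter({c: k for c, k in seen.items() if c in candidates})
        best = unique_best(counts)
        if best is not None:
            return best
    return None
```

A caller would build the two tables once from a labelled corpus and then call `disambiguate(line, candidates, bigram, trigram)` for each character that has more than one candidate category.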
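We previously upgraded the code to newer Python and TensorFlow versions and optimized some details. If you run into any problems, feel free to raise them.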
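As an illustration of how the level/oblique dictionaries and the line patterns mentioned in the reply might fit together, here is a minimal sketch. The pattern notation (`P` for level, `Z` for oblique, `*` for either) and the function names are assumptions, the two character sets are tiny hand-filled samples rather than the repository's dictionaries, and the patterns shown are the textbook five-character forms, which may differ from the four the authors use.

```python
def tone_of(char, ping_chars, ze_chars):
    """Return 'P', 'Z', '*' (listed under both tones), or None if unknown."""
    in_ping = char in ping_chars
    in_ze = char in ze_chars
    if in_ping and in_ze:
        return "*"
    if in_ping:
        return "P"
    if in_ze:
        return "Z"
    return None


def matches_pattern(line, pattern, ping_chars, ze_chars):
    """Check a line against a tonal pattern such as 'ZZPPZ'.

    A '*' in the pattern accepts either tone, and a character listed
    under both tones fits any slot. Unknown characters fail the check.
    """
    if len(line) != len(pattern):
        return False
    for char, slot in zip(line, pattern):
        tone = tone_of(char, ping_chars, ze_chars)
        if tone is None:
            return False
        if slot != "*" and tone != "*" and tone != slot:
            return False
    return True


# Tiny illustrative sets; real ones would hold every Pingshui Yun character.
PING = set("依山黄河流")
ZE = set("白日尽入海")

print(matches_pattern("白日依山尽", "ZZPPZ", PING, ZE))
print(matches_pattern("黄河入海流", "PPZZP", PING, ZE))
```

Marking a flexible position with `*` in the pattern is one simple way to fold the "either tone allowed" cases into a small fixed set of patterns.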
from wmpoetry.