kindle15 / cx-extractor-python Goto Github PK
View Code? Open in Web Editor NEWThis project forked from chrislinan/cx-extractor-python
基于行块分布函数的通用网页正文抽取算法的Python版本实现,添加了英文支持/ Web page content extraction algorithm, support both Chinese and English
License: MIT License