Git Product home page Git Product logo

soundlabel's People

Contributors

deepdarkssj avatar kslz avatar ownhere avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

soundlabel's Issues

很快,比这个工具强十倍甚至九倍的新数据集工具就会上线了

【很快!】
目前已支持:
通过音频文件+srt字幕文件导入数据集
导出aishell3/VITS格式数据集
调用标贝语音评测接口为数据集打分,筛选出优质数据

todo:
通过但音频文件导入数据集
调用讯飞声纹识别接口从一个有多说话人的音频中筛选出不同人所说的话
调用讯飞语音评测接口获取数据的MFA结果
。。。

有没有需求

目前没什么迭代方向,可能以后会做一个基于web的,支持多人协作的标注平台
有什么想要的功能可以提,我不一定会但是可以慢慢学2333

切割问题

image
image
我的srt文件有100多个字幕,但是却只显示了8个字幕

英文带引号的文本插入sqlite报错

带单引号的文本插入sqlite会报错,需要将再加一个单引号
例句:I'm looking for a dog
插入sqlite时需要改成
I'’m looking for a dog
插入后数据库显示的结果是正确的
I'm looking for a dog

需要修改的代码:
utils.insert_sound_line
utils.update_sound

英文标注的问题

我尝试用这个标注软件做英文语音的标注,但是因为英文很多带连写的单词,srt文件中很多类似 it's I'm这种省略的,我测试带了'的都会失败,将这个改为it is 或者I am后才可以导入成功,但是看代码没确定是哪里导致的问题,是数据库本身不支持这些字符吗,不太懂这个问题,不知道有人能解释下吗

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.