Git Product home page Git Product logo

shiro's People

Contributors

sleepwalking avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

shiro's Issues

Supported audio/index length?

Hello,

I am building a dataset to train with and need to ask a few questions before proceeding.

What is the max supported/suggested audio length? is several minutes alright or should the audio be limited to about ~20 seconds or so? Likewise, is there a reasonable limit to the length of the index?

Thank you.

content of index.csv

Hi,
lua shiro-fextr.lua index.csv -d "../cmu_us_bdl_arctic/orig/" -x ./extractors/extractor-xxcc-mfcc12-da-16k -r 16000
Can you tell what is the content of index.csv file which is one of the input argument for speech-phoneme alignment.
Also what path should be provided for -d argument

Thanks

Phonetic stress without creating new phonemes?

I am looking to use Shiro to label speech with the stresses in-place. Does Shiro have support for this without treating them as a unique phoneme?

If not then would it be ok to request this as a feature? Being able to do something like ah durfloor 0.4 aka ah0 aka ah1 as to not waste data but still output the stress in the final label would be very useful.

Thank you.

when load the model, Null point always returned.

hello, first thanks for the nice framework.

the extraction of mfcc and first, second-order delta feature works well.
After that, when I load the model(.hsmm)

the Error: failed to load model from blah blah

.. error is occurred.

Some model file(empty.hsmm) doesn't occur above error.
And i made some test.txt or text.hsmm file and change the from path to test file to check the fopen function in hsmm = load_model(optarg) in shiro-rest.c whether it works well. But it also got an error!

fopen return success by checking 'perror', it returns 'Success'. the custom c file i made also can read any .hsmm and test.txt.
but it doesn't works only in your shiro-rest.c code.

I can't resolve this situation, how can i resolve this problem?

image

lua5.2: shiro-fextr.lua:54: module '/home/___/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-dae-16k.lua' not found:

Hi,

Building on linux, I'm encountering a problem running SHIRO.

I've tried with adding the .lua to the extrator as well but I get the same error.

lua5.2 shiro-fextr.lua ~/Downloads/UTAU/Resonance_Harmony_Arpasing_English/Base_B3/index.csv -d ~/Downloads/UTAU/Resonance_Harmony_Arpasing_English/Base_B3/ -x ~/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k -r 16000
lua5.2: shiro-fextr.lua:54: module '/home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k' not found:
no field package.preload['/home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k']
no file '/usr/local/share/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.lua'
no file '/usr/local/share/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k/init.lua'
no file '/usr/local/lib/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.lua'
no file '/usr/local/lib/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k/init.lua'
no file '/usr/share/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.lua'
no file '/usr/share/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k/init.lua'
no file './/home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.lua'
no file '/usr/local/lib/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.so'
no file '/usr/lib/x86_64-linux-gnu/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.so'
no file '/usr/lib/lua/5.2//home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.so'
no file '/usr/local/lib/lua/5.2/loadall.so'
no file './/home/myname/Prog/audio/SHIRO/extractors/extractor-xxcc-mfcc12-da-16k.so'
stack traceback:
[C]: in function 'require'
shiro-fextr.lua:54: in main chunk
[C]: in ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.