Comments (4)
- サンプルの作成先は https://github.com/axinc-ai/ailia-models/tree/master/audio_processing
- pyannote-audioのpthファイルをONNXに変換 https://medium.com/axinc/%E5%AD%A6%E7%BF%92%E3%81%97%E3%81%9F%E3%83%A2%E3%83%87%E3%83%AB%E3%82%92ailia-sdk%E3%81%A7%E4%BD%BF%E7%94%A8%E3%81%A7%E3%81%8D%E3%82%8B%E5%BD%A2%E3%81%AB%E3%82%A8%E3%82%AF%E3%82%B9%E3%83%9D%E3%83%BC%E3%83%88%E3%81%99%E3%82%8B-add271b8ebdd
- サンプルコード(Python)の作成(pytorchからonnx (ailia SDK)に移行)
- サンプルコードとモデルのPR
from ailia-models.
モデル登録の手順はSlackにリンクを共有しました。
from ailia-models.
pyannote audioのアーキテクチャ
https://herve.niderb.fr/fastpages/2022/10/23/One-speaker-segmentation-model-to-rule-them-all
from ailia-models.
入力波形から、各人物の発言のProbablityのグラフを算出し、それをセグメンテーションする。
モデルは5秒単位に実行し、2.5秒のオーバラップでスライディングウィンドウで処理する。
from ailia-models.
Related Issues (20)
- ADD dreamtalk HOT 1
- ADD llava
- ADD OOTDiffusion
- ADD DeepFlowGuidedVideoInpainting HOT 1
- ADD TripoSR
- ADD sd-turbo
- ADD bert ner japanese
- ADD AudioGen
- ADD MusicGen
- Add bert-network-packet-flow-header-payload
- PaddleOCRの標準モデルをServerモデルにする
- ADD AniPortrait HOT 2
- ADD sdxl-turbo HOT 3
- ModuleNotFoundError: No module named 'fvcore'
- ADD bge-m3 HOT 1
- ADD VISTA (hands-segmentation-pytorch)
- ADD Ego2Hands
- ADD japanese-reranker-cross-encoder-large-v1 HOT 1
- ADD cross-encoder-mmarco-mMiniLMv2-L12-H384-v1 HOT 18
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ailia-models.