Comments (8)
It would be important to know more details to be able to understand this, e.g.
- which permission is missing?
- where/how does this error pop up?
- which closed issue are you referring to?
If it is about Kõnele lacking the audio recording permission then you could try to grant it after installation using "adb":
adb shell pm grant ee.ioc.phon.android.speak android.permission.RECORD_AUDIO
from k6nele.
hi kaljurand,
sorry. i should have been more explicit.
- it's the microphone permission issue
- the error shows up on the yellow initial dialog when we press the microphone icon
- i was referenceing #43
i will try the adb command you suggested.
however i was able to use the android client app that you referenced in your code:
https://github.com/truongdo/kaldi-gstreamer-android-client
i had some problems building the above mentioned project but once i built it i did not have the mic permissions issue(ofcourse i am not able to get any transcribed output though. may be bcos i am speaking english instead of Estonian? i will be posting this question to alumae though)
thanks
from k6nele.
It think this is not an error but the standard permissions asking dialog that pops up.
Just press "Allow", or run on the commandline:
adb shell pm grant ee.ioc.phon.android.speak android.permission.RECORD_AUDIO
Recording/transcription seems to work fine on my emulator. (But there have been issues with recording on an emulator in the past.) For a non-Estonian server, consider setting up your own server, e.g. based on https://github.com/jcsilva/docker-kaldi-gstreamer-server
(https://github.com/truongdo/kaldi-gstreamer-android-client targets an older version of Android that does not support runtime permissions, i.e. all permissions are granted automatically at installation time. I guess that is why there is no additional popup.)
from k6nele.
hi kaljurand,
thanks for the response. i was able to get past the permissions error by running the adb command you suggested.
now i am able to speak into i.but as soon as i finish the speaking and press the mic button it brings up the google search screen. is this how its supposed to work?
you are right. the "truongdo" android client targets an older version. but the reason i was trying to get it working is bcos it's simpler than the one you have created.yours is quite robust and has lot more features.honestly it's little bit overhelming for me to learn your sample as i am relatively new to android.
i just want to write a bare bones android client that will talk to the kaldi-gstreamer serrvice running on my server with my trained english models and just spits out the transcribed text to be shown on the android client app(this is the way i want to get my feet wet. once i make some progress i will reference your code for adding more features)
can u please suggest what files i need to look into in your sample to achieve my goal? or if there is an easier android client sample that i will be able to quickly modify for my needs?
thanks in advance
from k6nele.
hi kaljurand,
following this https://github.com/jcsilva/docker-kaldi-gstreamer-server really helped me understand the app. i got ur app working on a notepad like app on my android emulator. thanks.
now if only i can write a bare bones version of your app i will be very very happy. any help is greatly appreciated.
from k6nele.
hi kaljurand,
sorry.one more question. i am looking at the architecture diagram in the README and the way i understood it is that the net-speech-api is only used for the http based Grammar recognition for the pocket-sphinx server. since in my case i will not be using the porcket-sphinx server, is it safe to remove this module?
thanks
from k6nele.
By default, Kõnele shows the transcription using a web search activity (e.g. "google search"). This can be changed by calling Kõnele from another app (instead of the launcher), or by using rewrite rules.
Yes, net-speech-api is not really requred for the kaldi-gstreamer-server. However, there is currently a dependency via ChunkedWebRecSessionBuilder, which is probably not so difficult to remove.
In order to get a basic kaldi-gstreamer-server client working, take the code from
https://github.com/Kaljurand/speechdemo1 and add the following Kõnele classes to it:
However, even simpler is if you can assume that Kõnele is installed on the device, i.e. maybe you can bundle it with your app somehow. Then you just need to call the standard Android speech API from your app and require that it should be serviced by Kõnele. You can specify your server via the SERVER_URL extra. (See also: http://kaljurand.github.io/K6nele/docs/en/developer_guide.html).
from k6nele.
awesome.thank you for your patient and prompt responses.really appreciate it. will let you know how it goes.
thanks again.
from k6nele.
Related Issues (20)
- Dynamically Enable/Disable Rewrite Rules via Buttons HOT 3
- Remove need for QUERY_ALL_PACKAGES
- Mic icon goes black, no speech transcribed, no errors HOT 3
- Material design 3 HOT 1
- If Kõnele is launched through AnySoftKeyboard dictation, it opens a web search instead of inputting text HOT 3
- Other similar projects to collaborate with.
- New way to select and configure services
- konele's text doesn't stay in input box after MIUI 14 update HOT 8
- Recognition mostly fails with "general app error" / "not connected to the recognition service" (error 5) in recent Android v14 updates
- F-Droid can't build HOT 9
- [FR] Web Speech API HOT 1
- android 11 insufficient permissions HOT 11
- Screenshots for F-Droid HOT 1
- Fail to compile the source code, too many errors, please fix them. HOT 1
- Checkbox does not disable after on-screen rotation. HOT 2
- Microphone stays active after using Kõnele in Android 12 HOT 1
- Build error HOT 1
- Android TV 13: RecognitionService: caller doesn't have permission:android.permission.RECORD_AUDIO HOT 15
- Searching recognition results via launcher icon fails on Android 13 HOT 2
- Temporarily disabling "Microphone access" centrally breaks access to Kõnele services HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from k6nele.