xszyou / fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.

License: GNU General Public License v3.0

ai python douyin ue4

fay's People

Contributors

andrewsty, theramu, wangzai23333, xszyou

fay's Issues

Stuck at speech recognition

Testing on Alibaba Cloud works normally, so the device is not the problem. The returned text is always empty. Please help.
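[00:42:12][System] Listening...
[00:42:14][System] Processing speech...
[00:42:15][System] Speech processing complete! Elapsed: 1025 ms
[00:42:15][System] [!] No content detected in speech!
[00:42:28][System] Listening...
[00:42:30][System] Processing speech...
[00:42:31][System] Speech processing complete! Elapsed: 1010 ms
[00:42:31][System] [!] No content detected in speech!
[00:43:44][System] Listening...
[00:43:46][System] Processing speech...
[00:43:47][System] Speech processing complete! Elapsed: 1001 ms
[00:43:47][System] [!] No content detected in speech!
[00:43:49][System] Listening...
[00:43:50][System] Processing speech...
[00:43:51][System] Speech processing complete! Elapsed: 1004 ms
[00:43:51][System] [!] No content detected in speech!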

mac M1: pip install ./FunASR fails with an installation error

(base) yyw@yywdeMac-mini Fay % pip install ./FunASR
Looking in indexes: https://pypi.tuna.tsinghua.edu.cn/simple
Processing ./FunASR
Preparing metadata (setup.py) ... error
error: subprocess-exited-with-error

× python setup.py egg_info did not run successfully.
│ exit code: 1
╰─> [30 lines of output]
/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/installer.py:27: SetuptoolsDeprecationWarning: setuptools.installer is deprecated. Requirements should be satisfied by a PEP 517 installer.
warnings.warn(
Traceback (most recent call last):
File "", line 2, in
File "", line 34, in
File "/Users/yyw/Desktop/QuWan/AIProject/Fay/FunASR/setup.py", line 125, in
setup(
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/__init__.py", line 86, in setup
_install_setup_requires(attrs)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/__init__.py", line 80, in _install_setup_requires
dist.fetch_build_eggs(dist.setup_requires)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/dist.py", line 874, in fetch_build_eggs
resolved_dists = pkg_resources.working_set.resolve(
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/pkg_resources/__init__.py", line 789, in resolve
dist = best[req.key] = env.best_match(
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/pkg_resources/__init__.py", line 1075, in best_match
return self.obtain(req, installer)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/pkg_resources/__init__.py", line 1087, in obtain
return installer(requirement)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/dist.py", line 944, in fetch_build_egg
return fetch_build_egg(self, req)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/installer.py", line 87, in fetch_build_egg
wheel.install_as_egg(dist_location)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/wheel.py", line 110, in install_as_egg
self._install_as_egg(destination_eggdir, zf)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/wheel.py", line 118, in _install_as_egg
self._convert_metadata(zf, destination_eggdir, dist_info, egg_info)
File "/Users/yyw/miniforge3/lib/python3.10/site-packages/setuptools/wheel.py", line 162, in _convert_metadata
os.rename(dist_info, egg_info)
OSError: [Errno 66] Directory not empty: '/Users/yyw/Desktop/QuWan/AIProject/Fay/FunASR/.eggs/numpy-1.21.3-py3.10-macosx-11.0-arm64.egg/numpy-1.21.3.dist-info' -> '/Users/yyw/Desktop/QuWan/AIProject/Fay/FunASR/.eggs/numpy-1.21.3-py3.10-macosx-11.0-arm64.egg/EGG-INFO'
[end of output]

note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed

× Encountered error while generating package metadata.
╰─> See above for output.

note: This is an issue with the package mentioned above, not pip.
hint: See above for details.
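
The OSError above is raised while setuptools renames metadata inside the project's .eggs directory, which suggests a leftover from an earlier failed attempt. One thing that may be worth trying (an assumption, not a confirmed fix for this issue) is deleting that cache before rerunning pip. The helper name `remove_stale_eggs` is hypothetical:

```python
import shutil
from pathlib import Path


def remove_stale_eggs(project_dir: str) -> bool:
    """Delete the .eggs build cache under project_dir, if it exists.

    Returns True when a directory was removed, False when there was none.
    """
    eggs_dir = Path(project_dir) / ".eggs"
    if eggs_dir.is_dir():
        shutil.rmtree(eggs_dir)
        return True
    return False
```

Calling `remove_stale_eggs("./FunASR")` from the Fay directory and then rerunning `pip install ./FunASR` would show whether the stale directory was the cause.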

macOS: running python main.py fails with No module named 'winreg'

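Isn't the winreg module only available on Windows?
Python 3.9 installed on macOS without errors. I followed the steps in
https://blog.csdn.net/aa84758481/article/details/127551258
for the installation, and running python main.py at the end fails with: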
`
japhone@localhost live-virtual-digital-person$ python main.py

pygame 2.1.3 (SDL 2.0.22, Python 3.9.10)

Hello from the pygame community. https://www.pygame.org/contribute.html

Traceback (most recent call last):

File "/private/var/www/html/fay/live-virtual-digital-person/main.py", line 10, in

from gui import flask_server

File "/private/var/www/html/fay/live-virtual-digital-person/gui/flask_server.py", line 8, in

import fay_booter

File "/private/var/www/html/fay/live-virtual-digital-person/fay_booter.py", line 12, in

from core.viewer import Viewer

File "/private/var/www/html/fay/live-virtual-digital-person/core/viewer.py", line 15, in

import winreg

ModuleNotFoundError: No module named 'winreg'

`
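
winreg is part of the Python standard library on Windows only, so the import in the traceback above cannot succeed on macOS. A minimal sketch of a platform-tolerant import follows. How core/viewer.py actually uses winreg is not shown in this issue, so the helper `read_registry_string` and its behavior are hypothetical:

```python
try:
    import winreg  # standard library, present on Windows only
except ImportError:
    winreg = None  # macOS / Linux: there is no registry to read


def read_registry_string(subkey: str, name: str, default: str = "") -> str:
    """Hypothetical helper: read a value under HKEY_CURRENT_USER, else return default."""
    if winreg is None:
        return default
    try:
        with winreg.OpenKey(winreg.HKEY_CURRENT_USER, subkey) as key:
            value, _value_type = winreg.QueryValueEx(key, name)
            return str(value)
    except OSError:
        # Key or value does not exist
        return default
```

With this pattern the module imports on every platform, and callers get the default value wherever the registry is unavailable.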

python main error, docker ubuntu 18.04

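I am using a Synology NAS as the server. Running python main.py in Docker produces the error below. I do not yet know whether the Qt problem is what causes the core dump; I am currently working on X11 forwarding.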

pygame 2.1.2 (SDL 2.0.16, Python 3.8.2)
Hello from the pygame community. https://www.pygame.org/contribute.html

 * Serving Flask app 'gui.flask_server' (lazy loading)
 * Environment: production
   WARNING: This is a development server. Do not use it in a production deployment.
   Use a production WSGI server instead.
qt.qpa.plugin: Could not load the Qt platform plugin "xcb" in "" even though it was found.
This application failed to start because no Qt platform plugin could be initialized. Reinstalling the application may fix this problem.

Available platform plugins are: eglfs, linuxfb, minimal, minimalegl, offscreen, vnc, wayland-egl, wayland, wayland-xcomposite-egl, wayland-xcomposite-glx, webgl, xcb.

Aborted (core dumped)
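
Qt picks its platform plugin from the QT_QPA_PLATFORM environment variable, and the log above lists "offscreen" among the available plugins. The sketch below falls back to it when no X11 display is configured. Whether Fay's UI is usable with the offscreen plugin is an untested assumption, and both function names are hypothetical:

```python
import os
from typing import Mapping


def choose_qt_platform(environ: Mapping[str, str]) -> str:
    """Return 'xcb' when an X11 display is configured, otherwise 'offscreen'."""
    return "xcb" if environ.get("DISPLAY") else "offscreen"


def configure_qt_platform() -> str:
    """Set QT_QPA_PLATFORM unless it is already set.

    This has to run before PyQt5 is imported to have any effect.
    """
    platform = os.environ.get("QT_QPA_PLATFORM") or choose_qt_platform(os.environ)
    os.environ["QT_QPA_PLATFORM"] = platform
    return platform
```

If X11 forwarding is the goal, the xcb plugin also needs its shared libraries inside the container; this sketch does not address that.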

python3.10: cannot open Fay Eyes

The error is as follows:
[11:29:54][System] YOLO v8 is starting...
127.0.0.1 - - [2023-06-28 11:29:54] "POST /api/control-eyes HTTP/1.1" 200 246 1.047371
Exception in thread Thread-10 (run):
Traceback (most recent call last):
File "/opt/homebrew/Cellar/python@3.10/3.10.6_2/Frameworks/Python.framework/Versions/3.10/lib/python3.10/threading.py", line 1016, in _bootstrap_inner
self.run()
File "/opt/homebrew/Cellar/python@3.10/3.10.6_2/Frameworks/Python.framework/Versions/3.10/lib/python3.10/threading.py", line 953, in run
self._target(*self._args, **self._kwargs)
File "/Users/sam/Dev/dev/Fay/ai_module/yolov8.py", line 89, in run
operated_frame = frame.copy()
AttributeError: 'NoneType' object has no attribute 'copy'
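
The AttributeError above means `frame` was None when `frame.copy()` ran, which typically happens when the camera could not be opened or returned no image. A minimal sketch of a guard follows. It assumes the capture object follows the cv2.VideoCapture convention, where `read()` returns an `(ok, frame)` tuple; the issue does not show how yolov8.py obtains its frames, and the function names are hypothetical:

```python
from typing import Any, Optional


def read_frame(capture: Any) -> Optional[Any]:
    """Return the next frame, or None when the source yields nothing."""
    ok, frame = capture.read()
    if not ok or frame is None:
        return None
    return frame


def process_stream(capture: Any, max_failures: int = 5) -> int:
    """Copy frames until the source fails max_failures times in a row.

    Returns the number of frames handled.
    """
    handled = 0
    failures = 0
    while failures < max_failures:
        frame = read_frame(capture)
        if frame is None:
            failures += 1
            continue
        failures = 0
        operated_frame = frame.copy()  # safe: frame is known not to be None here
        handled += 1
    return handled
```

On macOS it may also be worth checking that the terminal running Python has camera permission, since a denied camera would produce empty reads.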

Cannot find ali_nls_key_id and ali_nls_key_secret.

Alibaba Cloud real-time speech recognition service keys (required)

ali_nls_key_id=
ali_nls_key_secret=

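Hello, I have registered for the Alibaba Cloud speech service, but so far I only have ali_nls_app_key.

Question: where do these two parameters come from?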

Error at startup

[17:50:34][System] Starting core services...
[1] 78941 illegal hardware instruction python main.py

Error when starting main.py

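Python version: 3.8.7
OS: Windows 7
Following the documentation, every dependency was installed successfully with pip at the required version.
After importing the project into PyCharm, running main.py fails. Could the author please take a look? Thanks!

The error message is as follows: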
Traceback (most recent call last):
File "D:\Python\Python38\lib\site-packages\azure\cognitiveservices\speech\speech_py_impl.py", line 27, in swig_import_helper
return importlib.import_module(mname)
File "D:\Python\Python38\lib\importlib\__init__.py", line 127, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
File "", line 1014, in _gcd_import
File "", line 991, in _find_and_load
File "", line 975, in _find_and_load_unlocked
File "", line 657, in _load_unlocked
File "", line 556, in module_from_spec
File "", line 1101, in create_module
File "", line 219, in _call_with_frames_removed
ImportError: DLL load failed while importing _speech_py_impl: The specified procedure could not be found.

python main.py fails with ERROR:zygote_host_impl_linux.cc(262)] Failed to adjust OOM score of renderer with pid 21908: Permission denied (13)

As in the title.
The error output is as follows:
WebEngineContext used before QtWebEngine::initialize() or OpenGL context creation failed.
[21839:21906:0613/170710.643830:ERROR:zygote_host_impl_linux.cc(262)] Failed to adjust OOM score of renderer with pid 21908: Permission denied (13)
Failed to create OpenGL context for format QSurfaceFormat(version 2.0, options QFlags<QSurfaceFormat::FormatOption>(), depthBufferSize 24, redBufferSize -1, greenBufferSize -1, blueBufferSize -1, alphaBufferSize -1, stencilBufferSize 8, samples 0, swapBehavior QSurfaceFormat::DefaultSwapBehavior, swapInterval 1, colorSpace QSurfaceFormat::DefaultColorSpace, profile QSurfaceFormat::NoProfile)

Window displays abnormally

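When running python main.py, the window shows only the icon and title; everything else is blank. There is no error message, so I do not know where the problem lies.
QQ截图20230318165113
QQ截图20230318165051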

python main.py fails: the Qt window opens blank for a few seconds and then closes; console output below

C:\Users\淇淇\AppData\Local\Programs\Python\Python39\python.exe C:/Users/淇淇/Desktop/GPTapi/数字人/Fay/main.py
pygame 2.1.3 (SDL 2.0.22, Python 3.9.7)
Hello from the pygame community. https://www.pygame.org/contribute.html
Traceback (most recent call last):
File "C:\Users\淇淇\AppData\Local\Programs\Python\Python39\lib\asyncio\windows_events.py", line 434, in select
Traceback (most recent call last):
File "C:\Users\淇淇\AppData\Local\Programs\Python\Python39\lib\asyncio\windows_events.py", line 434, in select
self._poll(timeout)
RuntimeError: <_overlapped.Overlapped object at 0x000001BF935AC930> still has pending operation at deallocation, the process may crash
self._poll(timeout)
RuntimeError: <_overlapped.Overlapped object at 0x000001BF935AC9C0> still has pending operation at deallocation, the process may crash

Process finished with exit code -1073740791 (0xC0000409)

websocket version conflict

  • Awesome project!
  • A small problem comes up during deployment:
  • websocket has no attribute 'enableTrace'
  • 'module' object has no attribute 'enableTrace'

Don't forget to uninstall websocket, and uninstall websocket-client if you already installed it, then reinstall websocket-client:
pip uninstall websocket
pip uninstall websocket-client
pip install websocket-client

  • Removing websocket from requirements.txt directly might resolve the conflict.
  • Please correct me if I am wrong.
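
Both the websocket and websocket-client packages install a top-level module named `websocket`, which is why the wrong one can shadow the other. The following sketch checks which one is active by looking for `enableTrace`, the function the error message says is missing. The helper names are hypothetical:

```python
import importlib
from types import ModuleType
from typing import Optional


def has_enable_trace(module: ModuleType) -> bool:
    """True when the module exposes a callable enableTrace, as websocket-client does."""
    return callable(getattr(module, "enableTrace", None))


def load_websocket() -> Optional[ModuleType]:
    """Import the top-level `websocket` module, or return None if none is installed."""
    try:
        return importlib.import_module("websocket")
    except ImportError:
        return None
```

If `load_websocket()` returns a module for which `has_enable_trace` is False, the conflicting package is probably still installed, and the module's `__file__` attribute shows where it was loaded from.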

Cannot call ChatGPT with the API key

Error:

[20:22:36][System] Natural language processing...
can only concatenate str (not "TypeError") to str
[20:23:06][System] Natural language processing error!

I have confirmed that the network can reach ChatGPT normally and that the API key is valid.
Environment: Windows 11, Anaconda

system.conf:

[key]
#funasr / ali
ASR_mode = ali
#ASR_mode = funasr
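#ASR, choose one of two (requires running the fay/test/funasr service): integrates the DAMO Academy ASR project; thanks to Zhang Congcong (张聪聪), algorithm engineer at 中科大脑, for the integration code
local_asr_ip=127.0.0.1
local_asr_port=10197

# ASR, choose one of two (recommended for the first run; free for 3 months): Alibaba Cloud real-time speech recognition service keys (required) https://ai.aliyun.com/nls/trans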
ali_nls_key_id=LTAId2LA
ali_nls_key_secret=CDCNGcCczbX8ZYF
ali_nls_app_key=3CkjbsF54


# Microsoft text-to-speech service key (optional; enables audio with different emotions) https://azure.microsoft.com/zh-cn/services/cognitive-services/text-to-speech/
ms_tts_key=
ms_tts_region=

# iFLYTEK sentiment analysis service key https://www.xfyun.cn/service/emotion-analysis/
xf_ltp_app_id=91688
xf_ltp_api_key=697216706abdd1bed

#NLP, choose one of five: lingju, yuan, chatgpt, rasa (requires starting chatglm and rasa, https://m.bilibili.com/video/BV1D14y1f7pr), VisualGLM
#chat_module=lingju
chat_module=chatgpt
#Lingju service key (NLP, choose one) https://open.lingju.ai
lingju_api_key=7b2fe0f1a2
lingju_api_authcode=b7900f

#Inspur Yuan large model service key (NLP, choose one of five) https://air.inspur.com/
yuan_1_0_account=  
yuan_1_0_phone=

#gpt service key (NLP, choose one) https://openai.com/
chatgpt_api_key=sk-1gwGRWawx


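#ngrok tunneling id; lets remote devices connect to Fay over the internet (optional) http://ngrok.cc
ngrok_cc_id=

#revChatGPT integration (optional; log in at https://chat.openai.com, then visit https://chat.openai.com/api/auth/session to obtain the values)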

gpt_access_token=
gpt_conversation_id=


proxy_config=
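
The message `can only concatenate str (not "TypeError") to str` in the log of this issue is what Python raises when an exception object is joined to a string with `+`. That suggests an error handler is masking the original failure, so the real cause of the ChatGPT call failing is hidden. The sketch below reproduces the message and shows a conversion that preserves the original error text. Where this happens in Fay's code is not shown here, so the names are hypothetical:

```python
from typing import Tuple


def describe_error(prefix: str, error: Exception) -> str:
    """Build a log message from an exception without raising a new TypeError."""
    return prefix + str(error)


def demo() -> Tuple[str, str]:
    """Return the masking message and the corrected message for the same error."""
    original = TypeError("unsupported operand")
    try:
        _broken = "NLP error: " + original  # str + exception object raises TypeError
    except TypeError as masking_error:
        masked = str(masking_error)
    fixed = describe_error("NLP error: ", original)
    return masked, fixed
```

Logging with `str(error)` (or an f-string) would surface the underlying TypeError, which is the information needed to diagnose the actual problem.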

Exception and fix

Exception in thread Thread-38:
Traceback (most recent call last):
File "/Users/libn/miniconda3/lib/python3.9/threading.py", line 973, in _bootstrap_inner
self.run()
File "/Users/libn/miniconda3/lib/python3.9/threading.py", line 910, in run
self._target(*self._args, **self._kwargs)
File "/Users/libn/Documents/XZNS/Research/chatgpt/Fay/ai_module/ali_nls.py", line 138, in __connect
websocket.enableTrace(False)

Fix:
pip uninstall websocket-client
pip uninstall websocket

pip install websocket-client

Bug: NetEase Cloud Music is unavailable

NetEase Cloud Music added a verification step, so requests without a header are rejected.
The fix is simple: in the __random_song method of song_player.py, add a header:
response = requests.request("GET", url, headers={'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/112.0.0.0 Safari/537.36 Edg/112.0.1722.39'})

ValueError: Invalid audio channels: my microphone is plugged in, so why does it still fail?

[11:10:58][System] Use 'help' to get help.
File "C:\Program Files\Python310\lib\threading.py", line 953, in run
self._target(*self._args, **self._kwargs)
File "D:\Fay-main\Fay-main\core\recorder.py", line 98, in __record
stream = self.get_stream()  # get_stream is factored out so that subclasses can record from different sources, such as a microphone or a network stream
File "D:\Fay-main\Fay-main\fay_booter.py", line 39, in get_stream
self.stream = self.paudio.open(input_device_index=device_id, rate=self.__RATE, format=self.FORMAT, channels=channels, input=True)
File "C:\Program Files\Python310\lib\site-packages\pyaudio\__init__.py", line 639, in open
stream = PyAudio.Stream(self, *args, **kwargs)
File "C:\Program Files\Python310\lib\site-packages\pyaudio\__init__.py", line 441, in __init__
self._stream = pa.open(**arguments)
ValueError: Invalid audio channels

In the console, the selected microphone is 微软声音映射器-Output (Microsoft Sound Mapper - Output).
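
The selected device name ends in "Output", so it may be a playback device with no input channels, and opening it with `input=True` would then fail as shown. That diagnosis is an assumption. The sketch below filters PyAudio device descriptions down to those that report at least one input channel; the function names are hypothetical, while `get_device_count`, `get_device_info_by_index` and the `maxInputChannels` key are part of PyAudio:

```python
from typing import Dict, List


def input_capable(devices: List[Dict]) -> List[Dict]:
    """Keep only devices that report at least one input channel."""
    return [d for d in devices if int(d.get("maxInputChannels", 0)) > 0]


def list_pyaudio_devices() -> List[Dict]:
    """Collect device info dicts from PyAudio (imported lazily; requires pyaudio)."""
    import pyaudio

    audio = pyaudio.PyAudio()
    try:
        return [
            audio.get_device_info_by_index(i)
            for i in range(audio.get_device_count())
        ]
    finally:
        audio.terminate()
```

Printing the `index` and `name` of each entry from `input_capable(list_pyaudio_devices())` gives the candidates that can be chosen as a microphone.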

Error when starting main.py

(base) zz@ubuntu:~/app/python/Fay$ python main.py 
pygame 2.1.3 (SDL 2.0.22, Python 3.9.13)
Hello from the pygame community. https://www.pygame.org/contribute.html
/home/zz/software/conda/anaconda3/lib/python3.9/site-packages/revChatGPT/__init__.py:29: UserWarning: The current Python is not a recommended version, 3.10+ is recommended
  __import__("warnings").warn(
Traceback (most recent call last):
  File "/home/zz/app/python/Fay/main.py", line 11, in <module>
    from gui.window import MainWindow
  File "/home/zz/app/python/Fay/gui/window.py", line 8, in <module>
    from PyQt5.QtWebEngineWidgets import *
ImportError: /usr/lib/x86_64-linux-gnu/libgssapi_krb5.so.2: symbol krb5_ser_context_init version krb5_3_MIT not defined in file libkrb5.so.3 with link time reference
