Git Product home page Git Product logo

deepsc-s's Introduction

Semantic Communication Systems for Speech Transmission

Introduction

This repository contains code for the project of a deep learning enabled semantic communcation system for speech signals, named DeepSC-S. The paper can be found here. This project is trained with TensorFlow 2.4.1.

Dataset

The dataset adopted in this project can be found in Edinburgh DataShare. Note that the original dataset version utilized in this project has been upgraded, and the orginal simulation environment is TensorFlow 1.14.0. We train the upgraded dataset with some modified hyperparameters in TensorFlow 2.4.1 instead of TensorFlow 1.14.0, which may results in a slight difference of the training loss.

Contact

Email: [email protected]

deepsc-s's People

Contributors

zhenzi-weng avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

deepsc-s's Issues

What mean of frame values in speech

Hi author,

When I am training your code. I printed the input tensor and it has a value matrix as in the below picture.
Here, what is the meaning of negative and positive values? Can you explain it to me?
image

loss problem

hello,
i have run your code, but the loss is hard to get 5e-7, specifically,It converges when the loss reaches 5e-6. Is there such a big difference between different data sets?

loss nan!

The version I am using is TensorFlow2.4.1, but the first epoch of training shows a loss of nan

Note that input is not a valid node name.

Dear author, when I run your code, it appears this errors as in figure 1 below.
image

So, how can I fixe it? In figure 2 is folder for this code. The "path to save the trained datatrain" folder does not have any created file after running main.py file.
Many thanks.
image

***************** start train *****************
WARNING:tensorflow:_input is not a valid node name. Accepted names conform to Regex /re.compile('^[A-Za-z0-9.][A-Za-z0-9_.\\/>-]$')/
WARNING:tensorflow:_input is not a valid node name. Accepted names conform to Regex /re.compile('^[A-Za-z0-9.][A-Za-z0-9_.\\/>-]
$')/

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbe in position 77: invalid start byte

I downloaded noise_trainset_28spk_wav from Edinburgh DataShare as a dataset, and changed the sampling rate in the code to sr=48kHz according to the information given by the website, and generated the corresponding .tfrecords file.
When I run main.py now, I'm told UnicodeDecodeError: 'utf-8' codec can't decode byte 0xbe in position 77: invalid start byte, and can't find the cause of the problem so far.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.