roc-streaming / roc-toolkit Goto Github PK

View Code? Open in Web Editor NEW

1.0K 40.0 207.0 9.99 MB

Real-time audio streaming over the network.

Home Page: https://roc-streaming.org

License: Mozilla Public License 2.0

Python 5.21% C++ 90.90% C 2.83% Shell 0.56% Ragel 0.50%

real-time audio streaming networking fec dsp hacktoberfest

roc-toolkit's Introduction

Roc Toolkit: real-time audio streaming

Roc is a toolkit for real-time audio streaming over the network.

You can read about the project here:

Summary

The toolkit consists of a C library and a set of command-line tools.

Key features:

real-time streaming with guaranteed latency;
restoring lost packets using Forward Erasure Correction codes;
converting between the sender and receiver clock domains;
CD-quality audio;
multiple profiles for different CPU and latency requirements;
portability;
relying on open, standard protocols.

Besides library and tools, toolkit provides:

bindings for other programming languages
modules for sound servers like PulseAudio and PipeWire
applications

Documentation

Documentation for the latest release is available here.

Additionally, there is also Doxygen-generated documentation for internal modules.

Build status

See details on continuous integration here.

Branch	Status
`master`
`develop`

Versioning

See details here:

Platforms

See details here.

If you want to run Roc on an embedded system or a single-board computer, see:

Plans

See here:

Donations

If you would like to support the project financially, please refer to this page. This project is developed by volunteers in their free time, and your donations will help to spend more time on the project and keep it growing.

Thank you!

Community

We have a forum, mailing list, and Matrix chat room. See details here.

Contributing

Contributions in any form are always welcome! Please check out contribution guidelines.

Tasks needing help are listed here.

Licensing

See details on licenses here.

Authors

See a list of maintainers and contributors here.

roc-toolkit's People

Contributors

Stargazers

Watchers

Forkers

digideskio itsn1x readmenot ben-401 xiaocui duyouhua ajs124 mutiadavid myonetaps zenuo linecode lahvuun sumdoll malcops gill984 erik-d-mueller dustins hrishikeshsuresh ilanmimoun astitva98 tomd1990 biljith nporkodi root5427 sardo15 asalle gavv duynhan39 v-ini-t86 moneytech roiotero alexandremgo urkx sumit581 strfry cristobalm mehrdad-shokri rohank99 aplastiras jverce equalearn ivanmiklec davex25 leonard-9500 pandiora metheccoder4 anniemay1337 aj-thomas-8 kruthikahassan drahflow clevershovel fusuiyi123 villaquiranm peppermachine ddimension qaws01395 mecope1 ekanshbhatnagar deepanshu2015 matteoarella moon9 caseyguansun rishabhc711 pvimal816 enigmaro prg1005 yjheo15 nazi-pikachu ritikjain999 ssfuen ashishsiyag curioustauseef kelre hassan-a ameyanrd abiabilesh imagehandler prabhsuggal ejeschke jandersof86 jorawarsingh12 jsomwaru my5t3ry swapnil-gaur jarhat tboschi terry182 kenneth-raymond stevenzhu94 bkayes adrianniebla rostamn739 tracylai standardgalactic black-eagle17 zzu-andrew gourishbiradar brox1 hypfer zinbers

roc-toolkit's Issues

Check alignment in LDPC_BlockEncoder

Determine alignment requirements for OpenFEC and check that buffers are properly aligned.

When alignment requirements are not met, OpenFEC encoder segfaults. (Misaligned packets were fixed in 1977737).

Add integration test for public API

Test sender and receiver.

Replace ad-hoc FEC packets with something standard-compliant

SCons: improve clang/gcc sanitizers support

add option to disable sanitizers
enable both clang and GCC sanitizers

Public API

roc-streaming/roc-pulse#23 is a good start for gathering requirements for public API.

Improve scons options

doxygen support
check external dependencies and their versions
use pkg-config if available
configurable paths for external dependencies
option to download and build external dependencies (--with-3rdparty)
option to select enabled targets manually (--with-targets)
check clang-format version
configurable compiler and tools paths

Determine maximum acceptable packet loss ratio for FEC decoder

Our integration tests with random packet loss of 1% fail from time to time. I guess it's because sometimes, there are to much lost packets per block.

I think we should use a fixed number of lost packets per block in such tests.

SCons: remove .sconsign in 'clean' target

Refactor roc_pipeline

extract session management from server
merge base and derived classes (client, server, session)
remove session composer (use pool directly)
remove audio::IRenderer
merge audio::Renderer into pipeline::Session

Improve roc-send and roc-recv tools

roc-recv: add --oneshot option; if it's set, exit when first client disconnects
roc-send: fix EOF handling when sending files
roc-send: produce readable error message when sox can't open input file

Get rid of hardcoded latency in FreqEstimator

Currently, latency can be configured at runtime for Delayer, but not for FreqEstimator. Thus, using non-default latency will break it.

Session negotiation

Implement some sort of session negotiation.

At a minimum, we need to be able to transfer session parameters from the sender to the receiver. The parameters should include rtpmap (dynamic payload types) and fssi (FEC-scheme specific information).

Why we need this:

to fully confirm FECFRAME and make FEC scheme parameters configurable;
to be able to add more encodings, which require dynamic payload types.

The session description should be in the SDP format because this is required by various RTP-related RFCs, e.g. FECFRAME or Opus for RTP.

As for the control protocol, we can use RTSP/1.0 or SIP.

We should also add a session control port to the API, command-line tools, and PulseAudio modules.

RTCP support

RFC 3550 requires conforming implementations to provide RTCP support. In fact, roc can function without it, though RTCP provides some benefits:

without RTCP, sender has no feedback from receiver; it even doesn't know, if receiver was able to receive traffic, if receiver was able to process the traffic and didn't reject it for some reason, and if quality of service is acceptable;
RTCP allows receiver to detect associated streams from single sender in a standard-conforming way, e.g. audio stream and FEC stream (we're now using an ad-hoc implementation that conforms to no standard);
RTCP provides timing synchronisation for sender and receiver and for several streams of single sender (e.g. audio and video);
RTCP can be used to implement dynamic adjustment of latency and FEC code rate (using RFC 5725);
RTCP can be used to implement retransmission using RFC 4588 and RFC 4585;
RTCP allows multiple senders to share the same source transport address (e.g. in case of RTP translator);
RTCP allows seamless sender restart or address change (this also allows sender roaming);
RTCP allows receiver to terminate sender's session immediately without waiting for timeout;
RTCP is used in RTSP, which we may want to support in future;
with RTCP, receiver sends to sender various statistics that may be useful for troubleshooting, especially when receiver is some embedded device which is harder to debug;
some senders may reject connection if there is no RTCP feedback from receiver (I'm not aware of such senders however);
finally, RTCP also allows to exchange arbitrary application-specific data.

RTCP consists of the following message types:

SR (sender report): statistics sent from sender to receiver;
RR (receiver report): statistics and feedback send from receiver to sender;
SDES (source description items): sender identification sent from sender to receiver;
BYE: session termination, initiated by sender or receiver;
APP: arbitrary application-specific data.

Refactor output latency in server

Move output latency from SampleBufferQueue to Server.

remove start threshold from SampleBufferQueue
generate output_latency / samples_per_tick zero buffers before first tick in server
cleanup config

Create web site for Roc

Landing page
Sphinx
Doxygen
Mobile-friendly

Fix ALSA underrun

Add overview and server-flow pages to wiki

https://github.com/roc-project/roc/wiki/Overview

Write or generate man pages for tools

We could use help2man to generate man pages from --help.

Resampler tool

Add roc-resample tool:

read input samples from wav file
pass to resampler
measure resampler performance (min/max/avg real seconds per virtual second)
write result samples to wav file

Reduce output latency

understand why higher output latency is currently required for ALSA backend compared to pulseaudio backend;
determine minimum possible latency for pulseaudio and ALSA (bare and over pulseaudio);
reduce output latency:
- tune SoX parameters;
- or, alternatively, tune ALSA parameters directly.

See alsa_play_tuned.cpp for full list of ALSA parameters that may affect latency.

Configure CI for Linux/x86_64

First, we need https://travis-ci.org/ for various linux distros and mac os.

Document FreqEstimator and Resampler

Please :)

https://github.com/roc-project/roc/wiki

SCons: Add 'install' target

This target should install public API headers and library (#12), tools, and documentation (#28) on system.

Move output latency handling from server to pipeline

Currently, output latency is handled directly in Server. We should move it to new pipeline component, DelayedWriter.

Add Opus support

Opus is a lossy streaming audio codec.

Specs:

RFC 6716 - codec
RFC 7587 - RTP payload format

Adding Opus support introduces a few challenges:

currently RTP payload decoder is stateless; Opus decoder is stateful;
currently RTP payload decoder and FEC decoder are separate pipeline steps; Opus decoder combines them.

Our fec::Decoder is packet-format-independent and works before decoding audio packet payload. Thus, it theoretically may be used in conjunction with Opus decoder, if it makes sense.

Get rid of additional output latency in server?

Currently, output latency is added to session latency. But is this really required?

Add support for Raspberry Pi

Fix ALSA/pulseaudio capturing

Example:

pactl list sources | grep Name: | grep monitor
	Name: alsa_output.pci-0000_01_00.1.hdmi-stereo.monitor
	Name: alsa_output.pci-0000_00_14.2.analog-stereo.monitor

roc-send -vv -t pulseaudio -i alsa_output.pci-0000_01_00.1.hdmi-stereo.monitor \
    -s 127.0.0.1:10001 -r 127.0.0.1:10002

Configurable sample rate

Three components are aware of sample rate:

sndio::Reader
sndio::Writer
audio::TimedWriter, used in pipeline::Server and pipeline::Client

What we need:

add --rate option for roc-send and roc-recv
add sample rate setter and getter to IPacket
set sample rate in Splitter
drop packets with mismatching sample rate in Watchdog

IPv6 support

Current implementation unconditionally uses IPv4:

uses 4 bytes for IP address;
calls ipv4 functions from libuv to parse and bind address.

We can add support for IPv6, activated if it's available at build time and if user provided an IPv6 address.

Support selecting compiler version in SCons

E.g. when multiple GCC versions are installed, it should be possible to select specific version.

Add client/server integration test for roc_pipeline

Currently, there are only separate unit tests for client and server.

Document tools usage

network options
reading/writing files
reading/writing devices (ALSA, pulseaudio)

Refactor Chanalyzer

Implement per-channel instances of IPacketReader instead if single IAudioPacketReader
Remove IAudioPacketReader interface
Cast packets to IAudioPacket in Streamer

Tricky case with Depacketizer and fec::Reader

It seems that in current implementation there is a case when we're unnecessary dropping received packet(s).

Assume sender generates three packets:

+----+----+----+
| P1 | P2 | P3 |
+----+----+----+

Then they're reordered by interleaver or network:

+----+----+----+
| P3 | P1 | P2 |
+----+----+----+

and P1 and P2 are delayed a bit:

+----+-------+----+----+
| P3 | delay | P1 | P2 |
+----+-------+----+----+

When Depacketizer wants to read P1 from fec::Reader, only P3 is already received:

+----+-------+
| P3 | delay |
+----+-------+

Thus, fec::Reader considers that P1 and P2 are lost and returns P3. Depacketizer notes that P1 and P2 are lost. Depacketizer renders zeros instead of P1.

Then, P1 and P2 are received:

+----+----+
| P1 | P2 |
+----+----+

After this, Depacketizer is going to render samples for P2, previously marked as lost.

Here is the problem:

P2 is received and it's time to extract samples from P2
however, Depacketizer renders zeros instead of P2, because it already got P3 from fec::Reader and thinks that P2 is lost
anyway, fec::Reader drops P2 because it never returns late packets

However, technically nothing prevents us from rendering P2 in this situation.

Note that with fec::Reader, the situation when queue is empty or almost empty is not rare in practice, because network latency jitter value may be close to aim_queue_size - fec_block_size (7 packets currently).

Is this a problem?

Add support for Mac OS X

Adapt SConstruct for MacOS
Adapt 3rdparty building for MacOS (#88)
Don't use version script on MacOS or maybe don't use it at all (#89)
Disable _POSIX_C_SOURCE on MacOS (doesn't work as expected)
Remove core::SpinMutex (we don't really need it) (#91)
Add target_posixtime and target_darwin to SConstruct (#99)
Port time functions to MacOS (#99)
Add Travis build for MacOS
Update wiki

Improve RTP support

add tests for RTP parser/composer
handle packet's SSRC (keep RTCP (#14) and SDP (#34) in mind)
- add SSRC support to RTP packet
- fill SSRC in Splitter and fec::Encoder
- check SSRC and seqnum in fec::Decoder
- parse packets before routing to session
- identify session by destination address + source address + SSRC list
- identify packet queue inside session by SSRC
- auto-detect SSRC for every packet queue inside session

external dependencies
building
installing
supported platforms

Support crosscompiling for x86 on x86_64

Use -m32 compiler/linker option and pass it to 3rd-parties.
Add 32-bit build to CI (most likely to ubuntu:nodep image).

Get rid of hardcoded parameters in roc_fec

In current implementation, the following encoder/decoder parameters are configured at compile time:

block size (number of data and FEC packets in block);
packet size (number of bytes in audio packet).

They are:

ROC_CONFIG_DEFAULT_FEC_BLOCK_DATA_PACKETS
ROC_CONFIG_DEFAULT_FEC_BLOCK_REDUNDANT_PACKETS
ROC_CONFIG_DEFAULT_PACKET_SIZE

Moreover, if server and client are built with different parameters, server may behave incorrectly.

What we need instead:

configure parameters at runtime (when creating encoder/decoder), and use ROC_CONFIG_ values just as defaults;
handle possibility of parameters mismatch;
provide corresponding runtime configuration options in ServerConfig and ClientConfig in roc_pipeline;
use smaller packet size in client/server tests to make them faster.