cpp-concurrency

Basic concurrency code snippets in C++.

Compilation

mkdir build && cd build/
cmake ..
make

Execution

./bin/cpp-concurrency

CMake

The current CMake script supports CMake 2.8 and higher.

cmake_minimum_required(VERSION 2.8)

Set the name of the project.

project(cpp-concurrency)

Find the Threads package. This is the module mode of CMake to find a package: if the findThreads.cmake file is found, its content is executed by CMake; the .cmake file usually contains the whole procedure to find and use external libraries into the current project. They provide dependency information for the target. They also perform all the preliminary checks indicating the library is correctly installed on the current system and can be used as well.

find_package(Threads)

Set the output path of the generated binary.

set(CMAKE_RUNTIME_OUTPUT_DIRECTORY ../bin)

The generated executable is not optimized for debug.

set(CMAKE_BUILD_TYPE Release)

We compile using C++14, and we display all the warnings.

add_compile_options(-std=c++14 -Wall)

Create a set of all the sources files.

file(
    GLOB
    sources
    src/*
)

Integrate the includes files directory.

include_directories(includes)

Specify the executable name after compilation.

add_executable(
    cpp-concurrency
    ${sources}
)

The libraries of the Threads package are used during the linking process. The command below adds -lthread ... to the compilation command.

target_link_libraries(cpp-concurrency ${CMAKE_THREAD_LIBS_INIT})

Concurrency theory

Threads

Threads are independent sequences of execution. Threads are started by processes and run into the same memory space. Processes run in different memory spaces.

The "memory space" management/organization depends of the OS implementation, but it's usually a space owned by one process (and many threads if the process is multithreaded).

Hardware threads vs Software threads

Hardware threads are a feature of the processor. This is the entity that really makes calculations. The amount of hardware threads is fixed and depends of the CPU.

Software threads are started by processes/OS, and managed by the OS. They are sent to the hardware threads for execution. Usually, if a software thread is blocking (waiting for an IO), a hardware thread is able to take another software thread and executes it.

Basic concurrency with threads

The following code (in simple_thread) starts two threads, and waits for each one to finish.

std::thread firstThread(procedureToRun);

std::thread secondThread(procedureToRun);

firstThread.join();
secondThread.join();

Unlike std::async tasks, it is not possible to return values from std::thread functions. If a std::thread function throws an exception, the program stops. When using std::thread, there is no automatic management for the repartition and oversubscription problems.

`join` vs `detach`

Example can be found in join_vs_detach.

When using the join method, the program waits for the given thread to be terminated.

std::thread t(method);

t.join(); // waits here until t is finished

When using the detach method, the program does not wait for the thread to be finished. The program can be terminated before the thread finishes.

std::thread t(method);

t.detach(); // both of the program and the thread continue their execution

RAII Threads

Example in raii_thread.

In order to be sure that the thread is correctly finished, one solution is to wrap it into a RAII class.

class RAIIThread {

public:

    enum class ThreadAction {
        Join, /** < call join() to finish the thread */
        Detach /** < call detach() to finish the thread */
    };

    RAIIThread(
        const ThreadAction& action,
        std::thread&& thread
    ) :
        action(action),
        thread(std::move(thread))
    {
    }

    ~RAIIThread() {

        if (not thread.joinable()) {
            return;
        }

        if (action == ThreadAction::Join) {
            thread.join();
        } else {
            thread.detach();
        }
    }

private:

    ThreadAction action;
    std::thread thread;
};

Tasks

Simple tasks usage

An example of this feature can be found into simple_task.

A task is a function passed to std::async. By using this function, a thread "might" be created to execute the passed function. The standard library analyzes by itself if the creation of a new thread is really necessary for the method (according to the processor hardware threads, the CPU caches, the CPU specificities...). In some cases, the std::async method will simply execute the given function in the same thread that asks for the task result (get function).

std::future<int> task = std::async(asynchronousFunction);

/* a lot of code */

auto result = task.get();

Oversubscription problem

This is a very common multi-threading problem. This problem occures when the amount of available hardware threads is lesser than the amount of non-blocking software threads, ready to be executed.

The operation system allows each software thread to be executed for a given time. When the time is over, another software thread is executed by a hardware thread.

Every software switch is a context switch (the CPU hardware thread must be refreshed in order to execute the new given thread).

std::async is a solution to this problem, as it may execute the given function without any new thread creation.

Work stealing

Work stealing algorithms are used in multi-threading environments.

When threads are created, their instructions are put inside a queue, in order to be executed by the hardware threads/processors. When a hardware thread or a processor has nothing to do, it checks the instructions queue of the other threads and "steals" them to put them into its own queue.

`std::launch::async` vs `std::launch::deferred`

Instead of using std::async, we can use one of the two following functions:

std::launch::async passed method is executed asynchronously, into a different thread,
std::launch::deferred passed method is executed synchronously at the get or wait method invocation; if no one of these two methods is called by the future object, the passed method is never executed.

If std::async is simply called, one of the two method above is applied. It could be a synchronous or an asynchronous call. The std::async method will judge by itself which execution way is better according to the current execution (oversubscription, work stealing, CPU...).

/* function is executed asynchronously in a new thread */
auto future = std::async(
    std::launch::async,
    function
);

/* function is executed synchronously into the same thread when calling wait() */
auto future = std::async(
    std::launch::deferred,
    function
);

/* function is executed synchronously OR asynchronously */
auto future = std::async(function);

There are common pitfalls with the simple std::async usage. Here a simple example:

/* if future is created with std::launch::deferred,
   the program does not create a new variable */
thread_local int value {5};

auto future = std::async(function);

if (future.wait_for(std::chrono::microseconds(500)) == std::future_status::ready) {

    /* if future is created with std::launch::deferred,
       then this code is never executed as the future
       is not ready */
}

future.wait();

Thread destruction

The way a thread is destroyed depends on its creation.

There are two way to destroy a thread:

blocking until the end of the thread (the destructor blocks),
terminates the thread immediately

The destructor of a thread blocks if:

the thread has been started with std::async,
the thread is not deffered

{
    auto future = std::async(
        std::launch::async,
        function
    );

    /* wait for the thread to be finished */
}

Sometimes, there is no guarantee the destructor will wait or not:

{
    auto future = std::async(function);

    /* may wait or not, depends how the thread has been started */
}

A classic walk-around is to declare an asynchronous task with packaged_task.

std::packaged_task executes any task and stores its result into a shared state that can be read by one or many futures, instead of std::future that directly has the returned value.

{
    std::packaged_task<int()> task(function);

    auto future = task.get_future();

    std::thread thread(task);

    /* the thread destructor will never wait;
       the destruction process will be the same
       as if we had simply declared std::thread  */
}

`thread_local` variables

A variable declared with thread_local indicates that the given variable is created for every new created thread.

An example can be found in thread_local_variables.

thread_local unsigned short copied_value {10};
unsigned short shared_value {10};

void threadFunction() {

    /* shared_value is equal to the last value that has been set by a thread */
    /* copied_value is guarantee to be equal to 10 */

    copied_value += 1;
    shared_value += 1;
}

auto future = std::async(
    std::launch::async,
    threadFunction
);

`std::mutex` and `std::lock_guard`

Usage example can be found in mutex.

A mutex is used to ensure that one thread does not manipulate shared resources while another thread is already manipulating them. A shared resource is shared between all threads.

void task(
    const std::string& key,
    const unsigned short& data,
    std::mutex& mutex,
    std::map<std::string, unsigned short>& container
) {
    std::lock_guard<std::mutex> guard(mutex); // seg fault without this line
    container[key] = data;
}

std::mutex mutex;
std::map<std::string, unsigned short> container;

constexpr unsigned short FIRST_DATA {10};
std::thread(
    task,
    "first_key",
    FIRST_DATA,
    std::ref(mutex),
    std::ref(container)
);

constexpr unsigned short SECOND_DATA {100};
std::thread(
    task,
    "second_key",
    SECOND_DATA,
    std::ref(mutex),
    std::ref(container)
);

Condition variables (`std::condition_variable`)

An example can be found in condition_variable.

A condition variable is used to notify one or many threads from one thread.

How to use them ?

void firstThreadFunction(
    std::condition_variable& cv,
    std::mutex& mutex
) {

    /* execute some code... */

    cv.notify_one(); // the second thread is notified

    /* execute some code... */
}

void secondThreadFunction(
    std::condition_variable& cv,
    std::mutex& mutex
) {

    /* execute some code */

    std::unique_lock<std::mutex> lock(mutex);
    cv.wait(lock); // wait until the notification from the first thread

    /* execute some code... */
}

int main() {

    std::condition_variable cv;
    std::mutex mutex;

    std::thread firstThread(
        firstThreadFunction,
        std::ref(cv),
        std::ref(mutex)
    );

    std::thread secondThread(
        secondThreadFunction,
        std::ref(cv),
        std::ref(mutex)
    );

    firstThread.join();
    secondThread.join();
}

Even if this code is operational, it has some weaknesses:

the second thread may indefinitely wait if the first thread sends the notification before the second thread executes the wait method,
the wait() method fails to account for spurious wakeups (automatic weakups from the processor, often from the implementation for safety reasons) The wait() method should use its lambda to check if the wakeup really comes from the other thread.

`std::lock_guard` vs `std::unique_lock`

They both do the same thing (take a mutex and lock it).

std::lock_guard cannot unlock the mutex until the end of the block (destruction of the object).

std::unique_lock can be locked and unlocked during the execution. They provide more features than std::lock_guard (like RAII construction) but are more expensive to use.

`std::shared_future` vs `std::future`

Call the function get on std::future moves the object from the shared state to the new address.

Call the function get on std::shared_future copies the object from the shared state to the new address.

`std::atomic` usage

std::atomic guarantees that operations on a given variable are atomic (from the other threads point of view).

For example:

void threadFunction(int& value) { // replaced by std::atomic<int>&

    value += 20;

    value -= 15;
}

int main() {

    int value {10}; // replaced by std::atomic<int>

    std::thread firstThread(
        threadFunction,
        std::ref(value)
    );

    std::thread secondThread(
        threadFunction,
        std::ref(value)
    );
}

With the code above, there is no guarantee about the final result of value. In fact, at the line value += 20, a read process is performed on value. During this read process, the two threads may read simultaneously the value 20 as the action is not atomic.

By replacing int by std::atomic<int>, we have a guarantee that read and modify value at the line value += 20 is one and unique action. A thread won't be able to read value until the number 20 is not added to it by another thread.

Furthermore, using std::atomic<int>, there is a guarantee that the instructions value += 20 and value -= 15 will be executed in the specified order. This is not guarantee when we simply declare int.

`volatile`

volatile variables are not optimized by the compiler.

For example, when we write this:

int value {5};
value = 10;

The compiler may directly understand the instruction int value {10} instead.

This optimization is not performed when using volatile:

volatile int value {5};
value = 10;

volatile is used for variable that contains sensors content, IO content...

The following is also correct:

volatile std::atomic<int> value {10};

Check threads(s) amount of a running process

ps -e -T | grep cpp-concurrency

jean553 / cpp-concurrency Goto Github PK

cpp-concurrency's Introduction

cpp-concurrency

Compilation

Execution

CMake

Concurrency theory

Threads

Hardware threads vs Software threads

Basic concurrency with threads

join vs detach

RAII Threads

Tasks

Simple tasks usage

Oversubscription problem

Work stealing

std::launch::async vs std::launch::deferred

Thread destruction

thread_local variables

std::mutex and std::lock_guard

Condition variables (std::condition_variable)

How to use them ?

std::lock_guard vs std::unique_lock

std::shared_future vs std::future

std::atomic usage

volatile

Check threads(s) amount of a running process

cpp-concurrency's People

Contributors

Watchers

Recommend Projects

Recommend Topics

Recommend Org