ibm / pycloudmessenger Goto Github PK

This repository includes sample code showing how to interact with messaging based services provided by IBM Research Ireland.

License: Apache License 2.0

Python 62.96% Makefile 0.38% Shell 0.82% HTML 35.84%

iot python3 mqtt-client amqp-client rabbitmq messaging

pycloudmessenger's Introduction

pycloudmessenger

The purpose of this project is to provide sample code for interacting with various messaging based cloud platforms provided by IBM Research Europe - Dublin.

Prerequisites

It is assumed that all development takes place in Python, using at least version 3.6.

Testing

Unit tests are contained in the tests directory.

To run the unit tests, a local RabbitMQ container is launched automatically. Settings and credentials to match the latest RabbitMQ docker image are also provided. To run the test:

creds=local.json make test

Examples

Sample code for basic messaging as well as federated learning and castor are contained in the examples directory. To run various samples, invoke the appropriate make target, as follows.

# The basic messaging sample
creds=local.json make basic

# The federated learning sample (online, requires cloud credentials)

python -m examples.ffl.register --credentials=<CLOUDCREDENTIALS> --user=<USER> --password=<PASSWORD> > credentials.json
python -m examples.ffl.sample --credentials=credentials.json
python -m examples.ffl.deregister --credentials=credentials.json

# The castor sample
creds=credentials.json make castor

Note: For online platforms, <CLOUDCREDENTIALS> must be available. Please request from the IBM team.

References

[IBM Research Blog](https://www.ibm.com/blogs/research/2018/11/forecasts-iot/)
[Castor: Contextual IoT Time Series Data and Model Management at Scale](https://arxiv.org/abs/1811.08566) Bei Chen, Bradley Eck, Francesco Fusco, Robert Gormally, Mark Purcell, Mathieu Sinn, Seshu Tirupathi. 2018 IEEE International Conference on Data Mining (ICDM workshops).

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 824988. https://musketeer.eu/

pycloudmessenger's People

Contributors

Stargazers

Watchers

Forkers

mkpurcell kant giulia987 jdsheehan fearghalodonncha mathsinn fuscof85 sagremarisco marcosimioni minhitbk bradleyjeck cclauss bhaskers-blu-org1 stefano81 erdal-pb soham9999 ghas-results

pycloudmessenger's Issues

The participant list in the aggregator is duplicated

Bug in Local platform:
The participant list in the aggregator is duplicated since it is updated everytime we call the receive and it contains a Notification.participant_joined but also when calling the method get_participants. I have created a task with 2 users (Marcos and Jaime) and this is the print from the ML library after calling get_participants:
Participants: ['Marcos', 'Jaime', 'Marcos', 'Jaime']

Missing the option of sending a message to a specific user from the aggregator

This version of the communications library is missing the option of sending a message to a specific user from the aggregator. We need this in order to implement algorithms under the POM3.

Task additional user info

Summary

Make sure you know which tasks the user is participating in and which tasks he is creator of.

Justification

So that you can filter and know which tasks a user is involved in.

Example

We've identified two solutions:

Make sure that within the task model you know which tasks the user is participating in and the ones he is creator of: by name/id or boolean (the model could also be a DTO)
Have a get_tasks_by_user method to return a lists of tasks for which the user is a participant or aggregator

Non-blocking receive functions

E.g. modify the receive methods for aggregator/participants to return a 204 code in case the corresponding queues were empty. With this control can be given back to the aggregator/participant part of the algorithm and avoid blocking the execution.

Additional task status

Summary

It would be helpful to have an intermediate status between CREATED and STARTED.

Justification

When a new task is added to the available ones it gets the CREATED status. As soon as its owner decides
to start the aggregation on the background the master waits for the workers number to reach the task quorum
in order to start the federated learning. Nevertheless, during this time the task status remains the same,
changing to STARTED only when the actual task execution starts. In this way, there is no way for other users
to understand whether that task owner has started the aggregation or not, getting an error if they choose
to join a task which has just been created and not yet aggregated.

"Must have" functionality

The additional task status (something like "PENDING") would have multiple benefits:

Aggregator receives a real feedback after the aggregation by checking the task status.
All the "participant" calls on tasks which are not waiting for workers are avoided.
Improved browsing on the tasks list.

Error classes

Summary

Define error classes.

Justification

In order to capture and handle in a custom way the error that occur.

Example

fflapi.py:

class ServerError(ValueError):
pass

class BadRequestError(ValueError):
pass

…
if 'error' in result:
raise ServerError(result['error'])

if 'calls' not in result:
raise BadRequestError("Malformed object: " + str(result))

controller.py:

from pycloudmessenger.ffl.fflapi import ServerError
from pycloudmessenger.ffl.fflapi import BadRequestError

@app.errorhandler(ServerError)
def server_error(error):
return jsonify({'message': str(error)}), 500, {'ContentType': 'application/json'}

@app.errorhandler(BadRequestError)
def server_error(error):
return jsonify({'message': str(error)}), 400, {'ContentType': 'application/json'}

Receive method still gets blocked and never returns a timeout exception

Bug in Local platform:
The receive method for both the aggregator and the participant still get blocked and never returns a timeout exception when nothing is received within the fixed time.

Task model enrichment

Summary

Add this information at task level:
- task description
- the name of the task owner
- the POM
- algorithm name
- the quorum
- a “rewards” field to define how the task is rewarded
- a “datasets” field to define the topology and dataset description to be used
- the number or list of participants who have joined the task

Justification

Add useful information for the user who wants to join a task. All of this information should be set by the task creator.

"Must have" functionality

All of these information are stored during the task creation and retrievable by any user (ffl.get_tasks() method); some information may only be retrievable by the task creator.

Example

A JSON model example (we can further discuss about some of this information):

{
"task_name": "Test",
"status": "CREATED",
"description": "Task test",
"added": "2020-01-20T12:00:00",
"topology": "STAR",
"participants": 4,
"owner": "owner_name",
"model_type": "Kmeans",
"POM": 1,
"quorum": 5,
"reward": "collaborative"
"datasets": [
{
"type": "image",
"description": "High-res images depicting people",
"features": {
"extension": "png",
"maxHeight": 16000000,
"maxWidth": 16000000
}
},
{
"type": "tabular",
"description": "Data about people",
"sample": [
"Lucas",
35
],
"features": {
"extension": "csv",
"size": 223000000,
"columns": [
{
"name": "Name",
"type": "string",
"description": "Person first name",
"isLabel": false
},
{
"name": "Age",
"type": "number",
"description": "Person age",
"isLabel": true
}
]
}
}
],
"definition": {
"NC": 2,
"maxIterations": 100
}
}

Delete task method

Summary

Add a "delete_task" method inside fflapi.py

Justification

A user can remove its own tasks if he wants.

"Must have" functionality

Only the user who created his own task can remove it.

Sending of non-JSON serializable messages

Summary

Trying to send certain objects results in an error Object of type ... is not JSON serializable.
Could those be serialized using an encoding like

base64.b64encode(pickle.dumps(x)).decode('utf-8')

Avoid conversion of numpy array messages into lists

Currently, when an numpy array is part of a sent message, the receiver will receive it as list. Could such implicit transformation be avoided?

A participant with the same name can join a task multiple times

Bug in Local platform:
Currently, a participant with the same name can join a task multiple times, this is not allowable since it can break the logic of the platform, but it is currently assumed to let users to control this issue. However, this issue should be resoloved by the platform itself.

ibm / pycloudmessenger Goto Github PK

pycloudmessenger's Introduction

pycloudmessenger

Prerequisites

Testing

Examples

References

pycloudmessenger's People

Contributors

Stargazers

Watchers

Forkers

pycloudmessenger's Issues

Summary

Justification

Example

Summary

Justification

"Must have" functionality

Summary

Justification

Example

Summary

Justification

"Must have" functionality

Example

Summary

Justification

"Must have" functionality

Summary

Recommend Projects

Recommend Topics

Recommend Org