This repository was archived by the owner on Oct 8, 2019. It is now read-only.
Implement general Parameter Server #332
A parameter server is a framework for asynchronously sharing parameters among machine learning workers to achieve higher scalability. Hivemall currently has a standalone server implementation, called a MIX server, that asynchronously averages parameters among workers for internal use only. To make the MIX server more general, we plan to implement parameter-server functionality (e.g., cluster-manager support, optimizers that compute deltas from gradients to update parameters, and RPC protocols that third-party libraries can use) on top of this implementation.
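To make the averaging behavior concrete, here is a minimal sketch of the MIX-style idea: each worker sends its local weight for a feature, and the server keeps a running average that workers can blend back in. Class and method names (`AveragingServer`, `mix`) are illustrative assumptions, not Hivemall's actual API.

```java
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of MIX-style asynchronous parameter averaging.
// Workers call mix(featureId, localWeight); the server accumulates the
// contributions and returns the current global average for that feature.
class AveragingServer {
    private static final class Accumulator {
        double sum;
        long count;
    }

    private final ConcurrentHashMap<String, Accumulator> params =
            new ConcurrentHashMap<>();

    // Called concurrently by workers; returns the running average so the
    // caller can blend it into its local model.
    double mix(String featureId, double localWeight) {
        Accumulator acc = params.computeIfAbsent(featureId, k -> new Accumulator());
        synchronized (acc) {
            acc.sum += localWeight;
            acc.count++;
            return acc.sum / acc.count;
        }
    }
}
```

The real MIX server batches and cancels requests over the network; this sketch only shows the averaging contract itself.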
We have started some work as a first step:
- Support cluster managers. The MIX server implementation currently supports a standalone mode, and a cluster of MIX servers can be started through the start-up script. For easier operability, it would be good to deploy MIX servers via cluster managers such as Apache Hadoop YARN and Apache Mesos. We are working on a YARN integration in [WIP] Add codes to launch MIX servers through a yarn cluster manager #236, Launch MIX servers through a yarn cluster manager #246, and the topic branch.
- Incorporate optimizer functionality into the MIX server. Hivemall has optimizer functionality in the core package, so we will separate it out in Separate optimizer implementations from the core package #285 and then import it in both core and mixserv.
- Define RPC protocols for general use. There is some prior work (e.g., Making MixServer's serialization pluggable #147), but this interoperability issue is still open.
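For the optimizer item above, the essential contract is turning a gradient into a delta that the server applies to a parameter. Below is a minimal sketch of such an abstraction, assuming plain SGD; the interface and class names (`Optimizer`, `Sgd`) are hypothetical and do not reflect Hivemall's actual optimizer package.

```java
// Hypothetical optimizer abstraction: an optimizer converts a gradient
// into a delta that the parameter server applies to the stored weight.
interface Optimizer {
    double computeDelta(double gradient);
}

// Plain SGD as the simplest instance: delta = -learningRate * gradient.
final class Sgd implements Optimizer {
    private final double learningRate;

    Sgd(double learningRate) {
        this.learningRate = learningRate;
    }

    @Override
    public double computeDelta(double gradient) {
        return -learningRate * gradient;
    }
}
```

Moving such an interface server-side is what would let the MIX server apply adaptive updates (e.g., AdaGrad-style state per parameter) instead of only averaging.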
This ticket is for tracking activities related to parameter servers; please feel free to leave comments and advice here.