[nanomsg] nanomsg: the big picture

From: Martin Sustrik <sustrik@xxxxxxxxxx>
To: nanomsg <nanomsg@xxxxxxxxxxxxx>
Date: Thu, 05 Sep 2013 09:07:50 +0200

Hi all,

There have been several discussions, both on the mailing list and on the IRC channel, that can't really be resolved without understanding the big picture of where nanomsg is going. There's a long email about monitoring from Paul from yesterday, but several discussions about service discovery, DNS and such are also relevant to the topic.

This should probably be written down as a more formal article; however, let me just briefly describe the vision at the moment.

From the user's perspective nanomsg is basically done. Although the implementation is not perfect yet, the conceptual framework (sockets, scalability protocols, topologies, etc.) is unlikely to change in the future.

However, from the administrative perspective almost nothing has been done yet and even the very concepts are not defined. And that's what I am trying to do here.

The main idea is strictly separating the user API from the admin API, or mechanism from policy, if you will.

To get the idea, think of TCP. The user establishes the connection, then sends and receives data, but is completely ignorant of the underlying network infrastructure. Is there a simple cable between the two boxes? Is there a LAN? Are 10 IP hops involved? Has an intermediary IP router crashed somewhere on the path and has it been routed around? The user never knows.

Now there are admins who administer the network. They see all these components and issues and work hard to keep the whole thing working. However, the point is that they don't do that directly via IP or TCP. They use specialised administrative interfaces such as SNMP.

So, in the end you have two distinct APIs and two clearly delineated sets of users: programmers and admins. The former write business logic, the latter take care that the infrastructure is working.


Let's apply the above to nanomsg now.

The idea would be to shield the user from the details of the topology the same way as the TCP user doesn't see all the routers on the path. This can be done by the user connecting to a topology ("market data feed"), rather than to a specific endpoint ("129.168.0.111:5555"):


    nn_connect (s, "topology://market-data-feed");

I am not going to discuss implementation details here, but the idea is that admins store the actual info about the topology setup in a distributed database (such as DNS) and that the library translates "topology://market-data-feed" into the actual endpoint(s) to connect to by querying the database.
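
To make the translation step concrete, here is a minimal sketch in C. It is only an illustration under stated assumptions: the in-memory `records` table stands in for the distributed database, and `resolve_topology`, `struct topology_record` and every endpoint other than the one quoted above are hypothetical names invented for this example, not part of nanomsg.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for the distributed database (e.g. DNS):
   each row maps a topology name to one concrete endpoint. */
struct topology_record {
    const char *name;
    const char *endpoint;
};

static const struct topology_record records[] = {
    {"market-data-feed", "tcp://129.168.0.111:5555"},
    {"market-data-feed", "tcp://129.168.0.112:5555"},  /* made-up second endpoint */
    {"order-entry",      "tcp://129.168.0.120:6666"},  /* made-up topology */
};

#define TOPOLOGY_SCHEME "topology://"

/* Translate "topology://<name>" into concrete endpoints.
   Stores up to `max` endpoint strings in `out` and returns how many were
   stored; returns -1 if the URL does not use the topology scheme. */
static int resolve_topology(const char *url, const char **out, int max)
{
    assert(url != NULL && out != NULL);

    size_t scheme_len = strlen(TOPOLOGY_SCHEME);
    if (strncmp(url, TOPOLOGY_SCHEME, scheme_len) != 0)
        return -1;

    const char *name = url + scheme_len;
    int found = 0;
    for (size_t i = 0; i < sizeof records / sizeof records[0]; ++i) {
        if (found < max && strcmp(records[i].name, name) == 0)
            out[found++] = records[i].endpoint;
    }
    return found;
}
```

In this sketch the library would then call the existing nn_connect() once for each returned endpoint; the user code still only ever mentions the topology name.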

Thus, from the user's perspective, they are connecting to a "cloud" called "market-data-feed" without being aware of its internal structure:


[back1.png]

It's up to admins to define the internal structure. During the development phase it may be something simple like:


[back2.png]

When deployed to production it may be more complex to account for administrative and geographical boundaries:


[back3.png]

The main point is: There's no difference visible to the user between the cases. This allows admins to optimise and re-structure the topology without affecting the applications.


End of chapter one.

Now forget about users and imagine you are an admin tasked with maintaining a topology. What kind of tools do you need to do your job?

First, you need a way to update the distributed database (such as DNS) so that you can configure the topology according to your needs.

Second, you need a tool to check whether the topology is working as expected.

As for the latter, the statistics from the entire topology must be collected and presented to the admin in a comprehensive way, such as:


[back4.png]

So, the admin looks at the graph above, sees there's a disconnection between two intermediary devices and can actually do something about it. Is a network connection broken? Or maybe he just set the address wrong in the DNS? Etc.
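
As a rough illustration of what such a topology-wide check could boil down to, here is a small C sketch. It assumes that some central collector has already gathered one status report per link from all the nodes; `struct link_report` and `find_broken_links` are hypothetical names made up for this example, and how the reports would actually be collected is deliberately left open.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-link report, as a central collector might receive it
   from the nodes of the topology. */
struct link_report {
    const char *from;   /* node on one side of the link */
    const char *to;     /* node on the other side */
    int connected;      /* 1 if the link is up, 0 otherwise */
};

/* Scan the collected reports and store pointers to the links reported as
   down in `out` (at most `max` of them). Returns the number stored. */
static size_t find_broken_links(const struct link_report *reports, size_t n,
                                const struct link_report **out, size_t max)
{
    assert(reports != NULL || n == 0);

    size_t found = 0;
    for (size_t i = 0; i < n && found < max; ++i) {
        if (!reports[i].connected)
            out[found++] = &reports[i];
    }
    return found;
}
```

The point of the sketch is only that the input is the set of reports for the whole topology, so the broken link between two intermediary devices shows up in one place.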

As I want to keep this email as short as possible, I won't elaborate further; however, it's easy to see the implications of the conceptual model described above. For example, the admin wants to check the topology as a whole, so querying local logs on individual machines probably won't fly.


Martin

Attachment: back1.png
Description: PNG image

Attachment: back2.png
Description: PNG image

Attachment: back3.png
Description: PNG image

Attachment: back4.png
Description: PNG image
