When searching around for things that can be blogged for Zeppelin got this idea of sharing a feature of Zeppelin. The intention of the blog is to brief the feature and highlight its availability and potential features of it.
Zeppelin being a wonderful tool with opensource environment and plug able back end data processing, provides the users the privilege to play with data interactively. But when it comes to data science it would be more interesting if the data being played around is being shared amongst multiple teams and multiple people. Zeppelin Hub is one such feature that facilitates this.
Sharing Notebook : When the real need comes to us to share the data amongst teams/people and the data needs to be public, zeppelin hub enables you to share Apache notebook either public or privare and share them with specific team of people. Apart from notebooks, zeppelin hub space can also be created where you can privately share notebooks only to team having an access permission,leave some comments in your notebook for the dynamic team work.
We can think of media’s National Election polls results as a typical scenario for such a need, where they query in data in various different perspective and present them to debate.
Multitenancy: One more advantage from an organisation perspective could be,it is not feasible to make all users install and configure Apache Zeppelin. With single URL, all users go to get their own, containerised version of Apache Zeppelin, utilising resources of the shared cluster. Multitenancy makes better user of Apache Zeppelin with Zeppelin Hub easier in multi-user environment.
This feature helps in the better utilisation of resources amongst users.
Few more features of Zeppelin Hub includes,
Copying of data: Sometimes, you might want to have the shared notebooks in your own Apache Zeppelin instances since you need to modify or just run them. If so, copy the notebooks located in the Spaces to your own instances. It can be done also between other instances.
Creating composers: Sometimes you might want to extract some paragraphs located in different notebooks. In this case, you can create a new notebook using Composer feature in Zeppelinhub.
Dynamically Replicate notebook : Install Apache Zeppelin and create an account for Zeppelin hub, you can generate a token for your Zeppelin instance. Then you can seamlessly connect Zeppelin hub with Apache Zeppelin.
On a brief note, apart from using Zeppelin as a individual tool for data analysis,Zeppelin hub can be a used as a tool for use in a collaborative and multi user environment.