• research workflow scheduling
    at genomics scale


Eagle has integrated highly efficient open source code (Sanger/EBI) in a new fully tested architecture. The product uses the Docker container technology to deliver market-leading efficiencies in parallel computation pipelines on any distributed, local or cloud infrastructure.

How it works

e[hive] coordinates “runnables” – validated processing code wrapped in an appropriate container. Any piece of analytics code can be wrapped into a validated runnable component. Runnables from an ever-growing library can be combined to create complicated workflows to support analysis in genomics, metagenomics, data visualisation etc. The platform supports sharing, versioning and a wealth of monitoring metadata necessary for experimental reproducibility (e.g. execution time, errors, user information and performance).

Business Benefits

Promotes collaboration

Supports collaboration and sharing of advanced analytics workflows – libraries of components and dockerised images of fully functional analytical virtual machines can be created, stored and used on demand.

Reduces time to discovery

Unlimited scalability coupled with data discovery and interactive reporting reduces “time to insights” by two order of magnitude.

Drastically reduced infrastructure costs

Autonomously managed analytics resource that ensures minimal computing costs without any infrastructure maintenance or installation investment. e[hive] automate resource instantiation and termination, even to the point where the application itself autonomously manages this process (no manual intervention required).

Streamline integration of any analytics applications

Efficient integration of analytics or knowledge management software applications: These include genomics, metagenomics pipelines, machine learning and modelling, bioinformatics, cheminformatics, visualisation and reporting.

Module Features

Real-time monitoring capabilities » Computing resource, error tracking and resource allocation is tracked using Kibana dashboards. Running, completed jobs and any errors can all be quickly identified.

Uses docker images » Our propriety Docker Swarm Orchestrator engine builds on top of existing proven Docker technologies, such as Docker Machine and Docker Swarm to add further functionality such as elastic cluster scalability.

Enterprise development of open source code » based on high quality open source code, internal or external teams can develop and test re-usable workflow units (runnables) on GitHub. All dependencies and code is automatically tested and deployed periodically.

Modern architecture » secure, scalable resource management compatible with a variety of data aware enterprise environments.

Our Smart Data Management Platform

e[hive] is part of our proprietary software suite that bridges the entire process from the medical data through to insight.

automated data scientist logo