Mesosphere just open sourced DC/OS, one of the most promising cluster orchestration tools today. It is a perfectly timed move for the bioinformatics scientists who were looking for the glue to tie their open source tools together.
NGS Orchestration Workflows
Next Generation Sequencing data acquisition technologies have dramatically increased the amount of data collected at a much lower cost. Cervoo's solution provides a horizontally scalable stack of technologies. Cervoo's solution includes templates and ready-to-go packages that sit on top of a flexible choice of on-premise and public cloud providers. The templates contain the Docker images and Mesos frameworks for prevalent genome sequencing and data analytics tools and applications including Tophat, Bowtie, Spark, SAS and R. DC/OS accelerates and simplifies the adoption of our solution.
Datacenter Operating System (DC/OS)
Successful web-scale organizations like Google and Facebook have made tremendous strides over the last decade in how datacenters are setup and operated. DC/OS brings that hyperscale computing to the mainstream enterprises. DC/OS extends Apache Mesos to provide a standard ecosystem of frameworks and applications. It provides a very easy to use GUI and a Command Line Interface. The core system services include the distributed init, cron, service discovery, package management and installer.
What makes DC/OS the ideal tool to run the bioinformatics workflows?
Ease of use
Ability to use their software tools easily is very important for the bioinformatics scientists to focus on their core jobs and not worry about setting up and operating huge infrastructure. The cutting edge research happening in the genomics world right now needs entirely new approaches to support the big compute and big data challenges. DC/OS can help save a lot of money and headache for these bioinformatics organizations.
The DC/OS and custom frameworks combination is a natural fit for the bioinformatics scientists community. Bioinformatics scientists community inherently uses open source for the primary tool chain. Now they are moving more aggressively towards open data also and sharing at a whole another level. Open source DC/OS and custom frameworks and UI for bioinformatics is a natural requirement for scientists to adopt these solutions.
DC/OS is helping open new opportunities to build entirely new capabilities for these scientists. Two perfect storms are coming together right now, one is around the scale and velocity of changes happening in how genomics data is processed and analyzed, the other is a once in a lifetime kind of transitions happening within the IT organizations supporting these new opportunities.
Scientific reproducibility is a requirement that was tough with the manually operated infrastructure. Containers and orchestration help bring consistency to reproducing tests.
Cervoo is helping the bioinformatics organizations move to the new stack of hybrid clouds, containers, orchestration and automation. We are creating domain specific solutions for biopharma focusing initially on the Next Generation Sequencing workflows.
Bioinformatics organizations are very early in their journey to this new stack. This new simple world of microservices is an amazing place to be in but where we are as an industry includes legacy data algorithms written decades ago. We can not cut the umbilical cord and move suddenly to this new stack. Cervoo works with our customers using tools like DC/OS to define this journey, their roadmap and be a partner throughout.