Revision 19 (Peter Amstutz, 02/12/2014 02:47 PM) → Revision 20/25 (Jonathan Sheffi, 02/13/2014 03:12 PM)

h1. History 

 In 2006, researchers at "Dr. George Church's Lab":http://arep.med.harvard.edu/ at Harvard Medical School began work on the "Personal Genome Project.":http://www.personalgenomes.org/ The PGP collects and publishes whole genome sequencing, environmental, and trait data from individuals who have openly consented to have their data shared on the internet under an IRB-approved study. The vision was to publish 100,000 genomes at the Harvard project and to help dozens of other projects launch around the world. From the beginning, the team envisioned storing data in data centers around the world, which would need to be federated and shared.

 "Alexander Wait Zaranek PhD (Sasha)":http://openwetware.org/wiki/User:Alexander_Wait_Zaranek became Director of Informatics for the project and began developing an informatics platform that could accomplish the goals of the PGP, drawing on the best thinking from Google and other organizations that work with petabyte- and exabyte-scale data sets distributed across data centers.

 Sasha worked with Tom Clegg and Ward Vandewege to design the system, build a prototype, and present a paper describing the approach at the 2008 USENIX Annual Technical Conference: "Free Factories: Unified Infrastructure for Data Intensive Web Services.":http://www.ncbi.nlm.nih.gov/pubmed/20514356   

 table{float:right;border:0;margin-left:2em;}. 
 |{border:0}. !Harvard_PGP_cluster_2012_sm.jpg! 
 _One of the Harvard PGP Clusters_| 

 Sasha, Tom, Ward and other engineers have continued to build on the prototype presented in the paper. Free Factories currently runs two clusters at Harvard Medical School that power the PGP. Together these clusters provide storage and computational resources for 300TB of data. 

 In 2010, Sasha, Tom, and Ward, along with Zen Chu and Dr. Joe Thakuria, started a company, Clinical Future, to drive broader adoption of the technology they had developed for the PGP.

 During 2012, the team began reworking the API, evaluating requirements across other labs, and designing the next generation of the system. In 2013, Free Factories was renamed "Arvados", and the new open source project was officially announced at Bio-IT World on April 11, 2013.

 In December 2013, the company sponsoring Arvados development was renamed from "Clinical Future" to "Curoverse".

 h3. Where is the name from?  

 Arvados is a combination of "Arvada III":http://en.memory-alpha.org/wiki/Arvada_III, the planet from Star Trek: The Next Generation where Dr. Beverly Crusher was inspired to become a doctor, and "Orvos", the Hungarian word for doctor, which was an older code name we used for the project.