International Centers of Excellence for Malaria Research: Background, Progress, and Ongoing Activities
  • ISSN: 0002-9637
  • E-ISSN: 1476-1645



Data generated during research activities carried out by the International Centers of Excellence for Malaria Research (ICEMR) are heterogeneous, large, and multi-scaled. The complexity of federated, global data operations and the diverse uses planned for the data pose tremendous challenges and opportunities for collaborative research. In this article, we present the foundational principles for data management across the ICEMR Program, outline the logistics associated with multiple aspects of the data life cycle, and describe a pilot centralized web information system created in PlasmoDB to query a subset of these data. The paradigm proposed as a solution for data operations in the ICEMR Program is widely applicable to large, multifaceted research projects and could be reproduced in other contexts that require sophisticated data management.

This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.







  • Received: 02 Jan 2015
  • Accepted: 01 Jul 2015
  • Published online: 02 Sep 2015
