International Centers of Excellence for Malaria Research: Background, Progress, and Ongoing Activities
  • ISSN: 0002-9637
  • E-ISSN: 1476-1645



Data generated during the course of research activities carried out by the International Centers of Excellence for Malaria Research (ICEMR) is heterogeneous, large, and multi-scaled. The complexity of federated and global data operations and the diverse uses planned for the data pose tremendous challenges and opportunities for collaborative research. In this article, we present the foundational principles for data management across the ICEMR Program, the logistics associated with multiple aspects of the data life cycle, and describe a pilot centralized web information system created in PlasmoDB to query a subset of this data. The paradigm proposed as a solution for the data operations in the ICEMR Program is widely applicable to large, multifaceted research projects, and could be reproduced in other contexts that require sophisticated data management.

[open-access] This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.


Article metrics loading...

Loading full text...

Full text loading...



  1. Schadt EE, Linderman MD, Sorenson J, Lee L, Nolan GP, , 2010. Computational solutions to large-scale data management and analysis. Nat Rev Genet 11: 647657.[Crossref]
  2. Rao M, , 2012. The International Centers of Excellence for Malaria Research. Acta Trop 121: 157.[Crossref]
  3. Wright A, Sittig DF, , 2008. SANDS: a service-oriented architecture for clinical decision support in a National Health Information Network. J Biomed Inform 41: 962981.[Crossref]
  4. Kashyap V, Morales A, Hongsermeier T, , 2006. On implementing clinical decision support: achieving scalability and maintainability by combining business rules and ontologies. AMIA Annu Symp Proc 2006: 414418.
  5. Megy K, Emrich SJ, Lawson D, Campbell D, Dialynas E, Hughes DS, Koscielny G, Louis C, Maccallum RM, Redmond SN, Sheehan A, Topalis P, Wilson D, VectorBase Consortium; , 2012. VectorBase: improvements to a bioinformatics resource for invertebrate vector genomics. Nucleic Acids Res 40: D729D734.[Crossref]
  6. Aurrecoechea C, Brestelli J, Brunk BP, Dommer J, Fischer S, Gajria B, Gao X, Gingle A, Grant G, Harb OS, Heiges M, Innamorato F, Iodice J, Kissinger JC, Kraemer E, Li W, Miller JA, Nayak V, Pennington C, Pinney DF, Roos DS, Ross C, Stoeckert CJ, Jr Treatman C, Wang H, , 2008. PlasmoDB: a functional genomic database for malaria parasites. Nucleic Acids Res 37 (Suppl 1): D539D543.
  7. Bauer S, , 2008. Mining data, gathering variables and recombining information: the flexible architecture of epidemiological studies. Stud Hist Philos Biol Biomed Sci 39: 415428.[Crossref]
  8. Cassell EJ, , 2000. The principles of the Belmont report revisited: how have respect for persons, beneficence, and justice been applied to clinical medicine? Hastings Cent Rep 30: 1221.[Crossref]
  9. Vollmer SH, Howard G, , 2010. Statistical power, the Belmont report, and the ethics of clinical trials. Sci Eng Ethics 16: 675691.[Crossref]
  10. Gallin JI, Ognibene FP, , 2012. Principles and Practice of Clinical Research. Waltham, MA: Academic Press.
  11. Marcus DS, Harms MP, Snyder AZ, Jenkinson M, Wilson JA, Glasser MF, Barch DM, Archie KA, Burgess GC, Ramaratnam M, Hodge M, Horton W, Herrick R, Olsen T, McKay M, House M, Hileman M, Reid E, Harwell J, Coalson T, Schindler J, Elam JS, Curtiss SW, Van Essen DC, , 2013. Human Connectome Project informatics: quality control, database services, and data visualization. Neuroimage 80: 202219.[Crossref]
  12. Gomaa H, , 2011. Software Modeling and Design: UML, Use Cases, Patterns, and Software Architectures. New York, NY: Cambridge University Press.[Crossref]
  13. Prlić A, Procter JB, , 2012. Ten simple rules for the open development of scientific software. PLoS Comput Biol 8: e1002802.[Crossref]
  14. Sandve GK, Nekrutenko A, Taylor J, Hovig E, , 2013. Ten simple rules for reproducible computational research. PLoS Comput Biol 9: e1003285.[Crossref]
  15. Osborne JM, Bernabeu MO, Bruna M, Calderhead B, Cooper J, Dalchau N, Deane C, , 2014. Ten simple rules for effective computational research. PLoS Comput Biol 10: e1003506.[Crossref]
  16. Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg LJ, Eilbeck K, Ireland A, Mungall CJ, OBI Consortium Leontis N, Rocca-Serra P, Ruttenberg A, Sansone SA, Scheuermann RH, Shah N, Whetzel PL, Lewis S, , 2007. The OBO foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol 25: 12511255.[Crossref]
  17. Brinkman RR, Courtot M, Derom D, Fostel JM, He Y, Lord P, Malone J, Parkinson H, Peters B, Rocca-Serra P, Ruttenberg A, Sansone SA, Soldatova LN, Stoeckert CJ, Jr Turner JA, Zheng J, OBI consortium; , 2010. Modeling biomedical experimental processes with OBI. J Biomed Semantics 1 (Suppl 1): S7.[Crossref]
  18. Dugan VG, Emrich SJ, Giraldo-Calderón GI, Harb OS, Newman RM, Pickett BE, Schriml LM, Stockwell TB, Stoeckert CJ, Jr Sullivan DE, Singh I, Ward DV, Yao A, Zheng J, Barrett T, Birren B, Brinkac L, Bruno VM, Caler E, Chapman S, Collins FH, Cuomo CA, Di Francesco V, Durkin S, Eppinger M, Feldgarden M, Fraser C, Fricke WF, Giovanni M, Henn MR, Hine E, Hotopp JD, Karsch-Mizrachi I, Kissinger JC, Lee EM, Mathur P, Mongodin EF, Murphy CI, Myers G, Neafsey DE, Nelson KE, Nierman WC, Puzak J, Rasko D, Roos DS, Sadzewicz L, Silva JC, Sobral B, Squires RB, Stevens RL, Tallon L, Tettelin H, Wentworth D, White O, Will R, Wortman J, Zhang Y, Scheuermann RH, , 2014. Standardized metadata for human pathogen/vector genomic sequences. PLoS One 9: e99979.[Crossref]
  19. Crompton PD, Kayala MA, Traore B, Kayentao K, Ongoiba A, Weiss GE, Molina DM, Burk CR, Waisberg M, Jasinskas A, Tan X, Doumbo S, Doumtabe D, Kone Y, Narum DL, Liang X, Doumbo OK, Miller LH, Doolan DL, Baldi P, Felgner PL, Pierce SK, , 2010. A prospective analysis of the Ab response to Plasmodium falciparum before and after a malaria season by protein microarray. Proc Natl Acad Sci USA 107: 69586963.[Crossref]
  20. Sama W, Dietz K, Smith T, , 2006. Distribution of survival times of deliberate Plasmodium falciparum infections in tertiary syphilis patients. Trans R Soc Trop Med Hyg 100: 811816.[Crossref]
  21. Johnston GL, Smith DL, Fidock DA, , 2013. Malaria's missing number: calculating the human component of R0 by a within-host mechanistic model of Plasmodium falciparum infection and transmission. PLoS Comput Biol 9: e1003025.[Crossref]
  22. Deroost K, Opdenakker G, Van den Steen PE, , 2014. MalarImDB: an open-access literature-based malaria immunology database. Trends Parasitol 30: 309316.[Crossref]
  23. Sankoh O, Byass P, , 2012. The INDEPTH Network: filling vital gaps in global epidemiology. Int J Epidemiol 41: 579588.[Crossref]
  24. Corsi DJ, Neuman M, Finlay JE, Subramanian SV, , 2012. Demographic and health surveys: a profile. Int J Epidemiol 41: 16021613.[Crossref]
  25. Achidi EA, Agbenyega T, Allen S, Amodu O, Bojang K, Conway D, Williams T, , 2008. A global network for investigating the genomic epidemiology of malaria. Nature 456: 732737.[Crossref]
  26. Moyes CL, Temperley WH, Henry AJ, Burgert C, Hay SI, , 2013. Providing open access data online to advance malaria research and control. Malar J 12: 161.[Crossref]

Data & Media loading...

Supplementary Data

Supplemental material

  • Received : 02 Jan 2015
  • Accepted : 01 Jul 2015

Most Cited This Month

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error