BIOINFORMATICS APPLICATIONS NOTE
Vol. 24 no. 3 2008, pages 438–439 doi:10.1093/bioinformatics/btm615
Systems biology
ZFIQ: a software package for zebrafish biology Tianming Liu1,2, Jingxin Nie1,2,3, Gang Li1,2,3, Lei Guo3 and Stephen T. C. Wong1,2,* 1
The Methodist Hospital Research Institute and Department of Radiology, The Methodist Hospital, 2Weill Cornell Medical College and 3School of Automation, Northwestern Polytechnic University, China
Received on October 11, 2007; revised and accepted on December 10, 2007 Advance Access publication December 18, 2007 Associate Editor: Chris Stoeckert
ABSTRACT
1
OVERVIEW
The overall architecture of the zebrafish image quantitator (ZFIQ) platform is illustrated in Figure 1a. The major four components of the ZFIQ platform, represented by the blue boxes, include: (1) Pipeline construction environment. The ZFIQ platform allows developers or users to dynamically construct data processing and analysis pipelines in an interactive graphical environment. The pipeline construction is driven by the data quantification problem raised by the zebrafish research community. The data types depend on specific applications. Current supported data types include image data and text data. (2) Pipeline pool. Once the components and parameters of a data quantification pipeline are tested and validated, the pipeline is archived in the pipeline pool, which is the aggregation of users’ knowledge and experience of data quantification. (3) Algorithm and method repository. Currently, the algorithms and method repository has already integrated most of the data analysis routines (over 400) in the Intel OpenCV library (Gary et al., 2000), and in-house built zebrafish data quantification methods. (4) Pipeline running environment. This environment provides users with graphic tools to perform large scale of data *To whom correspondence should be addressed.
438
quantification studies, and to output the quantification results for specific statistical modeling and analysis. The ZFIQ platform runs on the Microsoft Windows system and accepts most of popular formats of input images.
2 2.1
FUNCTIONALITIES Pipeline construction environment
There are four windows in the pipeline construction environment, as shown in http://www.cbi-platform.net/document.htm. Here, a constructed pipeline for cell nuclei segmentation is used as an example. Specifically, the left window in Figure 1b lists all of the integrated computational algorithms in the ZFIQ platform. The user can drag-and-drop any of these algorithms or filters, into the middle window of pipeline construction window, and link them together into a pipeline. In particular, the ZFIQ platform provides automatic checking mechanism to make sure that the links that connects algorithm components are legal in terms of parameter types. At the same time, the execution order of the pipeline is rendered in the execution sequence window on the top right. The execution sequence can be adjusted by user’s drag-and-drop operation in this window, if there are independent executions in parallel. Finally, the parameters in the pipeline can be interactively edited in the parameter editing window, as shown in the right bottom. This functionality provides the capability to adaptively fine-tune some parameters of the pipeline for a specific dataset. In this sense, a pipeline not only means a sequence of data analysis algorithms, but a set of parameters associated with these algorithms.
2.2
Algorithm and method repository
The ZFIQ algorithm and method repository has integrated: (1) the data analysis routines (over 400 functions) in the Intel OpenCV library (Gary et al., 2000) and (2) zebrafish data quantification methods and tools developed by the CBI (Center for Biomedical Informatics) and its collaborators. The ZFIQ platform focuses on data quantification problems in the zebrafish biology community. In particular, we are interested in the following common zebrafish data quantification problems: (1) zebrafish cell quantification and pattern analysis, (2) zebrafish neuron and axon quantification, (3) zebrafish somite quantification, (4) zebrafish gene expression quantification and (5) zebrafish morphological analysis. Currently, we have integrated over 30 algorithms and tools developed by
ß The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email:
[email protected]
Downloaded from http://bioinformatics.oxfordjournals.org/ by guest on June 1, 2013
Summary: Rapid development, transparency and small size are the outstanding features of zebrafish that make it as an increasingly important vertebrate system for developmental biology, functional genomics, disease modeling and drug discovery. Zebrafish has been regarded as ideal animal specie for studying the relationship between genotype and phenotype, for pathway analysis and systems biology. However, the tremendous amount of data generated from large numbers of embryos has led to the bottleneck of data analysis and modeling. The zebrafish image quantitator (ZFIQ) software provides streamlined data processing and analysis capability for developmental biology and disease modeling using zebrafish model. Availability: ZFIQ is available for download at http://www. cbi-platform.net Contact:
[email protected] Supplementary information: Additional documentation for this software package is referred to http://www.cbi-platform.net/ document.htm. Application examples of this software are referred to http://www.cbi-platform.net/download.htm
ZFIQ
demonstration of the software platform and that the specific algorithm details and biological questions represented by the datasets are addressed in other publications (Campbell et al., 2006; Li et al., 2007a,b,c; Liu et al., 2006; Liu, 2007)
4
CONCLUSION
ACKNOWLEDGEMENTS Fig. 1. (a) Overall architecture of the ZFIQ platform. (b). An example of zebrafish nuclei segmentation result produced by ZFIQ. The algorithm is referred to Li et al. (2007b).
CBI and its collaborators into the ZFIQ platform (Li et al., 2007c; Liu et al., 2006).
2.3
Pipeline pool
So far, the ZFIQ platform has over 50 pipelines for zebrafish cell detection, zebrafish cell quantification and zebrafish neuron detection, as well as for general data manipulation and processing. Although these pipelines are already extensively tested and validated using datasets from our collaborating zebrafish research labs, other users might need to fine-tune the parameters according to their own datasets. The pipeline pool is under the folder./CBI_platform/pipelines after the installation of ZFIQ.
2.4
Pipeline running environment
The pipeline running environment provides users with different modes to run the selected pipeline, depending on the amount of dataset to be analyzed. For example, the running environment includes graphical tools that allow users to select large amounts of datasets for batch mode running using a specific pipeline. This support is particularly useful when large datasets generated from zebrafish experiments are to be processed and quantified. The analyzed results and relevant meta-data can be output to a structured database for statistical analysis and modeling of the relationship between genotypes and phenotypes. The pipeline running environment also provides users graphical tool to fine-tune the parameters specific for their zebrafish data quantification applications.
3
APPLICATION
The ZFIQ platform has been applied for zebrafish data quantification in study of Alzheimer’s disease genes using zebrafish models, zebrafish retina development and zebrafish somitogenesis. We emphasize that this article focuses on the
We would like to thank Dr Weiming Xia and Dr Jarema Malicki of Harvard Medical School and Dr Scott Holley of Yale University for providing the zebrafish datasets discussed in this article. The research was funded by the Bioinformatics Research Center Program Grant (PI: S.T.C.W.) from Harvard Neurodiscovery Center (formally known as HCNR), Harvard Medical School and The Bioinformatics Program Grant (PI: S.T.C.W.) from the Methodist Hospital Research Institute. Conflict of Interest: none declared.
REFERENCES Campbell,W.A. et al. (2006) Zebrafish lacking Alzheimer presenilin enhancer 2 (Pen-2) demonstrate excessive p53-dependent apoptosis and neuronal loss. J. Neurochem., 96, 1423–1440. Gary,R.B. and Vadim,P. (2000) Intel’s computer vision library: applications in calibration, stereo, segmentation, tracking, gesture, face and object recognition. Proceeding of IEEE Computer Vision and Pattern Recognition. Hilton Head Island, South Carolina. Peng,H. et al. (2007a) Automatic image analysis for gene expression patterns of fly embryos. BMC Cell Biol., 8 (Supp.1), S7. Peng,H. et al. (2007b) Straightening C. elegans images. Bioinformatics, Published online on 19 November, 2007, doi: 10.1093/bioinformatics/btm569. Holley,S.A. et al. (2000) Control of her1 expression during zebrafish somitogenesis by a Delta-dependent oscillator and an independent wavefront activity. Genes Dev., 14, 1678–1690. Li,G. et al. (2007a) Detection of blob objects in microscopic Zebrafish images based on gradient vector diffusion. Cytometry A, 71, 835–845. Li,G. et al. (2007b) 3D cell segmentation based on gradient flow tracking, accepted with revision. BMC Cell Biol., 8, 40. Li,G. et al. (2007c) Segmentation of touching cell nuclei using gradient flow tracking. J. Microsc., in press. Liu,T et al. (2006) Computerized image analysis for quantitative neuronal phenotyping in zebrafish. J. Neurosci. Methods, 153, 190–202. Liu,T. (2007) A quantitative zebrafish phenotyping tool for developmental biology and disease modeling. IEEE Signal Proc. Mag., 24, 126–129. Patton,E.E. and Zon,L.I. (2001) The art and design of genetic screens: zebrafish. Nat. Rev. Genet., 2, 956–966. Peterson,R.T. et al. (2004) Chemical suppression of a genetic mutation in a zebrafish model of aortic coarctation. Nat. Biotechnol., 22, 595–599. Pujic,Z. et al. (2006) Reverse genetic analysis of neurogenesis in the zebrafish retina. Dev. Biol., 293, 330–347.
439
Downloaded from http://bioinformatics.oxfordjournals.org/ by guest on June 1, 2013
Developing a data analysis system that can deal with the diversity of zebrafish datasets and data quality is a challenge (Holley et al., 2000; Patton and Zon, 2001; Peterson et al., 2004; Pujic et al., 2006). The development of the ZFIQ software package aims to fill the void. By building ZFIQ on top of the public domain OpenCV platform, we welcome the computation community to contribute their data quantification pipelines to the ZFIQ, and, in turn, expect the zebrafish research community to contribute their quantification problems and associated datasets to the development. The growth of the pipeline pool of the ZFIQ package would depend on the fruitful interaction of the two communities. In addition, the development of ZFIQ would be complementary to available tools for other model systems such as Caenorhabditis elegans and fly (Peng et al., 2007a,b)