SB18:  Globus Case Study - GADU: A High-throughput Computational System for Genome Analysis
09/13/2006, 4:45 PM - 5:30 PM

Speaker:
Natalia Maltsev, Computational Biologist, Argonne National Lab.

During the past decade, the scientific community has witnessed the rapid accumulation of gene sequence data and data related to physiology and biochemistry of organisms. Bioinformatics tools used for efficient and computationally intensive analysis of genetic sequences require large-scale computational resources to accommodate the growing data. Grid computational resources such as the Open Science Grid and TeraGrid have proved useful for scientific discovery. GADU (Genome Analysis and Database Update), developed by the Bioinformatics group at Argonne National Laboratory, is a high-throughput computational system to automate the steps involved in accessing the Grid resources for running bioinformatics applications. It is a resource independent system that can run jobs simultaneously on different Grids such as Open Science Grid (OSG) and TeraGrid. The applications are executed as workflows that are expressed and controlled by a "virtual data" model which transparently maps computational workflows to distributed Grid resources. Grid technologies such as Globus, Condor, and the Griphyn Virtual Data System are used in the development of GADU. GADU has been widely used for the computational requirements of applications such as PUMA2, GNARE, Chisel, and others.


© 2008 IDG WORLD EXPO CORP. ALL RIGHTS RESERVED