Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo Jeong(jeongk@konkuk.ac.kr) Applied Grid Computing Center Department of Advanced Technology Fusion Konkuk University, Seoul, Korea Motivations Two Application Projects – KOCED. TeleScience and Data Sharing Environments for Civil Engineering Research in Korea – Glyco-MGrid. Collaborative Molecular Simulation Grid Environments for Glycomics Required to support scientific data management – Multiple data models, but not many. They may be changed and new models may be added, but not frequently – Legacy data (just files)and analysis software (just code with input and output files), but they are kind of simple Conventional Systems – DBMS-based Systems. Too inflexible – Semantic Web Systems. Too advanced and complicated – File based Systems. Too flexible. Custom approach © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Scientific Data Management Data Models •3D Structure explain •Trajectory •Scientist: John •Parameter: #$@ •Data location – Metadata – Experimental Data Indexing & Searching Data Repository Store – Metadata Management – Data Indexing and Searching Data Models – Storage for Experimental Data – Access Management Experimental Metadata Data Explains Data Repository © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Goals: Data Models Multiple Data Models Data Model Evolution Data Integration Experimental Data Query Data Model A Metadata Data Model B Experimental Data Data Repository © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Metadata Metadata Model (Context Data) Experimental Context – Information about experiments (e.g. owners and parameter settings) Logical View of Experimental Data – Logical organization of physical experimental data – Their location information – Associated software information © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. X-SIGMA Metadata Model Example < (b) User Interface for Editor > © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Logical View of Experimental Data Location, Format and Associated Software © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Goals: Federated Data Repository Distributed Repository Local Site Autonomic Management Decoupling between Metadata and Experimental Data Legacy Data Access Support Global Metadata Management Distributed Access to Experimental Data Local Metadata Management Local Metadata Local Access to Experimental Local Access to Experimental Management Data Data XML Schemas in XML DBMS XML Schemas in Experimental Experimental XML DBMS Data in gfarm, GridFtp, SRB Data in gfarm, GridFtp, SRB Site BKonkuk Univ. © Karpjoo Jeong, Applied Grid Computing Center, Site A X-SIGMA System Structure Global X-SIGMA Global Schema Management Search & Access Schema Integration Global Schema Query Processor Context Data Distributed Query System (OGSA-DAI) Distributed Access System (SRB,GRID-FTP) Local X-SIGMA Local X-SIGMA Local Schema Management Local Schema Management Register /Insert Local Query Processing System Local Experimental Access System Search Schema & Context Data Experimental Data Access System Access Storage (File, Legacy) Register /Insert Local Query Processing System Local Experimental Access System Search Schema & Context Data Access Storage (File, Legacy) Local X-SIGMA Local Schema Management Register /Insert Local Query Processing System Local Experimental Access System Search Schema & Context Data © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Access Storage (File, Legacy) X-SIGMA System Architecture Global X-SIGMA Context Data Global Schema Management Distributed Query Processor Experimental Data Access System OGSA-DAI Integrated Access Layer Searching Context Data Grid Middleware RDF Database SRB Accessing Real Data Local X-SIGMA Local X-SIGMA Local X-SIGMA Schema & ContextData Management Schema & ContextData Management Schema & ContextData Management Query Process Real Data Access Query Process Real Data Access Query Process Real Data Access XML Database Storage XML Database Storage XML Database Storage < Site A > < Site B > < Site C > © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Interfaces GUI-based Interfaces Web Services based Interfaces © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Glyco-MGrid Glyco-MGrid is a molecular simulation computing and data grid portal for glycomics It provides shared and integrated cyber-environments which support simulation, databases, and trajectory analysis in a collaborative way Data sharing is based on X-SIGMA © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. Glyco-MGrid Glyco-MGrid Database Simulation Trajectory Active Projects XML Document MGrid-SDG (X-SIGMA) MGrid-CG PSE Analysis Computing Real GridFTP/RFT Data © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ. References “X-SIGMA: XML based Simple data Integration system for Gathering, Managing, and Accessing Scientific Experimental Data in Grid Environments”, 2nd Conference on eScience and Grid Computing, 2006 “Glyco-MGrid : A Collaborative Molecular Simulation Grid for e-Glycomics”, To appear in 3rd Conference on eScience and Grid Computing, 2007 © Karpjoo Jeong, Applied Grid Computing Center, Konkuk Univ.