Indiana University

 

D2I Current Projects

Data Management Research
D2I is funding up to five research grants to address aspects of data management, including but not limited to extensible infrastructure for data interoperability, data to insight, data lifecycles, digital data curation, and scientific data preservation. The grants are intended to support the early development of innovative research that is likely to compete favorably for external funding in the future, or to produce translational deliverables such as new software, a demonstration collection, or other tangible output that advances the goals of the Data to Insight Center.
Proposal Page
 

 
Funding: Lilly Endowment
July 12, 2010 - current
 
Digital Data Provenance - Karma
As digital research data collections become more accessible, it becomes increasingly important to address data validity and quality by recording and managing information about where each data object originated, which processes were applied to the data products, and by whom. This project is developing tools for provenance generation and collection and for case-based reasoning. The tools and collected data are available for download for wider community use.
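The kind of information described above (origin, applied process, responsible party) can be illustrated with a minimal Python sketch. This is not Karma's API or data model; the names `ProvenanceRecord`, `ProvenanceLog`, and the example file names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class ProvenanceRecord:
    """One provenance fact (hypothetical structure, not Karma's schema)."""
    data_object: str   # the data product that was produced
    origin: str        # the input it was derived from
    process: str       # the process applied to the input
    agent: str         # who ran the process
    recorded_at: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )


class ProvenanceLog:
    """In-memory store for provenance records."""

    def __init__(self):
        self._records = []

    def record(self, data_object, origin, process, agent):
        rec = ProvenanceRecord(data_object, origin, process, agent)
        self._records.append(rec)
        return rec

    def lineage(self, data_object):
        """Return the chain from data_object back to its earliest known origin."""
        chain = [data_object]
        seen = {data_object}
        current = data_object
        while True:
            producers = [r for r in self._records if r.data_object == current]
            if not producers:
                return chain
            current = producers[-1].origin
            if current in seen:  # guard against cyclic records
                return chain
            seen.add(current)
            chain.append(current)


log = ProvenanceLog()
log.record("calibrated.nc", "raw.dat", "calibrate", "alice")
log.record("gridded.nc", "calibrated.nc", "regrid", "bob")
print(log.lineage("gridded.nc"))
```

Walking the records backwards from a data product answers the "where did this come from" question; a real provenance system would also persist the records and capture them automatically from running workflows.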
Project Page
 
Internal Links
 Discussion Board
Funding: NASA | GENI
October 2009 - current
 
Geni-NetKarma
The project will collect provenance of the data generated by GENI. A GENI Provenance Registry (NetKarma) will capture the workflow of GENI slice creation, the topology of the slice, its operational status, and other measurement statistics, and correlate them with the experimental data. NetKarma will allow researchers to see the exact state of the network and to store the configuration of the experiment and its slice.
Project Page
GENI Project Wiki
Internal Links
 Discussion Board
Funding: GENI
October 2009 - current
 
NASA-InstantKarma
The project will improve the collection, preservation, utility and dissemination of provenance information within the NASA Earth Science community.  It will customize and integrate Karma into NASA data production by collecting and disseminating provenance of AMSR-E (Advanced Microwave Scanning Radiometer - Earth Observing System) standard data products.
Project Page
 
Internal Links
 Discussion Board
Funding: NASA
April 2010 - current
 
LEAD II 
LEAD II is a follow-on to the successful Linked Environments for Atmospheric Discovery (LEAD) project, a large-scale NSF-funded ITR. LEAD II carries the vision of LEAD forward into new areas, exploring research challenges in hybrid computing and in the manipulation and use of weather data in non-weather applications. LEAD II supported Vortex2 and is currently supporting Kathleen Baker with her USDA crop disease research.
Project Page
LEAD Portal
Internal Links
 Discussion Board
 Developers Website
Funding: Microsoft Research, USDA & Data to Insight Center
September 2009 - current
 
Sigiri
Sigiri is a lightweight job management and abstraction service that supports job specification languages such as JSDL and RSL. A web service interface allows integration with various scientific workflow systems, and each step in job submission and management is decoupled to increase scalability.
Project Page
 
Internal Links
 Discussion Board
Funding: Data to Insight Center
Current
 
Streamflow
 
Streamflow integrates data streams into a standard workflow system through a programming model that introduces new workflow semantics, enabling scientific workflow designers to incorporate data streams into an experiment without major changes to the infrastructure. It uses XBaya as a graphical client for workflow composition, execution, and monitoring.
Project Page
 
Internal Links
 Discussion Board
Funding: Data to Insight Center
Current
 
XMC Cat
XMC Cat is a web service toolkit for capturing and storing metadata during the execution of scientific workflows to enable data discovery and reuse. Its advantages include adaptability to domain schemata through configuration instead of code changes, support for automatic capture of metadata through curation plugins, and search and browse capabilities through a web-based GUI that dynamically adjusts to the domain schema. This allows XMC Cat to be deployed in different scientific domains without requiring new code to be written. It is currently in use in the LEAD Science Gateway.
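The idea of adapting to a domain schema through configuration, with metadata captured by plugins, can be sketched roughly as follows. This is not XMC Cat's actual interface; `MetadataCatalog`, the schema dictionary layout, the field names, and the plugin function are all hypothetical and serve only to illustrate the pattern.

```python
# Hypothetical domain schema expressed as configuration rather than code.
ATMOSPHERIC_SCHEMA = {
    "required": ["title", "model", "forecast_time"],
    "optional": ["grid_spacing_km"],
}


class MetadataCatalog:
    """Stores metadata entries validated against a configured schema."""

    def __init__(self, schema):
        self.schema = schema
        self.plugins = []   # callables that extract metadata from an output
        self.entries = []

    def register_plugin(self, plugin):
        self.plugins.append(plugin)

    def capture(self, workflow_output):
        """Run every plugin, keep schema fields only, and store the result."""
        metadata = {}
        for plugin in self.plugins:
            metadata.update(plugin(workflow_output))
        allowed = set(self.schema["required"]) | set(self.schema["optional"])
        metadata = {k: v for k, v in metadata.items() if k in allowed}
        missing = [f for f in self.schema["required"] if f not in metadata]
        if missing:
            raise ValueError(f"missing required fields: {missing}")
        self.entries.append(metadata)
        return metadata

    def search(self, **criteria):
        """Return entries whose fields equal all given criteria."""
        return [
            e for e in self.entries
            if all(e.get(k) == v for k, v in criteria.items())
        ]


def forecast_plugin(output):
    # Hypothetical curation plugin: maps raw output keys to schema fields.
    return {
        "title": output["name"],
        "model": output["model"],
        "forecast_time": output["valid_at"],
        "scratch_dir": output.get("tmp"),  # not in the schema, so dropped
    }


catalog = MetadataCatalog(ATMOSPHERIC_SCHEMA)
catalog.register_plugin(forecast_plugin)
catalog.capture({"name": "Run 12", "model": "WRF", "valid_at": "2010-05-01T00:00Z"})
print(catalog.search(model="WRF"))
```

Deploying the same catalog in another domain would, under this sketch, mean supplying a different schema dictionary and plugins while leaving `MetadataCatalog` unchanged.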
Project Page
 
Internal Links
 Discussion Board
Funding: Data to Insight Center
October 2005 - current
 

Archived Projects >>