Data sets & tools

  • Data reporting and representation

    Kinsey Reporter

    • Global mobile survey platform for collecting and sharing anonymous data about sexual and other intimate behaviors.
    • Developed by Digital Science Center

    Learn more about Kinsey Reporter

    Click Dataset

    • Study of the structure and dynamics of web traffic networks
    • Developed by Digital Science Center

    Learn more about Click Dataset

    Truthy

    • A research project that helps in understanding how memes spread online through collections and analyses of tweets from Twitter
    • Developed by Digital Science Center

    Learn more about Truthy

  • Environmental data and analysis

    Linked Environments for Atmospheric Discovery II (LEAD II)

    • A follow-on to the successful Linked Environments for Atmospheric Discovery NSF funded large-scale ITR
    • Explores research challenges in hybrid computing and using weather data in non-weather applications
    • Developed by Data to Insight Center (D2I)

    Learn more about LEAD II

    LEAD II Vortex 2 Archive Dataset

    • In support of NSF-funded Vortex 2 tornado data gathering field effort
    • Developed by Data to Insight Center (D2I)

    Learn more about LEADII Vortex 2 Archive Dataset

  • Genomic data and analysis

    DroSpeGe

    • Comparative Drosophila Species Genomes database
    • Developed by NCGAS

    Learn more about DroSpeGe

    Daphnia (water flea) genome

    • Annotated Genome of Daphnia (water flea)
    • Developed by NCGAS

    Learn more about Daphnia (water flea) genome

    Reference Genome Data

    • Data for assemblies that use a standard sequence as a reference
    • Developed by NCGAS

    Learn more about Reference Genome Data

    Galaxy

    • A web-based workflow composer and manager for genomic analysis that can be used by domain scientists
    • Supports data importing and management, file transformation, and analytical tools such as Trinity for RNA-Seq analysis
    • Developed by NCGAS

    Learn more about Galaxy

    Get SNPs using NCBI eSearch and eFetch

    • A web service to display information about SNPs specified either by SNP ID, or by a chromsome region
    • Developed by NCGAS

    Learn more about SNPs

  • Geographic information

    Indiana Spatial Data Portal

    • Indiana digital aerial photos, topographic maps, and digital elevation data
    • Developed by Research Technologies

    Learn more about Indiana Spatial Data Portal

  • Grid computing

    PRAGMA at IU

    • IU provides a virtual cluster consisting of a front end node and 3 compute nodes
    • Additional virtual clusters may be made available in the future
    • Developed by Data to Insight Center (D2I)

    Learn more about PRAGMA

    Streamflow

    • Enables scientific workflow designers to incorporate data streams into the experiment without major changes to the infrastructure
    • Uses XBaya as a graphical client program for workflow composition, execution and monitoring
    • Developed by Data to Insight Center (D2I)

    Learn more about Streamflow

  • Multimedia

    Photocat (Image Collections Online)

    • A user interface to catalog images within a Fedora repository
    • Developed by Data to Insight Center (D2I)

    Log into Photocat

  • Protein methods

    Protein Interaction Abstract Relevance Evaluator (PIARE)

    • Implements the binary classifier CNets produced for the Protein-Protein Interaction Article Classification in Biocreative II and Biocreative II
    • Developed by Digital Science Center

    Learn more about PIARE

    DisProt

    • Database of proteins that contain regions of intrinsic disorder or that are entirely disordered as determined by one or more of 30 different experimental methods.
    • Developed by Research Technologies

    Learn more about DisProt

  • Provenance and metadata

    Gigabyte Synthetic Database

    • A noisy data collection generated using the Workflow Emulator Tool (WORKEM) with a number of scientific workflows
    • Developed by Data to Insight Center (D2I)

    Learn more about GSD

  • Publication curation and citations

    Scholarometer (beta)

    • A social tool to facilitate citation analysis and help evaluate the impact of an author’s publications
    • Developed by Digital Science Center

    Learn more about Scholarometer

  • Science gateways

    QuakeSim

    • Provides access to time series analysis and deformation tools to support geophysical research on earthquakes
    • Also provides access to earthquake fault models, InSAR imagery, and analyzed GPS data
    • Developed by Digital Science Center

    View QuakeSim

    E-DECIDER

    • Builds on tools from the QuakeSim project to support emergency planning and response for earthquake events
    • Developed by Research Technologies

    View E-DECIDER

    OGCE wiki

    • The main web site for information on the NSF-funded Open Gateway Computing Environments project, which provides open source, open community software for building Web-based science gateways
    • Developed by Research Technologies

    View the OGCE wiki

    OASIS

    • A caching service for OSG VOs to install application software which can then be mounted remotely to OSG worker nodes
    • Developed by Research Technologies

    Learn more about OASIS

    BioVLab

    • A scientific gateway front end for reconfigurable cloud computing environment for microRNA and mRNA integrated analysis
    • Developed by Research Technologies

    View the BioVLab wiki

    GridChem

    • A scientific gateway that provides access to high performance computing resources for computational chemistry with distributed support and services, intuitive interfaces and measurable quality of service.
    • Developed by Research Technologies

    Learn more about GridChem

  • Scientific and parallel programming

    Hierarchical MapReduce

    • A framework that gathers computation resources from different clusters, allowing you to run MapReduce jobs across them
    • Developed by Data to Insight Center (D2I)

    Learn more about MapReduce

  • Text analysis

    HathiTrust Research Center

    • An extensive collaborative digital library of more than 8 million volumes and 2 billion pages of archived material maintained by major research institutions and libraries worldwide.
    • Developed by Data to Insight Center (D2I)

    Learn more about HathiTrust