Major projects

Major and collaborative projects

Below are projects that involve a grant award in excess of $5 million or a collaboration of more than one PTI center. (Information about other projects is listed on each center’s web pages.)

NSF Cybersecurity Center of Excellence

The National Science Foundation designated the Center for Trustworthy Scientific Cyberinfrastructure (CTSC) as its Cybersecurity Center of Excellence. The Center is led by CACR Director Von Welch and Chief Policy Analyst Craig Jackson.

CTSC leads the NSF community in understanding how cybersecurity promotes trustworthy scientific and engineering research, while also enabling the broad adoption of practices in cyberinfrastructure operations, research, and development.

Centers for Academic Excellence

Indiana University is also designated as a:

  • NSA/DHS National Center of Academic Excellence in Information Assurance/Cyber Defense Education
  • NSA/DHS National Center of Academic Excellence in Information Assurance/Cyber Defense Research

Through CACR and in collaboration with the IU School of Informatics and Computing, PTI has coordinated Indiana University’s key cybersecurity designations since 2008.


Comet

Geoffrey Fox, distinguished professor of computer science and informatics and principal investigator of the NSF’s FutureGrid project, is a strategic partner for NSF’s Comet project.

Comet is the world’s first virtualized high performance computing cluster, and FutureGrid’s expertise will inform its use of virtual systems in supercomputing. By incorporating technology currently used by commercial cloud computing providers, Comet will benefit a wide range of researchers who need quick turnaround on medium-sized computing jobs.

HathiTrust Research Center

Led at IU by D2I Director Dr. Beth Plale, the HathiTrust Research Center (HTRC) gives nonprofit and educational users computational access to published works in the public domain. These works are stored in the HathiTrust Digital Library, an extensive collaborative digital library.

HTRC has developed the HTRC Data Capsule, a secure framework through which researchers and educators can study copyrighted content. They can take their research results out of the data capsule without violating copyright on the underlying protected literature.


Jetstream

Jetstream is a first-of-a-kind system for the NSF: a distributed production cloud resource. It offers on-demand academic cloud services, enabling researchers or students to select a pre-existing VM image or to create a new virtual environment for personalized research computing.

Jetstream also hosts persistent science gateways and enables data movement, storage, and dissemination. It is a managed science cloud, that is, a cloud managed for scientific applications. Jetstream is led by IU, with PTI Executive Director Craig Stewart as PI.

MIDAS - CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science

This project will design and implement software—Middleware for Data-Intensive Analytics and Science (MIDAS). MIDAS will enable scalable applications and combine high performance computing with the rich functionality of the commodity Apache Big Data Stack.

This project will also design and implement a set of cross-cutting, high-performance data analysis libraries. For example, SPIDAL (Scalable Parallel Interoperable Data Analytics Library) will support new programming and execution models for data-intensive analysis in a wide range of science and engineering applications.

This project is led by PI Geoffrey C. Fox, Director of the Digital Science Center.

Open Science Grid

The Open Science Grid (OSG) is a high-throughput distributed computing infrastructure designed for large-scale scientific research. Developed and operated by a consortium of universities and national laboratories, the OSG interoperates with similar grids throughout the world, allowing scientists to seamlessly and opportunistically harness high-throughput computing resources they may not have been able to use otherwise. In 2015, the OSG provided over 1.2 billion CPU hours to researchers.

The OSG is jointly funded by the Department of Energy and the National Science Foundation. The OSG Operations Center located at Indiana University provides a single point of operational support for OSG users, resource providers, and collaborators. High Throughput Computing Manager Rob Quick acts as the OSG operations officer, and CACR Director Von Welch and Senior Systems Analyst Susan Sons lead OSG’s operational security program.

Software Assurance Marketplace (SWAMP)

Funded by the Department of Homeland Security, the SWAMP provides a free-to-use software assurance facility for testing software for weaknesses that can lead to vulnerabilities and other cybersecurity problems. The SWAMP software is now also available as open source for installation at a user’s own facility.

CACR Director Von Welch is a co-PI on the project, and CACR and RT’s High Throughput Computing group will provide leadership for cybersecurity, monitoring, and user support.

Open Gateway Computing Environments Science Gateways Platform as a Service (OGCE SciGaP)

The SciGaP project develops and deploys Apache Airavata software as an open source platform of services that can be used to build Science Gateways.

The SciGaP project (NSF award #1339774) is supported by a collaborative grant. It is led by Indiana University, with the University of California San Diego and the University of Texas Health Science Center at San Antonio as collaborating partners.

SGRC Director Marlon Pierce is the PI of this project.

Science Gateway Institute

PTI provides cybersecurity leadership for the Science Gateway Institute through a collaboration between the Science Gateways Research Center (SGRC) and the Center for Applied Cybersecurity Research (CACR), specifically Director Von Welch and Senior Systems Analyst Randy Heiland.

Sustainable Environments – Actionable Data (SEAD)

SEAD offers data management, curation, and publication services to researchers who require reliable and enhanced cyberinfrastructure. SEAD’s services provide the ability to manage, integrate, interpret, share, curate, and preserve data.

SEAD’s web-based data services are powerful, user-friendly, and tailored to the requirements of smaller projects. D2I Director Beth Plale is a Co-PI on SEAD and the leader of IU’s involvement.


XSEDE

XSEDE is a single virtual system that scientists can use to interactively share computing resources, data, and expertise. People around the world use these resources and services (things like supercomputers, collections of data, and new tools) to improve our planet.

IU has a major subcontract from the University of Illinois Urbana-Champaign, with Craig Stewart as the IU PI. Dr. Marlon Pierce is a key leader in XSEDE’s extended collaborative support services, with a focus on implementing science gateways. Rich Knepper, Therese Miller, and Craig Stewart are all leaders in XSEDE’s community interoperability efforts to connect campus and national cyberinfrastructure. Julie Wernert and Craig Stewart are leading XSEDE’s efforts to quantify its benefits to the US overall.