CAUSEWAY : FAQ for Website Owners


What is Ivy Plus Libraries?

Ivy Plus Libraries is a partnership between thirteen leading academic research libraries ― Brown University, the University of Chicago, Columbia University, Cornell University, Dartmouth College, Duke University, Harvard University, Johns Hopkins University, the Massachusetts Institute of Technology, the University of Pennsylvania, Princeton University, Stanford University and Yale University ― that collectively provide access to a rich and unique record of human thought and creativity through resource sharing and collaboration. The Ivy Plus Web Collecting Program is a collaborative collection development effort to build curated, thematic collections of freely available, but at-risk, web content in order to support research at participating Libraries and beyond. 

What is the Collaborative Architecture, Urbanism and Sustainability Web Archive (CAUSEWAY)?

The Collaborative Architecture, Urbanism and Sustainability Web Archive (CAUSEWAY) is a project to archive websites devoted to the related topics of architecture, urban fabric, community development activism, public space and sustainability. CAUSEWAY is curated by art and architecture librarians at Brown, Columbia, Cornell, Dartmouth, Harvard, Johns Hopkins, Princeton, and Yale universities, and the universities of Chicago and Pennsylvania (collectively known as the Ivies Plus Art and Architecture Group).

What is the purpose of CAUSEWAY?

The goal of the CAUSEWAY web collection is to preserve (and document the evolution over time of) the selected websites in a secure digital archive to guarantee the continuing availability of these important but potentially ephemeral resources for researchers.

Why was my website selected by Ivy Plus Libraries for CAUSEWAY?

Participating librarians are choosing websites that fit into the themes of CAUSEWAY: Urban Fabric (e.g. historic preservation, urban renewal, urban preservation), Public Space (e.g. parks, community gardens and other open spaces), or Community Activism (e.g. historic preservation initiatives, associations).  Each librarian is making nominations focused on the geographic region in which her or his institution is located. 

What is web archiving?

Web archiving is the process of creating and storing copies (or snapshots) of live websites to ensure that they are preserved and remain publicly accessible for future use. Sites are typically captured using automated web crawlers, which systematically identify and follow hyperlinks on selected websites (referred to as seeds), copying and storing information as they work. Ivy Plus Libraries uses the Archive-It service to crawl websites: its crawler is called archive.org_bot.

Will the web crawler interrupt access to my website?

No, it shouldn’t. Ivy Plus Libraries crawls websites at a polite rate so as not to interfere with access to your site. Crawls are typically run semi-annually, operate in the background, and last for at most a few days. Once a crawl is complete, the crawler no longer interacts with your site  

What is your copyright policy?

Ivy Plus Libraries attempts to notify all organizations and/or website owners before crawling selected sites. Some websites may contain material that is produced by other parties who may claim copyright ownership of such materials. Third-party copyright holders who believe their rights have been infringed upon may contact us at causeway@library.columbia.edu.

What if I do not want my website to be included in CAUSEWAY?

We will honor requests to remove access to archived content. Please contact us at causeway@library.columbia.edu.

Where can I access the archived versions of my website?

All websites collected in the CCWA can be freely and permanently accessed via Archive-It.

Why do the archived versions of my website look different than my live website?

Though Ivy Plus Libraries attempts to create complete copies of each website we collect, web crawlers can struggle to capture portions of the dynamic web. Examples of challenging or impossible-to-capture content include: JavaScript-driven navigation menus, streaming audio and video, and database-driven content. In addition, web crawlers cannot collect password-protected portions of your site: only publicly available content will be crawled. Site owners interested in optimizing their site design to allow full archiving may be interested in these Guidelines for Preservable Websites.