Principles for the Assessment of Text and Data Mining Acquisitions

The following set of principles is intended to guide librarians in negotiating with vendors and in considering text and data mining (TDM)-related acquisitions.

  • Availability. Resources acquired for the purposes of TDM research should be available indefinitely to the institutional community as a whole.

  • Privacy. TDM platforms, tools, and data should be available to institutions and researchers without the requirement of identifying individual researchers or research projects.

  • Portability. The mode of delivery for source data to be utilized in TDM research should be as portable as possible, limited only as necessary.

  • Accessibility. To the extent that a specific platform is required for TDM, that platform will be made accessible to researchers with a broad range of abilities.

  • Reproducibility. Research outputs that utilize TDM tools, data, or platforms should be reproducible and transparent.

  • Sustainability. TDM access should not involve unsustainable expenditures for the institution or the passing of costs on to researchers.

  • Collaboration. TDM tools, resources, and products should facilitate cross-institutional collaboration.