Document Type

Peer Reviewed/Refereed Publication

Publication Date

12-15-2020

Publication Title

Communications of the Association for Information Systems

Department

Computer Science and Information Systems

College/School

Arthur J. Bauernfeind College of Business

Abstract

Automatic retrieval of data from the Web (often referred to as Web Scraping) for industry and academic research projects is becoming a common practice. A variety of tools and technologies have been developed to facilitate Web Scraping. Unfortunately, the legality and ethics of using these tools for collecting data are often overlooked. Failure to pay due attention to these aspects of Web Scraping can result in serious ethical controversies and lawsuits. This paper reviews legal literature together with the literature on ethics and privacy to identify broad areas of concern together with a list of specific questions that need to be addressed by researchers and practitioners engaged in Web Scraping. Reflecting on these questions and concerns can potentially help the researchers decrease the likelihood of ethical and legal controversies in their work.

Comments

This is an Accepted Manuscript of an article published by the Association for Information Systems in Communications of the Association for Information Systems on December 10, 2020, available online: https://doi.org/10.17705/1CAIS.04724

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.