How to Make Ethical Scrapers?
When you are running a company, you should be careful when web scraping because your competitors can use it against you. To protect yourself and to be a good, law-abiding digital citizen, there are several steps you can take when creating and running your web scrapers.
Think twice before scraping personal data
If the data collected can be used to identify a person, make sure you obtain their consent before scraping it.
This data can be anything from official information about a person, contact details, behavioral data, shopping preferences, location either by address or GPS, Video + audio recordings of people and biometric data, sex, gender, sexual orientation, and medical records, among other information.
Publicly available personal data
When it comes to web scraping, many people mistakenly believe that only private personal data is protected. But what does that even mean? And is it really okay to scrape personal data from public sources like websites? It all depends.
A company in the EU was fined a hefty amount for scraping public data from the Polish business register. Although the court later overturned the fine, it did uphold the ban on scraping publicly accessible data.
According to the CCPA, government-released information like business register data is "publicly available" and not classified as protected.
The most recent decision concerning scraping publicly accessible data from social media networks in the US has stirred up a lot of controversies. The case, HiQ vs. LinkedIn, deals with whether or not it is legal to scrape personal information that was made public by the person.