Is web scraping illegal?

2023.10.20 16:41

layeon

Web scraping refers to a range of techniques for collecting information from across the Internet. A scraper gathers information from web pages and then processes what it has acquired. The more detailed the extracted data, the deeper the analysis it can support.

When do you use it?

With the advent of the big data era, data analysis is becoming more and more important. Scraping techniques are used to collect the vast amounts of data that such analysis needs.

For example, scraping is useful when the data is too laborious for a person to collect by hand, such as weather forecast information, stock indexes for stock price forecasting, or product prices for comparison in marketing.

Precautions and countermeasures for scraping

In most cases, web scraping is not illegal. So under what circumstances does collecting data by web scraping carry legal risk?

1. Loading the server

Scraping is the act of pulling data from a server that the other party publishes as web pages. No law directly prohibits accessing a website frequently, but excessive access puts a heavy load on the server. That load risks interfering with the other party's server, and the access may then be treated as unauthorized.

Extract data about once every 3 seconds so as not to overload the server. In ScrapeStorm, you can set a delay time on the anti-block screen. A 3-second delay helps keep your requests from being treated as unauthorized access.
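If you write your own scraper instead of using a tool, the same delay can be added in code. The following is only a minimal sketch in Python: the function name `fetch_politely` and its `fetch` and `sleep` parameters are hypothetical names chosen for this illustration, and the 3-second default simply mirrors the interval suggested above.

```python
import time
from typing import Callable, Iterable, List


def fetch_politely(
    urls: Iterable[str],
    fetch: Callable[[str], str],
    delay_seconds: float = 3.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Fetch each URL in turn, pausing between consecutive requests.

    `fetch` is whatever function actually downloads a page (an assumption
    of this sketch); `sleep` can be swapped out so the loop can be tried
    without really waiting.
    """
    results = []
    for index, url in enumerate(urls):
        if index > 0:
            # Wait before every request except the first one.
            sleep(delay_seconds)
        results.append(fetch(url))
    return results
```

In practice, `fetch` could be a small wrapper around `urllib.request.urlopen` that returns the decoded page body. A fixed delay is the simplest choice; it does not adapt to how the server is actually responding.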

2. Whether scraping is allowed

Check whether the site allows its pages to be scraped by reading the “robots.txt” file in the root directory of the target website. Enter “http://target site URL/robots.txt” in your browser to display the site's robots.txt rules.
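The same check can be done in code. Below is a minimal sketch using `urllib.robotparser` from the Python standard library. The robots.txt content, the `example-bot` user agent, and the `example.com` URLs are made-up values for illustration and are not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, used here instead of a live download.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in ("https://example.com/products/1",
                "https://example.com/private/data"):
        print(url, is_allowed(SAMPLE_ROBOTS_TXT, "example-bot", url))
```

For a live site, `RobotFileParser` also provides `set_url()` and `read()` to download the file before calling `can_fetch()`.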

For example, Amazon updates a wide range of information every day, such as prices and product ratings.

Does this site allow product information to be retrieved? Let’s look at the robots.txt file in its root directory.

It contains a lot of Disallow rules, so scraping product information from Amazon is not a good idea.

Don’t send malicious requests, so that you stay out of legal trouble. Respect robots.txt, follow the law, and use scraping correctly.
