Jump to content

Web crawling: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
No edit summary
 
R from gerund
 
(One intermediate revision by one other user not shown)
Line 1: Line 1:
#REDIRECT [[Web crawler]]
Web Crawling is a fundamental procedure, of the internet, by which a software

is specifically designed to extract information from web-pages.
{{R from gerund}}
The first web-crawlers were search engines, whose sole job is to jump from link to link on web-pages, "crawling" meaning to extract information on the web-page as it goes along. This information is normally, the title/description of the pages along with kwy phrases in the web-page. This extracted information is passed to a global database index, which can be used to search content on web-pages.
Search engines coined the term "web crawling" in that context. These days, "web scraping" means the same think, but more likely in the context of extraction specific information from a web-page on a regular basis.

Latest revision as of 03:04, 14 January 2018

Redirect to: