Web crawling: Difference between revisions

Content deleted Content added

Inline

Latest revision as of 03:04, 14 January 2018

Redirect to:

From a gerund: This is a redirect from a gerund (or gerund phrase), a verb form that ends with "ing" and that functions as a noun, to a related part of speech or topic.

@@ Line 1: / Line 1: @@
+#REDIRECT [[Web crawler]]
-Web Crawling is a fundamental procedure, of the internet, by which a software
-is specifically designed to extract information from web-pages.
+{{R from gerund}}
-The first web-crawlers were search engines, whose sole job is to jump from link to link on web-pages, "crawling" meaning to extract information on the web-page as it goes along. This information is normally, the title/description of the pages along with kwy phrases in the web-page.  This extracted information is passed to a global database index, which can be used to search content on web-pages.
-Search engines coined the term "web crawling" in that context. These days, "web scraping" means the same think, but more likely in the context of extraction specific information from a web-page on a regular basis.