Web crawling: Difference between revisions
Appearance
Content deleted Content added
No edit summary |
Senator2029 (talk | contribs) R from gerund |
||
(One intermediate revision by one other user not shown) | |||
Line 1: | Line 1: | ||
#REDIRECT [[Web crawler]] |
|||
Web Crawling is a fundamental procedure, of the internet, by which a software |
|||
is specifically designed to extract information from web-pages. |
|||
{{R from gerund}} |
|||
The first web-crawlers were search engines, whose sole job is to jump from link to link on web-pages, "crawling" meaning to extract information on the web-page as it goes along. This information is normally, the title/description of the pages along with kwy phrases in the web-page. This extracted information is passed to a global database index, which can be used to search content on web-pages. |
|||
Search engines coined the term "web crawling" in that context. These days, "web scraping" means the same think, but more likely in the context of extraction specific information from a web-page on a regular basis. |
Latest revision as of 03:04, 14 January 2018
Redirect to:
- From a gerund: This is a redirect from a gerund (or gerund phrase), a verb form that ends with "ing" and that functions as a noun, to a related part of speech or topic.