Website Optimization The Four Stages of Search Engine Web Page Inclusion
As SEO practitioners, not only do we need our websites to be crawled by search engines, but we also need them to be included. Most importantly, after inclusion, we hope for good rankings. This article will briefly analyze the four stages of search engine web page inclusion. The rankings of each website and each web page are different. Let's see which stage your website is at.
Stage 1 of Web Page Inclusion: Catching Everything
Search engines adopt a "catching everything" strategy when crawling web pages. That is, they add all the links they can find on a web page to the list of URLs to be crawled one by one. They mechanically extract the URLs from the newly crawled web pages. Although this method is rather old - fashioned, it is very effective. This is why many webmasters report that spiders have visited their websites but the pages have not been included. This is just the first stage.
Stage 2 of Web Page Inclusion: Web Page Rating
The second stage is to rate the importance of web pages. PageRank is a well - known link analysis algorithm that can be used to measure the importance of web pages. Naturally, webmasters can use the idea of PageRank to rank URLs. This is what everyone is so keen on as "building external links." According to a friend, the market for "building external links" in China has an annual scale of hundreds of millions of yuan.
The purpose of crawlers is to download web pages. However, PageRank is a global algorithm. That is, its calculation results are reliable only after all web pages have been downloaded. For small and medium - sized websites, if the server quality is poor, during the crawling process, only part of the content is seen, and a reliable PageRank score cannot be obtained during the crawling stage.
Stage 3 of Web Page Inclusion: The OCIP Strategy
The OCIP strategy is more like an improvement on the PageRank algorithm. Before the algorithm starts, each web page is given the same amount of "cash." Whenever a page A is downloaded, A distributes its "cash" equally to the linked pages contained in the page and empties its own "cash." This is one of the reasons why the fewer the outbound links, the higher the weight.
For the web pages to be crawled, they are sorted according to the amount of "cash" they have at hand, and the web pages with the most "cash" are downloaded first. The OCIP is roughly the same as the PageRank in terms of the idea, but the difference is that PageRank requires iterative calculations each time, while the OCIP does not. Therefore, the calculation speed of the OCIP is much faster than that of PageRank, making it suitable for real - time calculations. This may be one of the reasons why many web pages are "instantly included."
Stage 4 of Web Page Inclusion: The Big - Site - First Strategy
The idea of the big - site - first strategy is very straightforward. It measures the importance of web pages on a website - by - website basis. For the web pages in the queue of URLs to be crawled, they are classified according to the website. If a website has the most pages waiting to be downloaded, these links are downloaded first. Its essential idea is "to preferentially download the URLs of large websites." Because large websites often contain more pages. Given that large websites are often well - known websites, the quality of their web pages is generally higher. So, although this idea is simple, it has a certain basis.
Experiments have shown that although this algorithm is simple and crude, it can include high - quality web pages and is very effective. This is also one of the most important reasons why many websites are ranked behind large websites when their content is reprinted.
【版权与免责声明】如发现内容存在版权问题,烦请提供相关信息发邮件至
,我们将及时沟通进行删除处理。
本站内容除了
谷歌外链,友情链接 (
https://www.abcdlink.com )特别标记的原创外,其它均为网友转载内容,涉及言论、版权与本站无关。