One of the important tasks of SEO-analysts is that the correct identification documents in the SERP Yandex, which are not organic search results, obtained as a result of the normal operation of the main ranking algorithm.
Such documents are referred to as an impurity to the organic matter, and they are unnecessary noise in solving a variety of analytical tasks, such as tasks textual analysis of documents from the stamp issue. Having identified impurity, we can clean it from the organic search results for later use them in solving such problems.
Two types of documents can be identified that are not organic search results. The first type (I would call it a foreign impurity) - are documents that are missing in the results of the service Yandex.XML . In a typical issue they can be visually distinguished from the organic matter on different grounds. By foreign impurities are:
1. Advertising space in the search results (PPC)
The staff Yandex is undoubtedly very much like to see the ads on the page of search results would not differ from the organics. However, they still have to label contextual advertising. In the " Rules of the show Yandex " states that "ad impressions (hereinafter referred to as -" Offers ") on the advertising space can be marked:" Yandex "," Direct "," Advertisement "," ₽ "or "P". " Moreover, with the curious ogovorochkoy: "Due to the technical limitations of the mark can be reduced or absent."
On the pages of search results at this time to mark the advertisements use the word "advertising", written in a smaller font than the font used in the ad. And in some cases, the mark can really virtually disappear, literally degenerating to one letter. For example, in the mobile issue:
The rest of the ads are practically no different from snippets of organics. Well, except that, in advertisements is never links "Read more> '. However, it is not always present in organic and snippets, so this feature should not be considered a reliable means of identification of advertising in conventional extradition.
Also note that advertisements may be displayed not only before or after the organic results, but also between them.
2. Search koldunschiki, faktovye object and answers
Search koldunschiki - a specially decorated answers found on our own services Yandex. More details about them can be read on the site Yandex Technologies .
In view of the specifics of the submission to confuse these elements with organic search results is very difficult. However, as mentioned previously, if you have the opportunity to receive from the service issuing Yandex.XML, the problem of identification of external impurities becomes irrelevant - such documents in XML issue is not there.
The second type of impurities to the organic matter (called internal impurity) stored in the search results received from the service Yandex.XML. To it are:
1. Vital Answers
It documents considered clearly the best response to a request, for example, the home page of the official website of the company at the request of its brand. In fact, this results organic receiving extensive artificial boost to the calculated value of the basic algorithm relevance request and puts them firmly to the first place of issue.
In appearance, the vital answers are indistinguishable from organic snippets. Their identification is only possible to analyze the issue Yandex.XML service. In response vital parameter value field name <categ> contains an identifier that includes the substring UngroupVital, e.g., <categ attr = "d" name = "UngroupVital59.ru" />. Moreover, vital answers may be several:
2. Recent results
Recent results are displayed in response to an "event request", which according to Yandex important fresh answers. In seoshnyh medium such responses, the term "impurity bystrobotovskaya". This results drawn from documents indexed special robot - bystrobotom - and for a very short period of time fall in the index. Ranking them by a special algorithm, and so cleaning organics from these results is relevant in analyzing the operation of the basic algorithm.
Identifitsitovat bystrobotovskuyu impurity can be by the presence of specific markers of freshness document ( «N minutes ago", "of N hours ago" "yesterday," "the day before"):
3. ungroup results
By default, the search results from the same site are grouped together in the search results, and we see only one result from each site, the most relevant queries. B XML issuing grouped in the organic results on the site value of the name field of the parameter <categ> represents the domain name, for example, <categ attr = "d" name = "yandex.ru" />. But sometimes Yandex has preferences individual pages with individual sites and displays them in the issue outside groups on the site. For example, at the request of hemorrhoids you can see two results c kp.ru site:
In XML, issuing one of these results is as a field name value of the parameter <categ> domain name, but the second - obscure design MiddleUngroup_kp.ru_68.ru:
Also, as the value of the name field of the parameter <categ> may appear not the domain name of the site, and the URL of the document. And it can also be a sign that such results are not organic.
For example, recently the site owned by Yandex service Edadil seen in the tops of the huge number of requests, and often the content of pages that needs completely irrelevant. The impression that the documents from this site received a significant boost to its regular value relevant. A typical example:
In XML issue in the name field of the parameter value <categ> for this result contains the URL of the document is, and not the domain name of the site:
I recommend all of the results that are in XML extradition matter the name field of the parameter <categ>, other than the domain name of the site, which are excluded from the analysis of organic issue, as there are some reasons to believe that these results are either not processed the basic algorithm of ranking or receive some boost to the relevance value, the calculated basic algorithm.
4. Own projects Yandex
Since there is some reason to believe that Yandex may give a boost to the organic query relevance value computed basic algorithm for documents own projects (as located on yandex.ru domain, e.g., Yandeks.Rayon , Yandeks.Dzen or Yandeks.Kollektsii and in other domains, for example, Edadil , kinopoisk or Avto.ru ), I recommend the following answers to exclude from the analysis of organic issue in any case, even if the name field as the value of the parameter <categ> XML issue contains the domain name of the site.