What are the Search Quality Analyzers?

Search quality is very common and still very vague notion.

Each user has his/her own search habits and the most liked individual
search query types.

If, for example, you use mostly navigational queries (i.e., when you
need some place on the Net rather than some particular piece of
information), then you likely will like Google as this query type
is its core competence. And, on the contrary, if you like natural
language style queries, you may prefer the Ask search engine or some
other search engine.

The popularity of an SE does not directly reflect the quality of
search, because the popularity is a product of marketing and PR as well.

In order to develop an independent automatic test of search quality, we
developed a set of analyzers, one for each type of search queries. For
all these analyzers we use special sets of sample queries and sample
sites. We
measure quality of navigational and informational search, the level of
pornography in SE results page etc.

We hope that our tests are (or eventually would be) an objective and
reliable source of information on search quality. Enjoy.

How do the Analyzers work?

To estimate the search quality for various types of search queries, we use
special sets of sample queries and sample sites. For example, for
measuring the
navigational search quality, we use a set of approximately 5000 sample
queries
and the corresponding set of sample sites.

If the user inputs the 'CNN' query, she evidently (or, statistically)
wants to see the
site www.cnn.com at the first place on the search engine results page.


Thus, the www.cnn.com would be an organic result for the query 'CNN'. In
order to prevent the Analyzers from being compromised by search engine
developers, we use a sliding window of several tens of sample
queries each other day, and constantly replenish and refine the original
sample set of queries.

You can find description of the methods used in a particular analyzer on
the page where the analyzer data is shown.

We highly appreciate any corrections and welcome any criticism.
Please feel free to send us errors found, suggest new sample queries,
criticize the method etc.

Analyzer of nagivational search

Search query with a purpose of finding a certain website is called a navigational query. Such queries include "sberbank", "komsomolskaya pravda", "rambler", "gazeta ru", etc.

Best result of a navigational query is the required site in the first position of search results.

[ Link to article ]

Analyzer of subject search

A human being is more capable than the machine to interpret a search query, to assume which information the user requires, evaluate the information on the Web and form the search results. For this reason, the results of work of an expert are always better than those of an algorithm.
[ Link to article ]

Analyzer of correct hints

Most of the search engines attempt to suggest a correct spelling for a query in case a typo is suspected. The quality of such hints is an important addition to the overall quality of the search. This analyzer looks for the correct hint in the search results for a query with a deliberate typo and estimates the number of occurrences of a 'correct' query contained in the hint.
[ Link to article ]

Typo resistance analyzer

A human is no machine, it is bound to make mistakes. This includes the mistakes while typing in a search query: a typo, next button pressed by accident ("quety" instead of "query"), a double character or a missed one ("qury" or "queery"), after all, the user can type the word 'by ear' not knowing the correct spelling (we would get "yandax" instead of "yandex").
[ Link to article ]

Quotation search quality analyzer

Quotation search is a search for a certain text using its known fragment. This method is frequently used to find original literary works. A quality search engine should produce a link to a web page containing the text of the work the quotation was taken from. Ideally, it would be in the first position. For example, user submitting a query "To be or not to be, that is the question" is most probably looking for the text of Shakespeare's Hamlet, and the link to its text should be on the first page of the search results.
[ Link to article ]

Analyzer of search spam level

The company "Ashmanov and Partners" studies the phenomenon of search spam - the methods and technologies reducing the quality of search results and interfering with the operation of search engines.

Search spam is a text, URL, technology, program code or other web elements created by the web-master for the sole purpose of promoting the site in search engines' results, and not for a fast and reliable search based on complete and authentic information.

[ Link to article ]

SEO-pressing analyzer

Many queries are ambiguous. For instance, design, cars, sports, etc. These queries are defined as informational. The best result for such a query would be a selection of links for the resources corresponding different meanings of the query. Thus, the output for a "design" query should contain links to the sites on web-design, landscape design, interior design, etc.
[ Link to article ]

Analyzer of 'adult sites' entries in the search results

This analyzer is currently running in test mode, the pornography detection for text documents is being fine-tuned. Results may be incorrect.

The given analyzer collects search results for queries which may be interpreted as a search for a certain category of pornography, but this interpretation is not the only one possible. There are no queries included that would be an unambiguous search for porn.

For instance, a query "stockings" could come from a user looking for a stockings shop, or for the corresponding category of pornography.

[ Link to article ]

Recall analyzer

The Recall analyzer estimates the relative size of indices of a given set of Internet search engines.
[ Link to article ]

Update analyzer

Update of a search engine is the process of search results renewal. Some sites make it to the top 10, some sites "sink". Every search engine has its own update style which becomes clear in the corresponding analyzer. The search engine update analyzer monitors daily the top ten links referring to 140 queries to check the number of sites that changed their positions, and how much the positions have changed.
[ Link to article ]

Click analyzer

The analyzer of the percentage of clicks is not a "qualitative" analyzer, it only shows the popularity and usage of the search engines. For this analyzer, the data of the Liveinternet.ru is utilized. Thus we only take into account the clicks on sites that have a Liveinternet.ru counter installed.
[ Link to article ]