How Do Search Engines Decide Which Information Is Relevant

How Do Search Engines Decide Which Information Is Relevant

How Do Search Engines Decide Which Information is Relevant for The User?

Today it is about understanding how search engines decide which information is relevant for a user and how the search results are calculated after a search query. If possible, I don’t want to favor search engines in my series. But since Google is probably the most well-known in the world, I will explain many examples, again and again, using Google. 

In the first part, we have already learned that search engines use crawlers to continuously collect information and websites from the entire public Internet, catalog it and store it in the search engine index. The data is already prepared so that the search engine can later work optimally when processing a search query (query processing). The Google search index contains billions of web pages and is over 100,000,000 gigabytes in size. It works like the index at the end of a book.

Based on What Information Does Google Generate Search Results?

Today, however, Google does not only collect and process words. Other data is also collected, indexed, and processed, such as the website’s server location, responsiveness, website structure, and loading speed. Prices, flight routes, job advertisements, and hotel rooms are also recorded and displayed in a structured manner in the search results. 

However, Google does not only generate search results based on the data and relevance of a website. The location of the user performing the search, the device from which he does it (e.g. smartphone), and the way he does it (question? Purchase intention. Search for information) also play a significant role in the whole.

Which Ranking Factors Does a Search Engine Use When Answering a Search Query?

This question is not that easy to answer in general since nobody knows the weighting of the individual ranking factors and the Google algorithms exactly. But there are empirical observations, correlation studies, and SEO best practices that Google itself discloses. So today we know what is important for the optimization of a website and how Google reads data, interprets it, and presents it in the search results. The way Google processes and evaluates data is also constantly changing. Alone in the year 2018 there were over 3200 changes to the various Google algorithms. Anyone who has ever been to one of the 15 Google data centers worldwide will get a rough idea of the amount of data the search engine giant is dealing with.

“But Why is All This for Nothing?"

It’s simple: Google makes most of its revenue by showing ads. You can place advertisements for certain keywords in the Google search. If a user clicks on an ad, Google earns money and the advertiser may have a new customer. Since Google lives from a large number of users, the Google search must have added value, work free of manipulation, and be user oriented. No advertising revenue without users. Incidentally, the advertising system of Google “Google Ads” is strictly separated from the organic search. You can’t buy organic placement; you can only place an ad for keywords.

Which Method Is More Advantageous? Organic Growth or Pay for Ads?

Of course, getting a website to rank high organically (not paid) is much more cost-effective in the long run than paying for advertising. That’s why SEO is so important. 

The Google search as it is today no longer a static keyword file, but a complex mechanism paired with artificial intelligence, machine learning, and quality assurance, which is intended to provide the user with the best possible search result. And that in over 170 languages and even artificial languages.

Important: I must again emphasize here that the informal scope in this series is kept to a minimum. If I were to explain all the features, functions, updates, and algorithms here, it would be like a novel.

How Does Google Generate The Rankings for The Search Results Pages?

When calculating the rankings, i.e. the order of the websites in the search results (SERP) and the graphical output, two processes run in parallel:

How Search Terms are Converted by Google?

If you enter a search term or a question in the web interface of the Google search (e.g. Google.de), then these are first parsed after pressing the Enter key. This means, for example, that a question is only converted into meaningful and processable nouns (keywords), verbs (costs, means, weighs) and question words (what, who, how, where). Using these words, the query processor can then use the index and algorithms to determine the relevance of cataloged Calculate websites from the database and display them graphically in the Google web interface in less than a second. Basically, the parser is an interface to the database. Because Google maintains data centers worldwide, the proximity of a server with a parser and query processor is always very small, which enables high speed when answering a search query. 

Problems Can be Addressed by Core Updates and BERT

The difficulty or rather the challenge for Google is to infer the intention behind a search phrase or a question as best as possible. That was also the main work of the developers in recent years. With Core Updates and updates like BERT, these challenges and problems have been addressed. 

For example, the nouns provide information about the topic of the search and the verbs questions about the intention. Algorithms then decide with which weighting factor is applied to the database (index) and the query processor ultimately determines the order of the search results. The cataloging and sorting of the data are already done during the crawling.

So, we can see that the Google search is quite complex, and it takes several steps to get the search result. And at the same time, relevant advertisements from the Google Ads system are also imported. And all this in less than a second. Respect!

How Does Google Decide What Information is Relevant to the User?

To answer this question as best as possible but also compactly: Google uses a lot of data and factors to answer a search query. On the one hand, there is the search engine index, which has categorized all websites stored on the Internet, on the other hand, there is an algorithm that answers each search query individually.
For example, when processing a search query for search engines, the following questions play a role:

Of course, these questions are only symbolic of the complexity behind them. I’m just trying to clarify how relevance can be checked. It’s important to understand that a website’s relevancy score for a keyword root or topic is already set in the index. A search query is almost always just “queried”. This also explains the secretly high speed with which Google can answer a search query. 

If a user searches “transaction-oriented”, ie he wants to buy something, then online shops and shopping ads are more in the foreground. If a user searches in an “information-oriented” way, ie he is looking for a definition, for example, then Wikipedia entries, encyclopedias and wikis are more likely to appear. You know this yourself from your daily Google searches.

Search Engine and Ranking FAQ

We know that Google uses several hundred ranking factors and several different search algorithms (e.g. when calculating or displaying search results). Due to AI and machine learning, it is no longer possible to speak of static ranking factors today. There is a core algorithm and many quality algorithms that are used, for example, to identify spam websites or search intent. As an example of how complex and profound the ranking calculation is, you can take a look at the Google Search Quality Rater Guidelines. Whereby these only illustrate from the user’s point of view what is important for the search engine.
As just described, the algorithm, or better the algorithms of Google work very complexly. There are hundreds of signals and factors that Google uses to answer a search query, including: Keywords, website structure, usability, content quality, information content, website loading speed, website topic, language, user location, domain authority, popularity (backlinks)… Google also distinguishes between different industries, e.g. finance, health, e-Commerce, and Law determined and processed their search queries or the search results differently. For example, local searches return lots of Google Maps listings and local businesses, while shopping searches return product listings, shopping ads, and even prices. The same applies to news: First and foremost, you get snippets from media and news portals. So, we see that Google search is no longer static.
Yes. Otherwise, Google would be susceptible to manipulation. It can be assumed that several development teams at Google are working on the search algorithms in parallel, and therefore probably no one knows exactly how and to what extent individual factors and algorithms affect them. One can also assume that the Google algorithm is no longer as static as it was 10 years ago. Due to AI and machine learning, search results are becoming more and more individual and precise for the user.
No technical measures and SEO can be used to optimize the way Google interprets and evaluates data, thereby increasing positioning for relevant search queries. Manipulations, as in the past through black hat SEO, backlink spam, and other techniques, are no longer so easy today thanks to various quality mechanisms and algorithms (spam detection). There are always gaps (exploits/glitches), which are fixed very promptly by Google. In terms of quality, this has the advantage, especially for users, that they see good and relevant content as a search result.
As already described above, the algorithms no longer only follow individual factors such as keywords, but also the context and intention behind a search query. For example, with the help of natural language processing (see BERT update), Google can better understand the semantic context of a search query and thus answer long questions. Certain standard factors certainly help search engines to recognize and index content better, but it is up to the unknown algorithm to determine which content best answers an individual search query.
Clearly: yes. Due to its market power and sales, Google has significantly more capital and manpower for the constant further development of features and algorithms than its competitors. Although other search engines also follow certain “basic rules”, the quality of the search results is probably nowhere as “fine” and “user-oriented” as with Google.

This happens in under a second. Various factors and algorithms are used to generate the best possible search result or a search result page from the indexed data. Google always tries to answer the search query in the best possible way. This also applies to advertisements in Google search. The focus is always on the user and his search intention.

Share This Post