{"id":2861,"date":"2022-09-02T09:53:14","date_gmt":"2022-09-02T02:53:14","guid":{"rendered":"http:\/\/international.binus.ac.id\/computer-science\/?p=2861"},"modified":"2022-09-02T09:56:13","modified_gmt":"2022-09-02T02:56:13","slug":"how-do-search-engines-work","status":"publish","type":"post","link":"https:\/\/international.binus.ac.id\/computer-science\/2022\/09\/02\/how-do-search-engines-work\/","title":{"rendered":"How do Search Engines Work?"},"content":{"rendered":"<p>Original Article: <a href=\"https:\/\/www.deepcrawl.com\/knowledge\/technical-seo-library\/how-do-search-engines-work\/\">https:\/\/www.deepcrawl.com\/knowledge\/technical-seo-library\/how-do-search-engines-work\/<\/a><\/p>\n<p>&nbsp;<\/p>\n<h2>The Search Engine Index<\/h2>\n<p>Webpages that have been discovered by the search engine are added into a data structure\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Search_engine_indexing\" target=\"_blank\" rel=\"noopener noreferrer\">called an index.<\/a><\/p>\n<p>The index includes all the discovered URLs along with a number of relevant key signals about the contents of each URL such as:<\/p>\n<ul>\n<li>The\u00a0<strong>keywords<\/strong>\u00a0discovered within the page\u2019s content \u2013 what topics does the page cover?<\/li>\n<li>The type of\u00a0<strong>content<\/strong>\u00a0that is being crawled (using microdata called Schema) \u2013 what is included on the page?<\/li>\n<li>The\u00a0<strong>freshness<\/strong>\u00a0of the page \u2013 how recently was it updated?<\/li>\n<li>The previous\u00a0<strong>user engagement<\/strong>\u00a0of the page and\/or domain \u2013 how do people interact with the page?<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h2>What is The Aim of a Search Engine Algorithm?<\/h2>\n<p>The aim of the search engine algorithm is\u00a0<a href=\"https:\/\/static.googleusercontent.com\/media\/www.google.com\/en\/\/insidesearch\/howsearchworks\/assets\/searchqualityevaluatorguidelines.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">to present a relevant set of high quality search results<\/a>\u00a0that will fulfil the user\u2019s query\/question as quickly as possible.<\/p>\n<p>The user then selects an option from the list of search results and this action, along with subsequent activity, then feeds into future learnings which can affect search engine rankings going forward.<br \/>\n<a class=\"anchor\" name=\"3\"><\/a><\/p>\n<h2>What happens when a search is performed?<\/h2>\n<p>When a search query is entered into a search engine by a user, all of the pages which are deemed to be relevant are identified from the index and an algorithm\u00a0<a href=\"https:\/\/moz.com\/beginners-guide-to-seo\/how-search-engines-operate\" target=\"_blank\" rel=\"noopener noreferrer\">is used to hierarchically rank the relevant pages into a set of results.<\/a><\/p>\n<p>The algorithms used to rank the most relevant results differ for each search engine. For example, a page that ranks highly for a search query in\u00a0<a href=\"https:\/\/support.google.com\/webmasters\/answer\/35769?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\">Google<\/a>\u00a0may not rank highly for the same query in\u00a0<a href=\"https:\/\/www.bing.com\/webmaster\/help\/webmaster-guidelines-30fba23a\" target=\"_blank\" rel=\"noopener noreferrer\">Bing<\/a>.<\/p>\n<p>In addition to the search query, search engines use other relevant data to return results, including:<\/p>\n<ul>\n<li><strong>Location<\/strong>\u00a0\u2013\u00a0<a href=\"https:\/\/static.googleusercontent.com\/media\/research.google.com\/en\/\/pubs\/archive\/37522.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">Some search queries are location-dependent<\/a>\u00a0e.g. \u2018cafes near me\u2019 or \u2018movie times\u2019.<\/li>\n<li><strong>Language detected<\/strong>\u00a0\u2013\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Google_Personalized_Search#Data_collection\" target=\"_blank\" rel=\"noopener noreferrer\">Search engines will return results in the language of the user<\/a>, if it can be detected.<\/li>\n<li><strong>Previous search history<\/strong>\u00a0\u2013 Search engines will return different results for a query\u00a0<a href=\"https:\/\/static.googleusercontent.com\/media\/research.google.com\/en\/\/pubs\/archive\/32708.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">dependent on what user has previously searched for.<\/a><\/li>\n<li><strong>Device<\/strong>\u00a0\u2013\u00a0<a href=\"https:\/\/searchengineland.com\/google-divide-index-giving-mobile-users-better-fresher-content-261037\" target=\"_blank\" rel=\"noopener noreferrer\">A different set of results may be returned based on the device<\/a>\u00a0from which the query was made.<\/li>\n<\/ul>\n<p><a class=\"anchor\" name=\"4\"><\/a><\/p>\n<h2>Why Might a Page Not be Indexed?<\/h2>\n<p>There are a number of circumstances where a URL will not be indexed by a search engine. This may be due to:<\/p>\n<ul>\n<li><strong><a href=\"https:\/\/www.robotstxt.org\/orig.html\" target=\"_blank\" rel=\"noopener noreferrer\">Robots.txt file exclusions<\/a><\/strong>\u00a0\u2013 a file which tells search engines what they shouldn\u2019t visit on your site.<\/li>\n<li><strong>Directives on the webpage<\/strong>\u00a0<a href=\"https:\/\/www.deepcrawl.com\/blog\/best-practice\/noindex-disallow-nofollow\/\" target=\"_blank\" rel=\"noopener noreferrer\">telling search engines not to index that page<\/a>\u00a0(<strong>noindex tag<\/strong>) or to index another similar page (<strong>canonical tag<\/strong>).<\/li>\n<li>Search engine algorithms judging the page to be of\u00a0<a href=\"https:\/\/support.google.com\/webmasters\/answer\/66361?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\"><strong>low quality<\/strong><\/a>, have\u00a0<strong><a href=\"https:\/\/www.deepcrawl.com\/blog\/best-practice\/google-panda-4-1-how-to-identify-thin-or-poor-quality-content-with-deepcrawl\/\" target=\"_blank\" rel=\"noopener noreferrer\">thin content<\/a><\/strong>\u00a0or contain\u00a0<strong><a href=\"https:\/\/www.deepcrawl.com\/blog\/best-practice\/advanced-duplicate-content\/\" target=\"_blank\" rel=\"noopener noreferrer\">duplicate content<\/a><\/strong>.<\/li>\n<li>The URL returning\u00a0<strong>an error page<\/strong>\u00a0(e.g. a\u00a0<strong><a href=\"https:\/\/en.wikipedia.org\/wiki\/List_of_HTTP_status_codes#4xx_Client_errors\" target=\"_blank\" rel=\"noopener noreferrer\">404 Not Found<\/a><\/strong>\u00a0HTTP response code).<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Original Article: https:\/\/www.deepcrawl.com\/knowledge\/technical-seo-library\/how-do-search-engines-work\/ &nbsp; The Search Engine Index Webpages that have been discovered by the search engine are added into a data structure\u00a0called an index. The index includes all the discovered URLs along with a number of relevant key signals about the contents of each URL such as: The\u00a0keywords\u00a0discovered within the page\u2019s content \u2013 what [&hellip;]<\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[112],"tags":[],"class_list":["post-2861","post","type-post","status-publish","format-standard","hentry","category-article"],"_links":{"self":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts\/2861"}],"collection":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/comments?post=2861"}],"version-history":[{"count":1,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts\/2861\/revisions"}],"predecessor-version":[{"id":2862,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/posts\/2861\/revisions\/2862"}],"wp:attachment":[{"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/media?parent=2861"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/categories?post=2861"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/international.binus.ac.id\/computer-science\/wp-json\/wp\/v2\/tags?post=2861"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}