Google's Query Refinements

Search engines aim to return the most relevant results for a query, but what they actually return is limited by the query itself. A query can be too specific or too general for a search engine to identify good results. Google has filed patent applications on alternative query terms, or query refinements, that offer a solution.

The Google Solution

Queries that tend to produce poor results include those containing homonyms, which are words that share a sound or spelling but have different meanings. Words used in an unclear context can likewise confuse a search engine. Very general terms return results that are too broad, while very narrow terms can be too restrictive and may return non-responsive results.

Google presents a system and method that attempts to address this problem. In this system, a stored query and a stored document are associated as a logical pairing, and the pairing is assigned a weight. When a search query is issued, a set of search documents is produced, and at least one search document is matched to at least one stored document. The stored query and the assigned weight associated with each matching stored document are then retrieved. At least one cluster is formed from the retrieved queries and weights, and the queries are scored for at least one cluster relative to at least one other cluster. At least one such scored query is suggested as a set of query refinements.

The process starts when Google takes the top 100 documents from the search results for clustering. During this phase, term vectors are computed for each of these documents, which have been ranked by relevance score. The documents are matched to stored documents listed in an association database. Alternative query terms are found by looking up the queries that were associated beforehand with the matched stored documents.
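The lookup step described above can be sketched in a few lines. This is only an illustration: the ASSOCIATIONS table, the document identifiers, the weights, and the choice to sum weights per query are all assumptions made for the example, not details taken from the patent application.

```python
from collections import defaultdict

# Hypothetical association database: stored document -> [(stored query, weight)].
ASSOCIATIONS = {
    "doc_a": [("jaguar car", 0.9), ("jaguar price", 0.4)],
    "doc_b": [("jaguar animal", 0.8)],
    "doc_c": [("jaguar car", 0.5)],
}

def candidate_queries(search_results, associations, top_n=100):
    """Collect stored queries for the top-ranked results that match stored documents.

    Summing the weights per query is an assumption made for this sketch.
    """
    totals = defaultdict(float)
    for doc_id in search_results[:top_n]:
        # Results with no stored counterpart (such as "doc_x") contribute nothing.
        for query, weight in associations.get(doc_id, []):
            totals[query] += weight
    return dict(totals)

# Ranked results for a hypothetical search on "jaguar".
results = ["doc_a", "doc_x", "doc_c", "doc_b"]
candidates = candidate_queries(results, ASSOCIATIONS)
```

Here "jaguar car" collects weight from two matched documents, which is the kind of signal the later clustering and scoring steps could build on.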

Term vectors are also created for the alternative query terms. Clusters are created from both sets of term vectors to form groupings, and a centroid is calculated for each cluster. Search queries associated with a search document in the cluster are scored according to their distance from this centroid and the percentage of stored documents occurring in the cluster. The best suggested query refinement contains the highest number of search query terms and the terms most frequently seen in the documents in the cluster.
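A small sketch may make the centroid scoring easier to picture. The article names the two inputs to the score (distance from the centroid and the share of stored documents in the cluster) but not how they are combined, so the formula in score() below is an assumption, as are the Euclidean distance measure and the example vectors.

```python
import math

def centroid(vectors):
    """Average a list of sparse term vectors (dicts mapping term -> weight)."""
    terms = {t for v in vectors for t in v}
    return {t: sum(v.get(t, 0.0) for v in vectors) / len(vectors) for t in terms}

def distance(u, v):
    """Euclidean distance between two sparse term vectors."""
    terms = set(u) | set(v)
    return math.sqrt(sum((u.get(t, 0.0) - v.get(t, 0.0)) ** 2 for t in terms))

def score(query_vector, cluster_vectors, docs_with_query, docs_in_cluster):
    """Assumed rule: cluster coverage discounted by distance to the centroid."""
    coverage = docs_with_query / docs_in_cluster
    return coverage / (1.0 + distance(query_vector, centroid(cluster_vectors)))

# Two hypothetical document term vectors forming one cluster.
cluster = [
    {"jaguar": 1.0, "car": 1.0},
    {"jaguar": 1.0, "price": 1.0},
]
near = score({"jaguar": 1.0, "car": 0.5, "price": 0.5}, cluster, 2, 2)
far = score({"jaguar": 1.0, "animal": 1.0}, cluster, 1, 2)
```

A query that sits on the centroid and covers every document in the cluster scores highest, while a query about a different sense of the word scores lower on both counts.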

Other clusters and query names may be created to come up with additional suggested query refinements. Refinements are sorted by relevance score. Alternative queries can include negated forms of terms that appear in the set of refinements but do not appear in the original search query. A number of predetermined search queries selected from past user queries can be used to arrive at a precomputed set of possible refinements. The predetermined queries would be issued ahead of time and their search results kept in a database for future user search requests. The refined queries would be provided to the user together with the results of the original search.
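The negated alternatives mentioned above could be generated along these lines. The function name and the use of a leading minus sign to mark a negated term are assumptions for illustration only.

```python
def negated_alternatives(original_query, refinements):
    """Build alternatives that exclude terms found only in the refinements.

    The "-term" syntax for negation is an assumption made for this sketch.
    """
    original_terms = set(original_query.lower().split())
    new_terms = []
    for refinement in refinements:
        for term in refinement.lower().split():
            # Keep terms that are new to the original query, once each, in order.
            if term not in original_terms and term not in new_terms:
                new_terms.append(term)
    return [f"{original_query} -{term}" for term in new_terms]

alternatives = negated_alternatives("jaguar", ["jaguar car", "jaguar animal", "car price"])
```

For a searcher interested in the animal, an alternative such as "jaguar -car" removes the competing sense of the word rather than adding a narrowing term.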

The precomputation stage happens before any query is entered into the search engine. It is best described with the use of at least four parts: associator, selector, regenerator and inverter.

The associator creates relevance-weighted relationships between stored queries and stored documents. The selector decides which stored documents and stored queries should be retrieved. The regenerator looks at query logs and selects stored documents based on previous searches. The inverter looks at the cached data and selects documents and associated queries based on the cached data.
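As a rough picture of what the associator might produce, the sketch below inverts precomputed results (stored query to scored documents) into a table keyed by stored document. The input shape, the function name, and the reuse of the relevance score as the pairing weight are assumptions, not details from the patent application.

```python
from collections import defaultdict

def build_associations(precomputed_results):
    """Invert stored query -> [(document, relevance)] into document -> [(query, weight)].

    Using the relevance score directly as the pairing weight is an assumption.
    """
    associations = defaultdict(list)
    for query, scored_docs in precomputed_results.items():
        for doc_id, relevance in scored_docs:
            associations[doc_id].append((query, relevance))
    # List the strongest pairing first for each stored document.
    for pairs in associations.values():
        pairs.sort(key=lambda pair: pair[1], reverse=True)
    return dict(associations)

# Hypothetical results gathered by issuing stored queries ahead of time.
precomputed = {
    "jaguar price": [("doc_a", 0.4), ("doc_b", 0.7)],
    "jaguar car": [("doc_a", 0.9)],
}
table = build_associations(precomputed)
```

Keying the table by document is what lets a later stage start from the documents returned for a live query and work back to the stored queries paired with them.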

The query refinements system itself has four parts. A matcher matches one or more stored documents to the actual search documents that the search engine has generated in answer to a search query. It also identifies the stored queries and assigned weights using the associations corresponding to the matched stored documents. A clusterer forms one or more clusters using term vectors formed from the terms occurring in the matched stored queries and their corresponding weights. A scorer computes centroids, which represent the weighted center of each cluster's term vectors. A presenter offers the highest-scoring search queries to the user as one or more query refinements. The interesting aspect of this approach is how user data is incorporated into results through the use of log files and cached information.

The patent application shows one way of achieving query refinements, but no one outside Google knows exactly how it comes up with alternative results. However, the application offers some hints on how to create website content and how to show up in these alternative results. Carefully considering the words that people will probably search for, and what appears in Google's results for those search phrases, can give a clue as to how the search refinements approach will treat a website.

Multi-Stage Query Processing

Determining how relevant a page is to a searcher's query involves considering how a term or phrase is used in the context of that page. Google has likewise submitted a patent application that looks into possible ways of considering the context of these words. It describes a multi-stage process that determines relevancy and finds results for a search.

The actions described in this document can be divided into stages. The first stage deals with the deletion of stop words, term stemming, and the expansion of queries with synonyms and related terms that commonly co-occur with the query terms. During this stage, relevancy scores between the query and each document are computed with one or more scoring algorithms. The second stage uses the adjacency and proximity of terms to rank documents. The third stage reviews term attributes, such as whether terms appear in titles, headings or metadata, or whether they have certain font characteristics. The fourth and last stage is the generation of snippets to return with the results.
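The first two stages can be illustrated with a toy example. Everything here is a stand-in: the stop-word list, the synonym table, and the suffix-stripping stem() function are assumptions chosen to keep the sketch short, and a real system would use far richer resources.

```python
STOP_WORDS = {"the", "a", "an", "of", "for", "in", "to"}  # assumed, tiny list
SYNONYMS = {"car": ["automobile"]}  # hypothetical expansion table

def stem(term):
    """Crude suffix stripping; a stand-in for a real stemmer."""
    for suffix in ("ing", "s"):
        if term.endswith(suffix) and len(term) - len(suffix) >= 3:
            return term[: -len(suffix)]
    return term

def preprocess(query):
    """Stage one: drop stop words, stem the rest, then append synonyms."""
    terms = [stem(t) for t in query.lower().split() if t not in STOP_WORDS]
    expanded = list(terms)
    for term in terms:
        expanded.extend(s for s in SYNONYMS.get(term, []) if s not in expanded)
    return expanded

def min_gap(doc_tokens, term_a, term_b):
    """Stage two signal: smallest gap in token positions between two terms.

    Returns None when either term is missing from the document.
    """
    pos_a = [i for i, t in enumerate(doc_tokens) if t == term_a]
    pos_b = [i for i, t in enumerate(doc_tokens) if t == term_b]
    if not pos_a or not pos_b:
        return None
    return min(abs(i - j) for i in pos_a for j in pos_b)

terms = preprocess("the prices of cars")
gap = min_gap("red car for sale at a low price".split(), "car", "price")
```

A smaller gap suggests the two terms are used together on the page, which is the kind of evidence the second stage could use to rank one document above another.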

Interactive query refinement has been shown to promote effective retrieval. Major search engines use the history of a user's actions, such as queries or clicks, to personalize search results. Query-specific web recommendations (QSRs) retroactively answer queries from the user's history as new results arise. Their main goal is to recommend new web pages for a user's old queries. However, this is of no use unless the user has a standing interest in a particular query. Focus can also be shifted from individual queries to query sessions, which include all actions associated with a given initial query. A query is considered a refinement of the previous one if both queries contain at least one common term.
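The common-term rule in the last sentence is simple enough to sketch directly. Splitting on whitespace, ignoring case, and grouping consecutive refinements into a session are assumptions made for the example.

```python
def is_refinement(previous_query, query):
    """True when the two queries share at least one term (case-insensitive)."""
    return bool(set(previous_query.lower().split()) & set(query.lower().split()))

def split_sessions(queries):
    """Group a user's queries, in order, into sessions of successive refinements."""
    sessions = []
    for query in queries:
        if sessions and is_refinement(sessions[-1][-1], query):
            sessions[-1].append(query)
        else:
            # No shared term with the previous query: start a new session.
            sessions.append([query])
    return sessions

history = ["jaguar", "jaguar car", "car price", "weather today"]
sessions = split_sessions(history)
```

Note that each query is compared only with the one before it, so "car price" stays in the session that began with "jaguar" even though the two share no term.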

http://www.theinternetone.net
