Decides whether to generate an alternative query and re-run retrieval based on whether the original query's top results are dominated by low-quality sites, replacing the result set with one weighted toward high-quality sites when the original underperforms.
Patent Overview
- Inventor
- Navneet Panda
- Assignee
- Google LLC
- Filed
- 2012-12-27
- Granted
- 2015-09-15
- Application Number
- US 13/728,693
The Challenge
Not Every Query Deserves The Same Retrieval Treatment
When the top results for a query are dominated by low-quality sites, retrieval is failing the user. The system should detect this condition and try an alternative query that pulls in higher-quality results. But it shouldn't run alternatives indiscriminately — when the original query's top results are already high quality, the alternative would only add cost. The decision needs to be query-by-query.
- Low-Quality Top Results Are A Failure Signal — When the top-N results for a query come from sites already flagged as low quality, the retrieval has failed even though the matches are technically relevant. The system should react to this condition.
- Alternative Queries Have A Cost — Running an alternative query plus re-ranking is more expensive than the original retrieval. The decision to do it must be gated by genuine need.
- Need A Threshold Test — A configurable threshold N decides how many low-quality results in the top-K trigger the alternative. The threshold lets the system tune aggressiveness.
- Alternative Must Produce High-Quality Results — Generating an alternative is only useful if the alternative's results are higher quality. The alternative is constrained to ensure its top-M results come from sites not flagged as low quality.
Innovation
Threshold-Gated Alternative-Query Substitution
The system receives data identifying first resources responsive to a first query. If at least N top-ranked first resources are located on sites previously identified as low-quality, the system obtains a second query and data identifying second resources responsive to it, where at least M top-ranked second resources are located on sites not flagged as low quality. The alternative result set replaces or augments the original.
- Receive Original Query — User submits a query. Standard retrieval produces the first result set.
- Check Top-K Quality Composition — Look at the top-K results. Count how many come from sites flagged as low quality.
- Compare To Threshold N — If the count meets or exceeds N, the original query's top-K is judged too low-quality. Trigger the alternative path.
- Generate Alternative Query — Construct an alternative query that, when run, will produce a result set whose top-M results are NOT from low-quality sites.
- Execute Alternative — Run the alternative query and retrieve its result set.
- Substitute Or Augment — The alternative results replace or augment the original results in the final SERP shown to the user.
Quality-Gated Query Rewriting
The patent links query rewriting to site-quality flags directly. Rewriting isn't applied uniformly; it's applied when the original retrieval surfaces too many flagged sites in the top results.
Rewrite When The Original Fails Quality
If the top results are dominated by low-quality sites, rewrite. If they're not, leave the original retrieval alone.
- Threshold N — How many low-quality top results trigger rewriting. Tunable.
- Threshold M — How many of the alternative's top results must come from non-flagged sites. Tunable.
What This Means for SEO
What This Means for SEO
Quality-gated query rewriting creates a hidden second-chance retrieval that high-quality content benefits from. Understanding the mechanism informs how to position content for queries currently dominated by low-quality competitors.
- Low-Quality Competitors Trigger Your Inclusion — When a query's original top results are dominated by low-quality sites, the system rewrites and pulls in alternative results from high-quality sites. Being a high-quality site in a niche dominated by low-quality competitors is an asymmetric advantage.
- Site-Quality Flag Is The Gate — The rewriting cares specifically about whether sites are flagged as low quality. Avoiding the low-quality flag (good site-quality score, good Panda factor) keeps you in the eligible pool for alternative-query inclusion.
- Niche Quality Compounds With Niche Coverage — Niches dominated by content farms reward high-quality entrants because the rewriting actively seeks them. Building authoritative content in low-quality-dominated niches has structural advantage beyond direct ranking.