Formulates and runs search queries in the context of a displayed document. Predictive-search and contextual-query DNA — the user is reading a page; the system suggests searches relevant to what they're reading.
Patent Overview
- Inventor
- Yossi Matias, others
- Assignee
- Google LLC
- Filed
- 2012
- Granted
- 2016-05-17
The Challenge
The Challenge
Users reading a page often want to know more about something on it. Manually composing a search query mid-read is friction. Contextual query formulation — using the page content as input — produces relevant queries without the user typing.
- Manual Query Composition Is Friction — Switching from reading to typing breaks flow. Contextual generation removes friction.
- Page Context Reveals Intent — What the user is reading reveals likely follow-up intents. Page content is signal.
- Generated Queries Must Be Relevant — Wrong queries waste UI real estate. Generation must score for relevance.
- User Highlight Sharpens Intent — When users highlight text, that's strong intent signal. Highlighted text drives query generation.
- Results Must Continue Context — Generated query results should continue user's reading context, not derail it.
Innovation
How The System Works
The system reads the displayed document and any user-highlighted text, generates candidate queries from the content, ranks candidates by relevance and likely user intent, surfaces top candidates as predictive search suggestions, and runs results that continue the user's context.
- Read Page Context — Per displayed document, extract content, structure, and entities.
- Capture User Highlight — If user highlights text, capture as strong intent signal.
- Generate Candidate Queries — Per context plus highlight, generate candidate queries via NLP and entity extraction.
- Score Candidates — Per candidate, score relevance to context and likely user intent.
- Surface Top Candidates — Top candidates surface as predictive search suggestions in UI.
- Run Selected Query — On user selection, run query and return results.
- Maintain Context In Results — Results presented in a way that continues the user's reading context, not breaks it.
Context Drives Query Suggestion
The patent's load-bearing idea is that the document a user is reading carries query-suggestion signal. Reading-context-aware generation removes the friction of mid-read query composition.
Context Plus Highlight Equals Intent
Per page context, generic intent signal. Per user highlight, specific intent signal. Together, generated queries match what the user actually wants to look up.
- Page-Context Reading — Per document, content, structure, entities extracted.
- User-Highlight Capture — Highlighted text drives query generation strongly.
- Predictive Suggestion UI — Top candidates surface as predictive search suggestions in reading context.
Technical Foundation
Technical Foundation
The patent specifies the page-context reader, highlight capturer, candidate generator, scorer, suggestion surfacer, query runner, and context-preserving result presenter.
- Page-Context Reader — Per document, extracts content, structure, entities.
- Highlight Capturer — Per user highlight, captures as intent signal.
- Candidate Generator — Per context plus highlight, generates candidate queries.
- Scorer — Per candidate, scores relevance and intent likelihood.
- Suggestion Surfacer — Surfaces top candidates as predictive suggestions.
- Context-Preserving Result Presenter — Presents query results in continuing reading context.
The Process
The Process
Page-context reading and candidate generation run continuously while the user reads. Selection runs query.
- User Opens Page — Page displayed. Context reader extracts content.
- User Reads — Reading captured. Highlight optionally captured.
- Generate Candidates — Per context, candidates generated.
- Score Candidates — Candidates ranked.
- Surface Suggestions — Top candidates surface in UI.
- User Selects Suggestion — On selection, query runs.
- Present Results In Context — Results presented in continuing reading context.
Quality Control
Quality Control
Irrelevant suggestions waste UI. The patent specifies safeguards.
- Candidate-Score Threshold — Per candidate, score threshold for surfacing.
- Highlight-Driven Priority — Per user highlight, highlight-aligned candidates prioritized.
- Context-Preservation Validation — Result presentation validated for context continuity.
- Privacy Preservation — Page content and highlights handled with privacy preservation.
- Continuous Recalibration — Candidate generator and scorer recalibrate against fresh data.
Real-World Application
Contextual query formulation underpins Google's predictive-search and look-up-while-reading surfaces. The pattern of context plus highlight as intent signal appears across modern Assistant and Search Lens integrations.
- Context-driven Generation Source — Page content drives candidate query generation.
- Highlight-aware Intent Sharpening — User-highlighted text sharpens intent signal.
- Continuing-context Result Presentation — Query results presented in continuing reading context.
Why Clear, Entity-Rich Pages Win Context Suggestions
Pages with clear entity references and well-structured content produce strong context signal. Pages that talk around topics without naming entities generate weaker candidate queries.
Why Lookup-Friendly Content Compounds Discovery
Content that anticipates likely look-up questions and answers them inline reduces user need to look up. Pages doing this earn engagement and reduce result-search friction.
<\/section>What This Means for SEO
What This Means for SEO
This predictive-search patent generates follow-up queries from the page a user is reading, sharpened by highlighted text. SEO implication: clear, entity-rich content that anticipates and answers likely follow-up questions earns engagement and feeds these contextual suggestions.
- Entity-Rich Pages Generate Strong Suggestions — Candidate queries are generated from page content and entities. Pages with clear entity references produce strong context signal, while pages that talk around topics without naming entities generate weaker candidates.
- Anticipate And Answer Follow-Up Questions — The system predicts what users will look up next from what they are reading. Content that anticipates likely follow-up questions and answers them inline earns engagement and reduces the need for users to search away.
- Highlighted Text Is A Strong Intent Signal — When a user highlights text, that drives query generation strongly. Writing clear, self-contained, quotable passages around your key entities makes your content the kind users highlight and act on.
- Well-Structured Content Reads As Context — The reader extracts content, structure, and entities. Clean structure, descriptive headings, and explicit topic framing give the generator strong material to work with.
- Relevance Scoring Filters Weak Candidates — Generated queries are scored for relevance and surface only above threshold. Vague content that produces only generic follow-ups will not feed useful suggestions tied back to your page.
- Lookup-Friendly Content Compounds — Content that resolves the user's likely questions in place reduces lookup friction and earns engagement. Comprehensive coverage of a topic positions you as the page that answers rather than the page that prompts a new search.
- The Pattern Spans Assistant And Lens — Context-plus-highlight intent inference appears across Assistant and Search Lens integrations. Building clear, entity-rich, lookup-friendly content positions you across these contextual surfaces, not just classic search.