Search Engine Land
  • SEO
    • > All SEO
    • > What Is SEO?
    • > SEO Periodic Table
    • > Google: SEO
    • > Bing SEO
    • > Google Algorithm Updates
  • PPC
    • > All PPC
    • > What is PPC?
    • > Google Ads
    • > Microsoft Ads
    • > The Periodic Tables of PPC
  • Focuses
    • > Local
    • > Commerce
    • > Shopify SEO Guide
    • > Content
    • > Email Marketing Periodic Table
    • > Social Media Marketing
    • > Analytics
    • > Search Engine Land Awards
    • > All Focuses
  • SMX
  • Webinars
  • Intelligence Reports
  • White Papers
  • About
    • > About Search Engine Land
    • > Newsletter
    • > Third Door Media
    • > Advertise

Processing...Please wait.

Search Engine Land » Channel » Local » Googlebot In Aisle Three: How Google Plans To Index The World?

Googlebot In Aisle Three: How Google Plans To Index The World?

Robots reading cereal boxes in the supermarket? Googlebot at the art museum? Street signs and building addresses snatched from Street View images for local search, image search, and product search? Three new patent applications published at the U.S. Trademark and Patent Office this week explore the intricacies of reading text in images taken from Google’s […]

Bill Slawski on January 4, 2008 at 11:14 am

Robots reading cereal boxes in the supermarket? Googlebot at the art museum? Street signs and building addresses snatched from Street View images for local search, image search, and product search?

Three new patent applications published at the U.S. Trademark and Patent Office this week explore the intricacies of reading text in images taken from Google’s Street View project and some interesting steps beyond those. I described a number of the implications behind the patent filings in an SEO by the Sea post from last night: Google on Reading Text in Images from Street Views, Store Shelves, and Museum Interiors.

Let’s take a slightly different look.


One of the most fun blog posts of last year was a spoof titled Google Interiors – the day my house became searchable. The satire seems to have come a little closer to reality, with the publication of these three patent filings.

The patent applications involved are:

  • Recognizing text in images
  • Enhancing text in images
  • Using extracted image text

The most sensational aspects of the documents come towards the end where we are told that robots might be used to take pictures of products on store shelves and in museums. A snippet from the filings:

In addition to street scenes, indexing can be applied to other image sets. In one implementation, a store (e.g., a grocery store or hardware store) is indexed. Images of items within the store are captured, for example, using a small motorized vehicle or robot. The aisles of the store are traversed and images of products are captured in a similar manner as discussed above. Additionally, as discussed above, location information is associated with each image. Text is extracted from the product images. In particular, extracted text can be filtered using a product name database in order to focus character recognition results on product names.

There’s a science fiction element to this world of robots running amuck in supermarkets, but there’s also a lot of science involved in the documents. The descriptions of how text might be taken from street view images describes a number of techniques that account for problems with images, such as those caused by low contrast from shadows and shading. The use of consecutive images from the Street View cameras can also enhance the reading of text that might be blurry or partially hidden from view in one or more shots.

Here’s a screenshot from the patent filings, which shows a number of places where text might be extracted from one image:

google-images-1

Some of the image techniques described in this document were first hinted at in the patent applications behind Google’s Book project, which I wrote about in the summer of 2006 in Patent applications provide window into Google Book Search and Gmail. Those documents discuss the use of optical character recognition to both read the text within books and to understand differences in the structural elements of that text, so that, for instance, chapter headings in books or article titles in magazines might be seen and indexed differently than body text from those documents.

These text recognition and extraction techniques will work with digital still images and with video images. A number of the techniques described work best with video, where there might be multiple images of a view from slightly different angles. If the Street View filming apparatus also included a laser distance measuring device, described in the patent filings, that may also help to eliminate false positives in recognizing text.

It’s been an old sawhorse for years that Google couldn’t recognize text that was displayed in images while indexing pages on the Web. These patent filings hint that Google may be able to do much more with images than we can imagine.

Some of the things that this technology could be used for:

  • Improving local search, and showing images of the actual locations of businesses
  • Providing images of other nearby businesses in a local search
  • Showing alternative businesses near a location that may offer similar products or services during a local search or product search
  • Picturing actual landmarks along a driving route
  • Allowing for a wider range of keyword searches associated with businesses, and images of those businesses
  • Enabling product searches associated with specific businesses at specific locations
  • Allowing museums to be searched by keyword, or to be browsed

It’s difficult to tell if and when we might see googlebot in the grocery stores, but we probably should start wondering how well Google might be able to handle text within images on the Web these days.


Opinions expressed in this article are those of the guest author and not necessarily Search Engine Land. Staff authors are listed here.


New on Search Engine Land

    More FAQ rich results being displayed in Google Search

    Webinar: Benchmark your social media performance for a competitive edge

    Google releases May 2022 broad core update

    Spotify, Meta update political ad offerings for 2022 election cycle

    Take web hosting to the (NVMe) extreme

About The Author

Bill Slawski
Bill Slawski is the Director of Search Marketing for Go Fish Digital and the editor of SEO by the Sea. He has been doing SEO and web promotion since the mid-90s, and was a legal and technical administrator in the highest level trial court in Delaware.

Related Topics

Image SearchLocal

Get the daily newsletter search marketers rely on.

Processing...Please wait.

See terms.

ATTEND OUR EVENTS

Learn actionable search marketing tactics that can help you drive more traffic, leads, and revenue.

March 8-9, 2022: Master Classes (virtual)

June 14-15, 2022: SMX Advanced (virtual)

November 15-16, 2022: SMX Next (virtual)

Learn More About Our SMX Events

Discover time-saving technologies and actionable tactics that can help you overcome crucial marketing challenges.

Start Discovering Now: Spring (virtual)

September 28-29, 2022: Fall (virtual)

Learn More About Our MarTech Events

Webinars

Take a Crawl, Walk, Run Approach to Multi-Channel ABM

Content Comes First: Transform Your Operations With DAM

Dominate Your Competition with Google Auction Insights and Search Intelligence

See More Webinars

Intelligence Reports

Enterprise SEO Platforms: A Marketer’s Guide

Enterprise Identity Resolution Platforms

Email Marketing Platforms: A Marketer’s Guide

Enterprise Sales Enablement Platforms: A Marketer’s Guide

Enterprise Digital Experience Platforms: A Marketer’s Guide

Enterprise Call Analytics Platforms: A Marketer’s Guide

See More Intelligence Reports

White Papers

Reputation Management For Healthcare Organizations

Unlock the App Marketing Potential of QR Codes

Realising the power of virtual events for demand generation

The Progressive Marketer’s Ultimate Events Strategy 2022 Worksheet

CMO Guide: How to Plan Smart and Pivot Fast

See More Whitepapers

Receive daily search news and analysis.

Processing...Please wait.

Topics

  • SEO
  • PPC

Our Events

  • Search Marketing Expo - SMX
  • MarTech

About

  • About Us
  • Contact
  • Privacy
  • Marketing Opportunities
  • Staff

Follow Us

  • Facebook
  • Twitter
  • LinkedIn
  • Newsletters
  • RSS
  • Youtube

© 2022 Third Door Media, Inc. All rights reserved.

We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.Ok
news medicine seo game game business health news news news health news news https://latestlayrics.com job seo news game news seo health health news news news seo news news news seo news medicine news