The Google Effect and the Semantic Web

Jennifer Zaino
SemanticWeb.com Contributor

NEW YORK — It’s time to put the semantic web in a business context, especially for the struggling publishing industry. That’s exactly what Tom Tague, Calais Initiative Lead at Thomson Reuters, did for the audience at the opening keynote address at the Web 3.0 Conference here today.



“The hard message [to the publishing industry] is there will be winners and losers,” Tague said. Spending is not moving from print to online media anymore: “There’s only one pie. It’s a zero sum game, and the pie is getting smaller.”

Calais’ mission, Tague says, is to enable the winners in the sector. If you want to be among them, you had better move fast to:


  • Automate/ semi-automate workflow

  • Improve audience development

  • Enhance content
  • The first factor may not read sexy, but it reads important, because it goes right to the heart of cost reduction, which is critical to every business today, but particularly to the battered publishing industry.

    “The real world is workflow systems, custom content management systems, the stuff that lives in the boiler room of large enterprises,” Tague says. Semantic technologies must be integrated into these systems, for organizations to realize improved editorial efficiencies, generating semantic topics and metadata, tagging and moving articles through more quickly, and realizing ten to hundreds of thousands of dollars in savings. “Cost reduction is the way to get companies started. It’s the way to build support within your organization.”

    The bar also is being raised on audience development — improving search to get the audience and giving them a good experience to keep them. Thanks to Google’s Rich Snippets, the game is changing.

    “Google is now harvesting semantic metadata,” Tague says. “And I guarantee you they will use it to decide on rankings. So while this was a nice to have a week ago, now it’s a must have. You must embed semantic metadata within your web pages.”


    When it comes to enhancing content, it’s time for publishers to find their way into the web of linked data. “Once you have tagged content it lets you harvest free content from the web to enhance your publications,” he says, citing the free, open content ranging from geographic to music databases already out there in the linked data world. “If you are not using it you are not taking advantage of an incredible opportunity to enhance content for free.”



    Tague recommends that organizations start small: “Find a way to semantically enable your content, get it tagged. Start a small project that uses that to save money and increase efficiency.”

    From there publishers can explore opportunities to use the semantic web in ways that get their juices flowing, such as enabling the investigative journalism process. Tague points to the ability to reduce the high costs of these important efforts by using semantic web technologies to automate the process of culling through mountains of information to find connections, as one news organization has done to explore nepotism and corruption around government contracts.

    “But,” he advises, “don’t lead with this [as a reason for embracing semantic technologies]. It’s soft and fuzzy and requires lots of resources. But we may be entering an era of greater transparency in government and the opportunities for investigative journalism are enormous.”

    Semantic Tech & Business Conference Returns to San Francisco

    Semantic Tech & Business Conference returns to San Francisco in June! Join us from June 3-7 for complete coverage of Big Data, Linked Data, Extreme Information Management, and Semantic Web. From breakthrough approaches to solving business problems to the big data implications of fast–evolving technologies, SemTechBiz provides you with an unparalleled interactive experience and delivers tangible business value. We're offering a special early rate when you register by February 17. Sign up now!