Kerstin Recker recently wrote for Wired, “Search has changed dramatically over the past year and semantic technology has been at the center of it all. Consumers increasingly expect search engines to understand natural language and perceive the intent behind the words they type in, and search engine algorithms are rising to this challenge. This evolution in search has dramatic implications for marketers, consumers, technology developers and content creators — and it’s still the early days for this rapidly changing environment. Here is an overview of how search technology is changing, how these changes may affect you and what you can do to market your business more effectively in the new era of search.” Read more
Posts Tagged ‘search engines’
Interested in how schema.org has trended in the last couple of years since its birth? If you were at The International Semantic Web Conference event in Sydney a couple of weeks back, you may have caught Google Fellow Ramanathan V. Guha — the mind behind schema.org — present a keynote address about the initiative.
Of course, Australia’s a far way to go for a lot of people, so The Semantic Web Blog is happy to catch everyone up on Guha’s thoughts on the topic.
We caught up with him when he was back stateside:
The Semantic Web Blog: Tell us a little bit about the main focus of your keynote.
Guha: The basic discussion was a progress report on schema.org – its history and why it came about a couple of years ago. Other than a couple of panels at SemTech we’ve maintained a rather low profile and figured it might be a good time to talk more about it, and to a crowd that is different from the SemTech crowd.
The short version is that the goal, of course, is to make it easier for mainstream webmasters to add structured data markup to web pages, so that they wouldn’t have to track down many different vocabularies, or think about what Yahoo or Microsoft or Google understands. Before webmasters had to champion internally which vocabularies to use and how to mark up a site, but we have reduced that and also now it’s not an issue of which search engine to cater to.
It’s now a little over two years since launch and we are seeing adoption way beyond what we expected. The aggregate search engines see about 15 percent of the pages we crawl have schema.org markup. This is the first time we see markup approximately on the order of the scale of the web….Now over 5 million sites are using it. That’s helped by the mainstream platforms like Drupal and WordPress adopting it so that it becomes part of the regular workflow. Read more
Centurion, South Africa (PRWEB) April 26, 2013 — Gatfol serves base technology to provide digital devices with the ability to process human natural language efficiently.
The goal of truly semantic search has not yet fully been realized. The main problem is the enormity of ambiguous word permutations of semantic equivalence in even the simplest of phrases, which up to now has processing-wise required huge structured lexicons and ontologies as guides. Read more
Schema.org, Learning Resource Metadata Initiative Join Hands In Boost To Educational Content Searches
Earlier this month word came of a revision to schema.org: Version 1.0a additions, according to this posting from Dan Brickley, include the Datasets vocabulary, and some supporting utility terms for describing schema.org types, properties and their inter-relationships. One of the gems in the update are additions related to the Learning Resource Metadata Initiative (LRMI), an effort led by the Association of Educational Publishers and Creative Commons, which has as its goals making it easier to publish, discover and delivery quality educational resources on the web. The Bill and Melinda Gates Foundation and the William and Flora Hewlett Foundation helped fund the work.
With schema.org serving as a catalyst for its work, the LRMI developed a common metadata framework for tagging online learning resources, with the idea of having that metadata schema incorporated into Schema.org. With that now the case, it’s possible for publishers or curators of educational content to use LRMI markup and have that metadata recognized by the major search engines.
“One of the reasons why education was one of the first extensions of schema.org is that the education industry is going through some very interesting times,” says Madi Weland Solomon, head of Data Architecture Standards at education company Pearson plc, one of the LRMI project launch partners.
Claire Cain Miller of The New York Times recently wrote, “Say you need a latté. You might pull out your phone, open the Yelp app and search for a nearby cafe. If instead you want to buy an espresso machine, you will most likely tap Amazon.com. Either way, Google lost a customer. Google remains the undisputed king of search, with about two-thirds of the market. But the nature of search is changing, especially as more people search for what they want to buy, eat or learn on their mobile devices. This has put the $22 billion search industry, perhaps the most lucrative and influential of online businesses, at its most significant crossroad since its invention.” Read more
Structured data makes the Web go around. Search engines love it when webmasters mark up page content. Google’s rich snippets, for instance, leverages sites’ use of microdata (preferred format), or RDFa or microformats: It makes it possible to highlight in a few lines specific types of content in search results, to give users some insight about what’s on the page and its relationship to their queries – prep time for a recipe, for instance.
Plenty of web sites generated from structured data haven’t added HTML markup to their pages, though, so they aren’t getting the benefits that come with search engines understanding the information on those web pages.
Maybe that will change, now that Google has introduced Data Highlighter, an easy way to tell its search engine about the structured data behind their web pages. A video posted by Google product management director Jack Menzel gives the snapshot: “Data Highlighter is a point- and-click tool that allows any webmaster to show Google the patterns of structured data on their pages without modifying the pages themselves,” he says.
Search engine Yandex, which like Google, Bing and Yahoo takes advantage of sites using schema.org markup to improve the display of search results, today released a search app for the iPad. The other major search providers have already accounted for the iPad in their search portfolios.
According to the release announcing the news, the Yandex Search App offers a tablet-optimized, intuitive interface marked by the ability for users to open pages as tabs in a browser – as many as they wish – so they can switch between tabs and search results within one screen.
A packed room at the Semantic Tech & Business Conference in San Francisco played host to the much-anticipated Schema.org panel on Wednesday morning. As W3C semantic activity lead and moderator Ivan Herman had hoped (see this article), the discussion didn’t get bogged down in a duel between RDFa and microdata, but rather emphasized some important accomplishments of the last year and looked forward to future work.
As Herman put it, the only discussion he wanted to have around RDFa was to announce that the proposed RDFa 1.1 recommendations are expected to be published as official W3C standards Thursday, and that there had been a lot of interaction with the schema.org folks to make this useable for them as well.
Wednesday’s panel was composed of: Dan Brickley, of Schema.org at Google; R.V. Guha of Google; Steve Macbeth of Microsoft; Peter Mika ofYahoo!; Jeffrey W. Preston of Disney Interactive Media Group; Evan Sandhaus of The New York Times Company; and Alexander Shubin of Yandex.
Here are highlights of what took place:
The Semantic Technology & Business Conference has been underway since Sunday, with tutorials and lightning sessions catching audience interest. The conference presentations get underway today, most of them following on the heels of the opening keynotes given by Bart van Leeuwen, firefighter and architect at netage.nl; Jay Myers, web architect at Best Buy; and Steve Harris, CTO of Garlik, a part of Experian.
Best Buy, as readers of this blog know, has been diving deep into the semantic web waters under Myers’ direction for a few years now, and he shared that journey with the audience at SemTech.
Perhaps one of the most anticipated panels at next week’s Semantic Technology & Business Conference in San Francisco is the Wednesday morning session on Schema.org. Since the announcement of Schema.org just prior to last year’s SemTech Business Conference on the west coast, using the Schema.org shared vocabularies along with the microdata format to mark up web pages has been much debated, and created questions in the minds of webmasters and web search marketers along the lines of, “Which way should we go? Microdata or RDFa?”
NEXT PAGE >>