Posts Tagged ‘schema.org’

You Can Take An Active Role In Schema.Org

Have you wanted to get involved in the schema.org project? Your contribution to the collaborative effort driven by Bing, Google, Yahoo and Yandex for a shared markup vocabulary for web pages is more than welcome. As Dan Brickley, a developer advocate at Google, noted during his presentation about schema.org’s progress to date at this summer’s Semantic Technology & Business Conference, the “pattern of collaboration with the project [is] we’re trying to push work off on people who are better qualified to do it, and then we mush it all together.”

In other words, the project is so broad, covering so many topics, that the input of experts – whether from the library, media, sports or any other of the multitude of communities whose vocabularies are or aim to be represented – is incredibly valuable, and very much encouraged. In an overview of the 2013-2014 releases, which included TV/radio, civic services, and bibliographic additions, as well as accessibility properties, among others, Brickley related that during the year, “We listened a lot. We listened to people who knew better than us about accessibility, about how broadcast TV and radio are described, about describing social services, about libraries, journals, and ecommerce, and then integrated their suggestions into a unified set of schemas.”

Read more

Schema.Org: The Fire’s Been Lit

Why has schema.org made the following strides since its debut in 2011?

  • In a sample of over 12 billion web pages, 21 percent, or 2.5 billion pages, use it to mark up HTML pages, to the tune of more than 15 billion entities and more than 65 billion triples;
  • In that same sample, this works out to six entities and 26 facts per page with schema.org;
  • Just about every major site in every major category, from news to e-commerce (with the exception of Amazon.com), uses it;
  • Its ontology counts some 800 properties and 600 classes.

A lot of it has to do with the focus its proponents have had since the beginning on making it very easy for webmasters and developers to adopt and leverage the collection of shared vocabularies for page markup. At this August’s 10th annual Semantic Technology & Business conference in San Jose, Google Fellow Ramanathan V. Guha, one of the founders of schema.org, shared the progress of the initiative to develop one vocabulary that would be understood by all search engines and how it got to where it is today.

Read more

Deconstructing Google’s Knowledge Graph

[Image from Google I/O showing the addition of information into Google's Knowledge Graph using JSON-LD.]

Barbara Starr of Search Engine Land recently observed that, “Search is changing – and it’s changing faster than ever. Increasingly, we are seeing organic elements in search results being displaced by displays coming from the Knowledge Graph. Yet the shift from search over documents (e.g. web pages) to search over data (e.g. Knowledge Graph) is still in its infancy. Remember Google’s mission statement: Google’s mission is to organize the world’s information to make it universally accessible and useful. The Knowledge Graph was built to help with that mission. It contains information about entities and their relationships to one another – meaning that Google is increasingly able to recognize a search query as a distinct entity rather than just a string of keywords. As we shift further away from keyword-based search and more towards entity-based search, internal data quality is becoming more imperative.”

Read more

The Four Top Search Providers Unite to Talk Semantic Web

Mark Albertson of the Examiner recently wrote, “It was an unusual sight to be sure. Standing on a convention center stage together were computer engineers from the four largest search providers in the world (Google, Yahoo, Microsoft Bing, and Yandex). Normally, this group couldn’t even agree on where to go for dinner, but this week in San Jose, California they were united by a common cause: the Semantic Web… At the Semantic Technology and Business Conference [in] San Jose this week, researchers from around the world gathered to discuss how far they have come and the mountain of work still ahead of them.”

Read more

New Opps For Libraries And Vendors Open Up In BIBFRAME Transition

Opportunities are opening up in the library sector, both for the institutions themselves and for providers whose solutions and services can expand in that direction.

These vistas will be explored in a session hosted by Kevin Ford, digital project coordinator at the Library of Congress, at next week’s Semantic Technology & Business conference in San Jose. The door is being opened by the Bibliographic Framework Initiative (BIBFRAME) that the LOC launched a few years ago. Libraries will be moving from the MARC standards, their lingua franca for representing and communicating bibliographic and related information in machine-readable form, to BIBFRAME, which models bibliographic data in RDF using semantic technologies.

Read more

Drupal Deepens Semantic Web Ties

Among the mainstream content management systems, you could make the case that Drupal was the first open source semantic CMS out there. At next week’s Semantic Technology and Business Conference, software engineer Stéphane Corlosquet of Acquia, which provides enterprise-level services around Drupal, and Bock & Co. principal Geoffrey Bock will discuss in their session Drupal’s role as a semantic CMS and how it can help organizations and institutions that are yearning to enrich their data with more semantics – for search engine optimization, yes, but also for more advanced use cases.

“It’s very easy to embed semantics in Drupal,” says Bock, who analyses and consults on digital strategies for content and collaboration. At its core it has the capability to manage semantic entities, and in the upcoming version 8 it takes things to a new level by including schema.org as a foundational data type. “It will become increasingly easier for developers to build and deliver semantically enriched environments,” he says, which can drive a better experience both for clients and stakeholders.

Corlosquet, who has taken a leadership role in building semantic web capabilities into Drupal’s core and maintains the RDF module in Drupal 7 and 8, explains that the closer embrace of schema.org in Drupal is of course a help when it comes to SEO and user engagement, for starters. Google uses content marked up using schema.org to power products like Rich Snippets and Google Now, too.

Read more

Add schema.org Actions to Your Own Knowledge Graph (Video — Part 3)

[Editor's note: this is Part 3 of a series. See Part 1 and Part 2]

In Part 3 of this series, Jarek Wilkiewicz details activating the small Knowledge Graph (built on Cayley) with Schema.org Actions. He begins by explaining how Actions can be thought of as a combination of “Entities” (things) and “Affordances” (uses). As he defines it, “An affordance is a quality of an object, or an environment, which allows an individual to perform an action.”

For example, an action might be using the “OK Google” voice command on a mobile device. The even more specific example that Wilkiewicz gives in the video (spoiler alert) is that of using the schema.org concept of potentialAction to trigger the playing of a specific artist’s music in a small music store’s mobile app.
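
As a rough sketch of what such markup might look like, the Python snippet below builds a JSON-LD description of a musical artist that advertises a potentialAction. The artist name and the app deep link are hypothetical placeholders, and the choice of the ListenAction type is an assumption made for illustration; the markup shown in the video may be structured differently.

```python
import json


def artist_with_listen_action(artist_name, deep_link):
    """Return a schema.org MusicGroup entity carrying a potentialAction.

    Both arguments are illustrative placeholders, not values from the video.
    """
    return {
        "@context": "http://schema.org",
        "@type": "MusicGroup",
        "name": artist_name,
        # The affordance: something a client could do with this entity.
        "potentialAction": {
            "@type": "ListenAction",
            "target": deep_link,
        },
    }


markup = artist_with_listen_action(
    "Example Artist",  # hypothetical artist
    "android-app://com.example.musicstore/play/example-artist",  # hypothetical link
)
print(json.dumps(markup, indent=2))
```

The entity (the artist) and the affordance (listening) stay separate in the structure, which mirrors the “Entities” plus “Affordances” framing described above.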

To learn more, and to meet Jarek Wilkiewicz and his Google colleague, Shawn Simister, in person, register for the Semantic Technology & Business Conference where they will present “When 2 Billion Freebase Facts is Not Enough.”

Building The Scientific Knowledge Graph

Standard Analytics, which was a participant at the recent TechStars event in New York City, has a big goal in mind: To organize the world’s scientific information by building a complete scientific knowledge graph.

The company’s co-founders, Tiffany Bogich and Sebastien Ballesteros, came to the conclusion that someone had to take on the job as a result of their own experience as researchers. A problem they faced, says Bogich, was being able to access all the information behind published results, as well as search and discover across papers. “Our thesis is that if you can expose the moving parts – the data, code, media – and make science more discoverable, you can really advance and accelerate research,” she says.

Read more

Schema.org Introduces Concept of Role

[Image of soccer team depicting question: What is this player's role?]

In a post yesterday at the official schema.org blog, Vicki Tardif Holland (Google) and Jason Johnson (Microsoft) announced that schema.org has created a way to more richly describe relationships between entities in structured markup. The addition of the “Role” schema allows for the description of more complex relationships than were previously possible. In the post, the authors cite the business need as one that is often found in the domains of entertainment and sports.

For example, in schema.org, it can be asserted that Bill Murray was an actor in the film Ghostbusters [Fig. 1].

Figure 1: Visualization of the basic relationship between Bill Murray, as an actor, and Ghostbusters.

That’s all well and good, but how can one extend this relationship to include more detail such as the name of the character Mr. Murray played in the film? More on that in a moment.
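
To make the basic assertion concrete, here is a minimal sketch that builds it as JSON-LD from Python. The second function shows one way the extra detail might be attached, assuming schema.org's PerformanceRole type and its characterName property; treat it as an illustration under those assumptions, not as the exact markup from the blog post.

```python
import json

CONTEXT = "http://schema.org"


def basic_actor_markup():
    """Plain assertion: Bill Murray was an actor in Ghostbusters."""
    return {
        "@context": CONTEXT,
        "@type": "Movie",
        "name": "Ghostbusters",
        "actor": {"@type": "Person", "name": "Bill Murray"},
    }


def role_actor_markup(character_name):
    """Same assertion, with a role node sitting between movie and person."""
    return {
        "@context": CONTEXT,
        "@type": "Movie",
        "name": "Ghostbusters",
        "actor": {
            "@type": "PerformanceRole",
            # The property name is repeated to point at the actual person.
            "actor": {"@type": "Person", "name": "Bill Murray"},
            "characterName": character_name,
        },
    }


basic = basic_actor_markup()
with_role = role_actor_markup("Dr. Peter Venkman")
print(json.dumps(with_role, indent=2))
```

In this sketch the role node takes the place of the direct value and carries the additional facts, while the person is still reachable through the nested actor property.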

Read more

Structured Data In The Spotlight At The New York Times

Photo credit : Eric Franzon

In the winter of 2012, The New York Times began its implementation of the schema.org compatible version of rNews, a standard for embedding machine-readable publishing metadata into HTML documents, to improve the quality and appearance of its search results, as well as generate more traffic through algorithmically generated links. The semantic markup for news articles brought to its web pages structured data properties to define author, the date a work was created, its editor, headline, and so on.
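
As an illustration of the kind of properties mentioned above, the following sketch assembles a schema.org NewsArticle description in Python. The Times embedded its metadata in the HTML itself; JSON-LD is used here only to keep the example short, and the headline, names, and date are invented placeholders rather than real Times data.

```python
import json
from datetime import date


def news_article_markup(headline, author, editor, created):
    """Describe an article with the author, dateCreated, editor and headline properties."""
    return {
        "@context": "http://schema.org",
        "@type": "NewsArticle",
        "headline": headline,
        "author": {"@type": "Person", "name": author},
        "editor": {"@type": "Person", "name": editor},
        "dateCreated": created.isoformat(),  # ISO 8601 date string
    }


article = news_article_markup(
    headline="Example Headline",  # placeholder
    author="A. Reporter",         # placeholder
    editor="An Editor",           # placeholder
    created=date(2012, 1, 15),    # placeholder date
)
print(json.dumps(article, indent=2))
```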

But according to a leaked New York Times internal innovation report, there’s more work to be done in the structured data realm. That work is part of a grand plan to truly put digital first in the face of falling website and smartphone app readership and hotter competition from old-guard and new-age newsrooms and social media properties, which are transforming how journalism is delivered to an audience increasingly invested in mobile, social, and personalized technologies.

The report was put together with insights from parties including Evan Sandhaus, director for search, archives and semantics at The NY Times, who was instrumental in the rNews/schema.org effort as well as the TimesMachine relaunch, a digital archive of 46,592 issues of The New York Times whose use includes surrounding current news stories with context. While the report notes that the Gray Lady has not been standing still in the face of its challenges, citing newsroom advances to grow audience with efforts such as using data to inform decisions, it needs to do more – faster – to make it easy to get its content in front of digital readers.

Read more
