Media

Big Data Meets Semantics in Ooyala and Jinni Partnership

jinni

Rapid TV News reports that Ooyala and Jinni have joined up to deliver advanced video discovery technology. The article states, “The two personalisation leaders are targeting media companies, broadcasters and pay-TV operators with their new proposition that will integrate Ooyala’s machine-learning big-data analytics systems with Jinni‘s semantic discovery to deliver what the two companies call a powerful new level of video personalisation for all screens. The two companies will work together to develop and deploy what they claim will be a new level of machine learning powered by semantic discovery that will allow TV providers to tailor programming and video viewing experiences to each individual user. This will include personalised channels, custom programming guides, mood-based browsing and search, and viewer recommendations for both live and video-on-demand (VOD) content.” Read more

Music Discovery Service seevl.fm Launches

screen shot of seevl.fm search: Lou ReedThis week marked the public launch of seevl.fm.

SemanticWeb.com has tracked seevl’s development through various incarnations, including a YouTube plugin and as a service for users of Deezer (available as a Deezer app). This week’s development, however, sees the service emerge as a stand-alone, cross-browser, cross-platform, mobile-ready service; a service that is free and allows for unlimited search and discovery. So, what can one do with seevl?

Following the death of Lou Reed this week, I (not surprisingly) saw mentions of the artist skyrocket across my social networks. People were sharing memories and seeking information — album and song titles, lyrics, biographies, who influenced Reed, who Reed influenced, and a lot of people simply wanted to listen to Reed’s music.  A quick look at the seevl.fm listing for Lou Reed shows a wealth of information including a music player pre-populated with some of the artist’s greatest hits.

Read more

Video: Shared Library Data at the ALA Annual 2013

Logo of the OCLCRegular readers of this blog may know that Linked Data and Semantic Web technologies are gaining significant traction in the worlds of Libraries, Archives, and Museums. Earlier this summer, Richard Wallis (Technology Evangelist) and Ted Fons (Executive Director, Data Services and WorldCat Quality) of the OCLC discussed and demonstrated how that organization in particular is sharing library data. This presentation was delivered at the Annual Conference of the American Libraries Association in Chicago.

The presentations by Fons and Wallis serve as good introductory pieces to practical Linked Data use, and the potential benefits of using Linked Data as a platform for knowledge management for large collections of data.  Wallis also discusses why OCLC chose to use schema.org as a vocabulary.

Part I:

Part II:

Semantic Web Challenge Winners Announced

TrophiesEarlier this week, we reported on the Semantic Web Challenge taking place at the 2012 ISWC Conference in Boston. The winners of the challenge have now been announced!

Taking First Place in the “Open Track” was EventMedia Live, presented by Houda Khrouf, Vuk Milicic and Raphaël Troncy. EventMedia has been a hub in the Linked Data cloud since September 2010, but EventMedia Live is a recent application leveraging the resource. The academic paper is available, as well as the web application itself.  For a quick introduction, check out this fun video the team produced (after the jump).

Read more

GoodRelations Fully Integrated with Schema.org

Schema.org and GoodRelations logosSchema.org has announced that GoodRelations is now fully integrated into the markup vocabulary backed by Google, Yahoo!, Bing/Microsoft, and Yandex (read our past schema.org coverage). GoodRelations is the e-commerce vocabulary that has been developed and maintained by Martin Hepp since 2002 (previous coverage).

In the official announcement, R.V. Guha (Google) says, “Effective immediately, the GoodRelations vocabulary (http://purl.org/goodrelations/) is directly available from within the schema.org site for use with both HTML5 Microdata and RDFa. Webmasters of e-commerce sites can use all GoodRelations types and properties directly from the schema.org namespace to expose more granular information for search engines and other clients, including delivery charges, quantity discounts, and product features.”

Read more

The Semantic Link with Guest, Nick Holzherr of ‘The Apprentice’ and Whisk – July, 2012

Paul Miller, Bernadette Hyland, Ivan Herman, Eric Hoffer, Andraz Tori, Peter Brown, Christine Connors, Eric Franzon

On Friday, July 13, a group of Semantic thought leaders from around the globe met with their host and colleague, Paul Miller, for the latest installment of the Semantic Link, a monthly podcast covering the world of Semantic Technologies.

Whisk LogoThis episode includes a discussion with Nick Holzherr, finalist on the BBC One Television Series, “The Apprentice,” and founder of Whisk, a recipe planning idea based on big data and semantic analysis. He was named ‘emerging entrepreneur of the year’ by Insider Magazine in 2010, and Birmingham Young Personality of the Year (Entrepreneurship) in 2011.
Read more

OCLC Announcement: WorldCat.org Meets Schema.org (and hints of more to come)

image of library from Shutterstock.comOCLC has announced that WorldCat.org pages now include schema.org descriptive mark-up.

Created over the last four decades with the participation of thousands of member libraries, WorldCat is the world’s largest online registry of library collections. As the official press release states, “WorldCat.org now offers the largest set of linked bibliographic data on the Web. With the addition of Schema.org mark-up to all book, journal and other bibliographic resources in WorldCat.org, the entire publicly available version of WorldCat is now available for use by intelligent Web crawlers, like Google and Bing, that can make use of this metadata in search indexes and other applications.”

On the heels of the announcement earlier this week about Dewey Decimal Classifications also being available as Linked Data, this certainly marks an exciting week in the world of library information and the Semantic Web. However, this should also prove to be exciting for non-librarians, as these resources are now available beyond the world of library sciences.

Read more

Twitter, the new kid on the Semantic Web block

Remember how search engines can show nice snippets in their search results thanks to the structured data that webmasters embedded in the HTML of their webpages (RDFa, schema.org, etc)? Additionally, Facebook gains insight about user’s interest through structured data on webpages (i.e. Open Graph Protocol). Now there is a new kid on the block: Twitter.

Twitter Cards

Twitter recently introduced Twitter Cards, a way to “attach media experiences to Tweets that link to your content.” By adding structured data embedded in the HTML of your webpage, “users who Tweet links to your content will have a ‘card’ added to the Tweet that’s visible to all of their followers.” Basically, Twitter will now have a bit more of information about your webpage in order to know how to make a nice snippet in a tweet.

Read more

Schema.org, Wikidata, Google Knowledge Graph – Two Great Causes and a Symptom

I was toying with another title for this post – Yet Another Perfect Storm, but I think that particular metaphor (although appropriate here) has been somewhat over done.  So what sparked this one then?

I am on the long flight back from the Semantic Tech & Business Conference in San Francisco to the good ol’ UK, to see how they got on with the Queen’s Diamond Jubilee festivities.  I am reflecting on what my week at the conference has told me.  It has told me that things are a changing – I got that impression last year too, but more so this year.  Obviously, from the title of this post, it has something to do with Schema.org, Wikidata, and the Google Knowledge Graph….

Read more

Dynamic Semantic Publishing for Beginners, Part 2

Even as semantic web concepts and tools are underpinning revolutionary changes in the way we discover and consume information, people with even a casual interest in the semantic web have difficulty understanding how and why this is happening.  One of the most exciting application areas for semantic technologies is online publishing, although for thousands of small-to-medium sized publishers, unfamiliar semantic concepts are too intimidating to grasp the relevance of these technologies. This three-part series is part of my own journey to better understand how semantic technologies are changing the landscape for publishers of news and information.  Read Part 1.

—-

News and Media Organizations were well represented at the Semantic Technology and Business Conference in San Francisco this year.  Among the organizations presenting were the New York Times, the Associated Press (AP), the British Broadcasting Co. (BBC), Hearst Media Co., Agence France Press (AFP), and Getty Images.

It was interesting to note that, outside of the New York Times, which has been publishing a very detailed index since 1912, many news organizations presenting at the conference did not make the extensive classification of content a priority until the last decade or so.  It makes sense that, in a newspaper publishing environment, creating a detailed and involved index that guides every reader directly to a specific subject mentioned in the paper must not have seemed as critical as it does now– it’s not as though the reader was likely to keep the newspaper for future reference material– so the work of indexing news content by subject as a reference was left for the most part for librarians to do well after an article was published.

In the early days of the internet, categorization of content (where it existed) was limited to simple taxonomies or to free tagging.  News organizations made rudimentary attempts to identify subjects covered by content, but  did not provide much information  about relationships between these subjects.   Search functions matched the words in the search to the words in the content of the article or feature.   Most websites still organize their content this way.

The drawbacks of this approach to online publishing is that it doesn’t make the most of the content “assets” publishers possess.    Digital content has the potential to be either permanent or ephemeral– it can exist and be accessed by a viewer for as long as the publisher chooses to keep it, and many news organizations are beginning to realize the value of giving their material a longer shelf life by presenting it in different contexts.   If you have just read an article about, say, Hillary Clinton, you would might be interested in a related story about the State Department, or perhaps her daughter Chelsea, or her husband Bill….   But how would any content management system be able to serve up a related story if no one had bothered to indicate somewhere what the story is about and how these people and/or concepts are related to one another?

Read more

NEXT PAGE >>