Posts Tagged ‘Yahoo!’

Wikimeta Project’s Evolution Includes Commercial Ambitions and Focus On Text-Mining, Semantic Annotation Robustness

Wikimeta, the semantic tagging and annotation architecture for incorporating semantic knowledge within documents, websites, content management systems, blogs and applications, this month is incorporating itself as a company called Wikimeta Technologies.  Wikimeta, which has a heritage linked with the NLGbAse project, last year was provided as its own web service.

Dr. Eric Charton, Ph.D, MSc at École Polytechnique de Montréal, is project leader and author of the Wikimeta code. The NLGbAse project was conducted by Charton at the University of Avignon as part of his Ph.D. Thesis.  The Semantic Web Blog recently hosted an email discussion with him to learn more about the Wikimeta architecture and its evolution.

 

The Semantic Web Blog: Tell us about the NLGBase project and Wikimeta’s relationship to it.

Charton: NLGbAse is an ontology extracted from Wikipedia. It is used in Wikimeta as a resource for semantic disambiguation. For each Wikipedia document (aka Semantic Concept), NLGbAse provides various ways of word-writing (for example, “General Motors” can be written “GM Company”, “GM”, “General Motors Corp” and so on), used for detection.

Read more

Semantic Tech & Business Conference Returns to San Francisco

Semantic Tech & Business Conference returns to San Francisco in June! Join us from June 3-7 for complete coverage of Big Data, Linked Data, Extreme Information Management, and Semantic Web. From breakthrough approaches to solving business problems to the big data implications of fast–evolving technologies, SemTechBiz provides you with an unparalleled interactive experience and delivers tangible business value. We're offering a special early rate when you register by February 17. Sign up now!

Parse.ly Brings A Dash of Semantics To Online Publishers

Online publishers and other content providers have a new analytics tool to help them understand what their readers care about and use that information to better connect them to their sites’ relevant and compelling content. Launching today is Dash, based on the predictive content analytics platform Parse.ly. The technology crawls every article page for Parse.ly’s publisher-partners, and analyzes, in real time and at scale, the text to identify relevant topics to group related content together. Behind this lies natural language processing technology, which uses language queues hidden inside the text to determine its affiliated topics. To date Dash has extracted over 350,000 unique topics through all the URLs is has crawled during private beta for a healthy taxonomy of topics across the web being consumed by users.

Read more

Yahoo! Makes Content Analysis Tools Openly Available

Yahoo! has opened up their content analysis technology to web developers. According to the company’s blog, “The newly launched Yahoo! Content Analysis service replaces Yahoo!’s popular Term Extraction service and now provides advanced content analysis on either text or a URL, leverages Yahoo!’s state of the art machine-learned ranking (MLR) technology to extract key terms from the content, and, more importantly, to rank them based on their overall importance to the content. The output you receive contains the keywords and their ranks along with other actionable metadata.” Read more

Facebook’s Timeline Launches

Facebook today posted that users now officially can upgrade their profiles to its Facebook Timeline by heading here. Timeline, as Mark Zuckerberg described in September, is Facebook’s way of helping uses curate the stories of their lives, calling out the most meaningful events and recent highlights. During the F8 Developers’ Conference, he said that Facebook had “rethought from the ground up the heart of the Facebook experience.”

The Open Graph protocol, based on RDFa, provides power to the experience, enabling applications to focus on filling out user Timelines with lightweight activities, and on discovering new things through friends in what Zuckerberg at the time called a frictionless experience. As an example, he noted the debut of the Open Graph Spotify music app that adds to a user’s Timeline the songs she listens to, radio stations, and albums.

Other Open Graph app launch partners announced at the event in the fall were The Daily, Dailymotion, Earbits, The Guardian, Hulu, iHeartRadio, The Independent, Izlesene, Jelli, My Video, Netflix, Rdio, Slacker, Songza, The Washington Post, and Yahoo. Others, such as The Huffington Post, joined later. In late November, Facebook said that the publishers building social news apps to help users see what their friends or reading or to view past top articles are seeing good early results.

Read more

BREAKING: Schema.org announces intent to support RDFa Lite!

Last month, we reported on the new RDFa 1.1 Lite proposal by Ben Adida. In our recent podcast on Schema.org with guest Ramanathan V. Guha, we touched on the topic of RDFa Lite as well.

Today, schema.org spokesperson Dan Brickley posted that “we’re pleased to give advance notice of a new way of adopting schema.org’s structured data vocabulary. W3C’s RDF Web Applications group are right now putting the finishing touches to the latest version of the RDFa standard. This work opens up new possibilities also for developers who intend to work with schema.org data using RDF-based tools and Linked Data, and defines a simplified publisher-friendly ‘Lite’ view of RDFa.”

Read more

Russian Search Engine Yandex to Collaborate on schema.org

Yandex, Russia’s leading search engine has announced that it is joining forces with Google, Bing, and Yahoo! to collaborate on schema.org. One article reports, “Now the pages tagged with Schema.org tags will be picked up not only by Yandex search engine, but also its other services, such as Yandex’s Business Directory, Yandex.Dictionaries, Yandex.Images and Yandex.Video. Yandex shows the data from these services also in its search results.” Read more

The Semantic Link – Episode 11, October 2011

Paul Miller, Bernadette Hyland, Ivan Herman, Eric Hoffer, Andraz Tori, Peter Brown, Christine Connors, Eric Franzon

On Friday, October 14, a group of Semantic thought leaders from around the globe met with their host and colleague, Paul Miller, for the latest installment of the Semantic Link, a monthly podcast covering the world of Semantic Technologies. This episode includes a discussion about schema.org. The Semantic Link panel was joined by special guest, Ramanathan V. Guha, Google Fellow, and one of the principal people behind schema.org.

schema.org

Read more

Schema.org Workshop – A Path Forward

photo of schema-org leadership panel at workshop

schema.org Leadership Panel; L-to-R: Michael O'Connor (Microsoft), John Giannandrea (Google), Charlie Jiang (Microsoft), Kavi Goel (Google), R.V. Guha (Google), Steve MacBeth (Microsoft), Gaurav Mishra (Yahoo), Peter Mika (Yahoo)

A room full of interested parties gathered in Microsoft’s Silicon Valley Campus yesterday to discuss Schema.org, its implications on existing vocabularies, syntaxes, and projects, and how best to move forward with what has admittedly been a bumpy road.

Schema.org, you may recall, is the vocabulary for structured data markup that was released by Google, Microsoft, and Bing on June 2 of this year.  The schema.org website states, “A shared markup vocabulary makes easier for webmasters to decide on a markup schema and get the maximum benefit for their efforts. So, in the spirit of sitemaps.org, Bing, Google and Yahoo! have come together to provide a shared collection of schemas that webmasters can use.”  (For more history about the roll-out and initial reactions to it, here’s a summary.)

Yesterday was the first time since the Semantic Technology & Business Conference in San Francisco that community members have gathered face-to-face to discuss Schema.org in an open forum. It was a full agenda with plenty of opportunity for debate and discussion.

Read more

How to Add Semantic Markup to your Site or Application? Start Here!

Structured Data on the Web - structured-data.org

Recently, a new resource appeared on the Web to help developers navigate the waters around various approaches to adding semantic markup to websites and applications. We caught up with the creators of the newly launched  structured-data.org, to learn more about this project.  They are:

Stéphane Corlosquet,
OpenSpring.net

Gregg Kellogg

Gregg Kellogg,
Principal,
Kellogg Associates

Manu Sporny

Manu Sporny,
Founder/CEO,
Digital Bazaar, Inc.

Q: What is “Structured Data on the Web” (the site and the concept)?

GK: We wanted to provide a place for people to learn about the different ways in which publishers can add semantic information to their web sites and applications. There is confusion in the marketplace, partly due to the introduction of schema.org, which has raised the awareness of structured data with web developers. structured-data.org is a one-stop-shop to learn about the different mechanisms available to developers to take advantage of this.
Read more

Infochimps Adds Geo APIs and Takes A Shine to Schema.Org, Too

On the way from Infochimps: Its Geo APIs that bring to developers data from open sources such as GeoNames and The National Climate Data Center, as well as licensed sources such as Locationary and Foursquare.  Now for the twist: The data marketplace is semantifying the geo data with a schema.org approach. This is just the first step in a broader plan to format all data to which its new and existing APIs provide access to in a unified way.

As CTO Flip Kromer explains it, Infochimps takes from schema.org the collection of types it defines. “That was designed for microformat markup in web pages. And we say why not take this tastefully done collection of types and make it so that it can be used by mobile phone and web developers,” he says, so that they can easily and in a unified way build on the databases to which Infochimps provides access. “That if we map that back to a JSON HTTP API world, that’s a really good thing that unlocks a lot of power for developers.”

Read more

NEXT PAGE >>