SemTechBiz SF more TVNewser TVSpy LostRemote SocialTimes AllFacebook AllTwitter GalleyCat AppNewser UnBeige AgencySpy PRNewser 10,000 Words FishbowlNY FishbowlLA FishbowlDC MediaJobsDaily

Posts Tagged ‘DBpedia’

Dynamic Semantic Publishing for News Organizations

Ontoba logo

Paul Wilton was Technical and development lead for semantic publishing at BBC News and Sport Online during the 2010 World Cup.  Currently he is the Technical architect at Ontoba.  In this interview, a supplement to “Dynamic Semantic Publishing for Beginners”, Paul describes the current landscape for DSP as it applies to news organizations.

Q. Are you seeing a wide disparity in the way that news organizations have approached the creation and use of semantically-linked (or annotated) content?

A. Actually the pattern and often the (general) technical architecture is surprisingly similar. Where things differ are the applications, models used and instance data. This is undoubtedly bleeding edge technology, and typically the impetus to begin investigating the use of linked data, RDF and semantics in the technology stack has come from within the Information Architecture and R&D teams, not from the offices of the CTO/CIO. Maybe this is starting to change now.

Q. Do many news organizations have the resources (staff and/or Content Management Systems) that are able to publish and use semantic data?

A. Not in our experience, but this shouldn’t be a barrier to integrating semantic technologies and publishing linked data.

The key components to adopting semantic publishing – a semantic repository (triple store); appropriate linked data sets; and the ability to semantically annotate your content – can be built alongside an existing Content Management System. Read more

Spanish DBpedia Launched

A new article reports, “After months of gratuitous hard work and cooperation by higher education students and experts, the Spanish version of DBpedia, also known as the Spanish Semantic Wikipedia, has finally come into being. The Spanish DBpedia contains 70 million data that account for 80% of the information in the Spanish Wikipedia and now rivals other languages like English or French… DBpedia is a project for extracting Wikipedia data and building a semantic version of this Internet encyclopaedia. It is a community effort for extracting structured information from the Wikipedia and making it accessible on the Web.” Read more

Wikidata, and a clash of world views

Remember the days before Wikipedia had all the answers? We looked things up in libraries, referring to shelf-filling encyclopaedias. We bought CD-ROMs (remember them?) full of facts and pictures and video clips. We asked people. Sometimes, school home work actually required some work more strenuous than a cut and paste. We went about our business without remembering that New Coke briefly entered our lives on this day in 1985.

Wikipedia is far from perfect, and some of the concern around its role in a wider dumbing down of thought and argument may be justified. But, despite that, it’s a remarkable achievement and a wonderful resource. Those who argued that it would never work have clearly been proven wrong. Carefully maintained processes and the core principle of the neutral point of view mostly serve contributors well.

With Wikimedia Deutschland‘s recent announcement of Wikidata, many of the early concerns about Wikipedia itself have resurfaced once again. Read more

Highlights from WWW 2012 Conference

Juan Sequeda photoThis year was the 21st World Wide Web Conference located in Lyon, France. This conference is a unique forum for discussion about how the Web is evolving. There were hundreds of talks over 3 days. Let me summarize some Semantic Web presentations I was able to attend.

NautiLOD

Programmers daily use the wget tool to specify and retrieve data on the Web. However, wget is limited since it cannot dig into the semantics of Web data to do the job. What if you were to add semantics to wget? This is the question that Valeria Fionda, Claudio Gutierrez and Giuseppe Pirró asked themselves. They took that question to the next level: imagine a semantic wget on top of Linked Data. They wanted to create a language to declaratively specify portions of the Web of Data, define routes and instruct agents that can do things for you on the Web. All this by exploiting the semantics of information (RDF data) found in online data sources. For example, find all the Wikipedia pages of directors that have been influenced by Stanley Kubrick and send them to my email; retrieving information about David Lynch from different information providers only gives a hint of what can be done. The researchers developed a simple, generic declarative language, NautiLOD and implemented it in swget (semantic wget). swget comes in two flavors: a simple command line tool (to give the Web back to users) and a GUI. This is not a fantasy anymore. Check it our for yourself (http://swget.wordpress.com).

Read more

SWiPE Plans to Make Search a Breeze

Eileen Brown recently reported that SWiPE hopes to make querying search engines a less frustrating experience. Brown writes, “If you struggle with RDF triples (Resource Description Framework) and SPARQL (Query language and protocol for RDF) do not despair. SWiPE (Searching WIkiPedia by Example) allows semantic and well-structured knowledge bases to be easily queried from within the pages of Wikipedia. If you want to know which cities in Florida, founded in last century have more than 50 thousand people you will be able to enter the query conditions directly into the Infobox of a Wikipedia page. Swipe activates certain fields of Wikipedia that generate equivalent SPARQL queries executed on DBpedia.” Read more

A Fundamental Linked Data Debate

linkeddata_blue There is a fierce debate going on in the world of the Semantic Web and Linked Data, the question being is it of fundamental importance to realising the benefits of the technology or are they just dancing on the head of a pin.    The core debate revolves around something with the stunningly opaque title of the httpRange-14 issue.

The debate has been rumbling on for years but was reignited over the last few days by proposals being submitted to the W3C to clarify and hopefully simplify things.  I use the word ignited as that what I was beginning to think my iPhone was about to do – it has been buzzing away like a bumblebee on speed over the last few days announcing the arrival of yet another passionately held opinion from a member of the respected Semantic Web/Linked Data community from Sir Tim Berners-Lee downwards.    Fortunately for those of you that do not follow the W3C’s Technical Architecture (TAG) and Linked Open Data (public-lod) mailing lists it may have gone unnoticed.

Let me try to explain, in as simple terms as possible, what the fuss is all about and why it may be important.  From my point of view, and there are many surrounding this, the issue is a combination of two problems.

Read more

Gravity Gets The Interest Graph Going; Partners Include Wall Street Journal and TechCrunch

Just a little over a year ago The Semantic Web Blog introduced our readers to Gravity in this article. The project, spearheaded by former MySpace execs, is focused on building the Interest Graph. The team’s been pretty quiet about development efforts since that time — until just this month, when it announced Gravity Labs to let the public in on a little more about its underlying Interest Graph infrastructure and to showcase the platform. It also announced that it was open-sourcing some of the “plumbing” code it came up with during development, while understandably keeping its core IT, ontology and algorithms under wraps.

The announcement noted that the internally-named Gravity Interest Service for personalizing content at scale, in real-time, went live at production-scale 6 months ago. So far the technology has created over 400 million user interest graphs; served over 13 million pieces of personalized content per day; personalized the daily Internet experience of tens of millions of users per month; and processed over 25 million inbound interest signals per day, the company says. It expects that at this rate, that in under six months it will be handling 10X all of these numbers.

The Semantic Web Blog once again caught up with Gravity CTO Jim Benedetto to talk some more about the Interest Graph, a term he acknowledges gets thrown around quite a bit these days, with a lot of web sites claiming they’ve got the goods. But, he says, “what they effectively are saying is that buried deep within the data of our logs or deep in the data of how our users interact with our site, we know there are interest indicators there. But a lot of them are not doing much with their data.” Interest Graphs, he says, aren’t owned, but interest data resides in individual places and across the web at large — and they need the Gravity platform to help unlock that to create dynamic and personalized experiences for users, Benedetto says.

Read more

The Semantic Web Has Gone Mainstream! Wanna Bet?

Juan Sequeda photoIn 2005, I started learning about the so-called Semantic Web. It wasn’t till 2008, the same year I started my PhD, that I finally understood what the Semantic Web was really about. At the time, I made a $1000 bet with 3 college buddies that the Semantic Web would be mainstream by the time I finished my PhD. I know I’m going to win! In this post, I will argue why.

Read more

What W3C’s R2RML and Direct Mapping Mean to Enterprise Data

Juan Sequeda photoI’m very happy to announce that the World Wide Web Consortium’s RDB2RDF Working Group, in which I participate as an Invited Expert,  has published two Candidate Recommendations: R2RML: RDB to RDF Mapping Language and A Direct Mapping of Relational Data to RDF. This has been a long road and we still have some ways to go. The standardization process goes back to the W3C Workshop on RDF Access to Relational Databases, which took place in October 2007. The W3C RDB2RDF Incubator Group followed afterwards. After almost 5 years, we are on track to have a standard. However, what is this standard bringing to the table?

Read more

The Semantic Link with Guest, Denny Vrandecic – February, 2012

Paul Miller, Bernadette Hyland, Ivan Herman, Eric Hoffer, Andraz Tori, Peter Brown, Christine Connors, Eric Franzon

On Friday, February 10, a group of Semantic thought leaders from around the globe met with their host and colleague, Paul Miller, for the latest installment of the Semantic Link, a monthly podcast covering the world of Semantic Technologies. This episode includes a discussion about data; specifically, the recently announced “wikidata” project with special guest, Denny Vrandecic.
At the recent SemTechBiz Berlin conference, Denny presented a talk titled, “Wikidata: The Next Big Thing for Wikipedia.” As evidenced in the “Wow’s” expressed by the panelists in this month’s podcast call, this is indeed a big deal for Wikipedia and for Semantic Web. Read more

<< PREVIOUS PAGENEXT PAGE >>