These vistas will be explored in a session hosted by Kevin Ford, digital project coordinator at the Library of Congress at next week’s Semantic Technology & Business conference in San Jose. The door is being opened by the Bibliographic Framework Initiative (BIBFRAME) that the LOC launched a few years ago. Libraries will be moving from the MARC standards, their lingua franca for representing and communicating bibliographic and related information in machine-readable form, to BIBFRAME, which models bibliographic data in RDF using semantic technologies.
Posts Tagged ‘MARC’
At The Semantic Technology and Business conference in San Francisco Monday, OCLC technology evangelist Richard Wallis broke the news that Content-negotiation was implemented for the publication of Linked Data for WorldCat resources. Last June, WorldCat.org began publishing Linked Data for its bibliographic treasure trove, a global catalog of more than 290 million library records and some 2 billion holdings, leveraging schema.org to describe the assets.
“Now you can use standard Linked Data technologies to bring back information in RDF/ XML, JSON, or Turtle,” Wallis said. Or triples. “People can start playing with this today.” As he writes in his blog discussing the news, they can manually specify their preferred serialization format to work with or display, or do it from within a program by specifying to the http protocol for the format to accept from accessing the URI.
“Two hundred ninety million records on the web of Linked Data is a pretty good chunk of stuff when you start talking content negotiation,” Wallis told the Semantic Web Blog.
Linked data is becoming even more interesting to the OCLC, a non-profit, membership, computer library service and research organization of 72,000 libraries in 170 countries and territories around the world. It’s named Richard Wallis — formerly of the U.K.’s Talis Linked Data and Semantic Web Technology company and one of our frequent Semantic Web Blog guest authors — to the position of Technology Evangelist.
The OCLC has as a major asset Worldcat, a global catalog comprising the collections of more than 10,000 libraries and adding up to more than 258 million records and 1.8 billion-plus holdings, in traditional library metadata format. WorldCat.org is the publicly searchable view of their core data in library format based upon library records (Marc records). More semantic web-oriented is other work the OCLC been doing over the last couple of years, Wallis explains, including experiments with using RDF/Linked Data at viaf.org, where the Virtual International Authority File publishes authoritative descriptions of names or organizations, and something similar for the Dewey Decimal Classification system at dewey.info.
In his new role, Wallis will collaborate with members and facilitate projects with OCLC teams as libraries and the cooperative drive efforts to expose WorldCat data as linked data, and will represent OCLC and WorldCat to the global library and web/IT leader communities. The VIAF and Dewey projects certainly provided an opportunity for OCLC to see the benefit of linking things together. On top of that, “the climate for Linked Data and libraries has changed dramatically over the last 12 months,” Wallis says.
Interest was evident at the Linked Data in Libraries event he ran for Talis this past summer, for example, and efforts like the W3C’s Linked Data in Libraries interest group, the Linked Open Data in Libraries, Archives & Museums work, the British Library’s work on the British National Bibliography as Linked Open Data, and the Library of Congress’s Bibliographic Framework Initiative General Plan all are adding fuel to the fire.
The opportunity is there for the OCLC to take the lead on Linked Data in the somewhat fragmented library world as those organizations start to hear more and more about the concept. “Linked Data is starting to be something talked about in the library world, but like any other world, it’s still a bit of an enthusiast environment,” Wallis says. As he evangelizes to the library community what Linked Data is about – and to the web community about what the OCLC is doing with its chunk of data that is relevant to the wider Linked Data and Web of Data world – he hopes “to be in at the beginning of a process where those two communities come together to help come up with the best way of applying Linked Data principles to library data.”
In a statement announcing the appointment, Robin Murray, OCLC Vice President, Global Product Management, said, “Richard Wallis is a leader in Semantic Web and Linked Data technology, and we believe he will help the OCLC cooperative extend our efforts to help libraries move to Webscale.”
Data Liberate, the consultancy Wallis began upon leaving Talis, will continue as a personal blogging site. “I still have interest wider than the library community and I believe that those interests can keep me up to date with the wide world and advise my advice into the OCLC,” he says.
A quirky new article likens search engines to humongous babies. The article states, “You can’t expect it to understand complicated things. You would never try to teach language to a human baby by reading it Nietzsche, and you shouldn’t expect a baby google to learn bibliographic data by feeding it MARC (or RDA or METS or MODS, or even ONIX). When a baby says ‘goo-goo’ to you, you don’t criticize its misuse of the subjunctive. You say ‘goo-goo’ back. When Google tells you that that it wants to hear ‘schema.org’ microdata, you don’t try to tell it about the first indicator of the 856 ‡u subfield. You give it schema.org microdata, no matter how babyish that seems.” Read more