Research Information recently reported, “Symplectic Limited, a software company specialising in developing, implementing, and integrating research information systems, has become the first DuraSpace Registered Service Provider (RSP) for the VIVO Project. VIVO is an open-source, open-ontology, open-process platform for hosting information about the interests, activities and accomplishments of scientists and scholars. VIVO aims to support open development and integration of science and scholarship through simple, standard semantic web technologies.”
If you’re interested in Linked Data, no doubt you’re planning to listen in on next week’s Semantic Web Blog webinar, Getting Started With The Linked Data Platform (register here), featuring Arnaud Le Hors, Linked Data Standards Lead at IBM and chair of the W3C Linked Data Platform WG and the OASIS OSLC Core TC. It also may be on your agenda to attend this month’s Semantic Web Technology & Business Conference, where speakers including Le Hors, Manu Sporny, Sandro Hawke, and others will be presenting Linked Data-focused sessions.
In the meantime, though, you might enjoy reviewing the results of the LOD2 Project, the European Commission co-funded effort whose four-year run, begun in 2010, aimed at advancing RDF data management; extracting, creating and enriching structured RDF data; interlinking data from different sources; and authoring, exploring and visualizing Linked Data. To that end, why not take a stroll through the recently released Linked Open Data – Creating Knowledge Out of Interlinked Data, edited by LOD2 Project participants Sören Auer of the Institut für Informatik III, Rheinische Friedrich-Wilhelms-Universität; Volha Bryl of the University of Mannheim; and Sebastian Tramp of the University of Leipzig?
Christopher Tozzi of The VAR Guy reports, “PredictionIO, the open source machine learning platform, has received a big boost with the announcement of $2.5 million in seed funding, which it plans to use to make its automated data interpretation and prediction platform widely available to open source developers. PredictionIO’s goal is to make it easy for developers and companies of all sizes to integrate machine learning—i.e., software that can interpret data intelligently to make automated decisions and predictions—into their products. ‘PredictionIO aims to be the Machine Learning server behind every application,’ according to the company. ‘Building Machine Learning in software will be as common as search soon with PredictionIO’.”
Among the many exciting activities at the 10th Annual Semantic Technology & Business Conference (#SemTechBiz) is the partnership with the Linked Open Data in Libraries, Archives, and Museums (LODLAM) Community. On Tuesday, August 19, 2014, LODLAM will hold a full day of training sessions at the SemTechBiz Conference in San Jose, California. Registration information is available here.
We spoke to Jon Voss, Co-Founder of the International LODLAM Summit, about the Training Day:
SemanticWeb.com: What is the LODLAM Training Day?
SW: What can people expect to learn?
JV: We’ve broken the day down into two sections, basically: publishing data and reusing data. The first part of the day we’ll look at ways that libraries, archives and museums are putting massive amounts of structured data online for the public good, and what techniques and tools you can use to do it. The second part of the day we’ll be looking at using this data in different ways, how to use SPARQL queries, how to build data into other mashups, how to use open datasets to improve your own data, etc.
In Part 3 of this series, Jarek Wilkiewicz details activating the small Knowledge Graph (built on Cayley) with Schema.org Actions. He begins by explaining how Actions can be thought of as a combination of “Entities” (things) and “Affordances” (uses). As he defines it, “An affordance is a quality of an object, or an environment, which allows an individual to perform an action.”
For example, an action might be using the “ok Google” voice command on a mobile device. The more specific example that Wilkiewicz gives in the video (spoiler alert) is using the schema.org concept of potentialAction to trigger the playing of a specific artist’s music in a small music store’s mobile app.
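To make the potentialAction idea concrete, here is a minimal sketch in Python of what such markup might look like when expressed as JSON-LD. It is not the code from the video: the function name `listen_action_markup`, the artist name, and the `musicstore.example.com` URLs are all hypothetical placeholders. Only the schema.org terms (`MusicGroup`, `potentialAction`, `ListenAction`, `EntryPoint`, `urlTemplate`) come from the schema.org vocabulary.

```python
import json


def listen_action_markup(artist_name, artist_url, listen_url):
    """Build a JSON-LD description of an artist with a potential ListenAction.

    The entity (a MusicGroup) is paired with an affordance (a ListenAction)
    whose target tells a client where the action can be carried out.
    All argument values are supplied by the caller; nothing here is
    specific to any real store or app.
    """
    return {
        "@context": "http://schema.org",
        "@type": "MusicGroup",
        "name": artist_name,
        "url": artist_url,
        "potentialAction": {
            "@type": "ListenAction",
            "target": {
                "@type": "EntryPoint",
                "urlTemplate": listen_url,
            },
        },
    }


if __name__ == "__main__":
    # Hypothetical artist and URLs, for illustration only.
    markup = listen_action_markup(
        "Example Artist",
        "http://musicstore.example.com/artist/example-artist",
        "http://musicstore.example.com/listen?artist=example-artist",
    )
    print(json.dumps(markup, indent=2))
```

In this sketch the artist is the entity and the nested ListenAction is the affordance: a client that understands the vocabulary could read the target and offer to play that artist’s music.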
To learn more, and to meet Jarek Wilkiewicz and his Google colleague Shawn Simister in person, register for the Semantic Technology & Business Conference, where they will present “When 2 Billion Freebase Facts is Not Enough.”
Barak Michener, Software Engineer, Knowledge NYC, has posted on the Google Open Source Blog about “Cayley, an open source graph database”: “Four years ago this July, Google acquired Metaweb, bringing Freebase and linked open data to Google. It’s been astounding to watch the growth of the Knowledge Graph and how it has improved Google search to delight users every day. When I moved to New York last year, I saw just how far the concepts of Freebase and its data had spread through Google’s worldwide offices. I began to wonder how the concepts would advance if developers everywhere could work with similar tools. However, there wasn’t a graph available that was fast, free, and easy to get started working with. With the Freebase data already public and universally accessible, it was time to make it useful, and that meant writing some code as a side project.”
The post continues: “Cayley is a spiritual successor to graphd; it shares a similar query strategy for speed. While not an exact replica of its predecessor, it brings its own features to the table: RESTful API, multiple (modular) backend stores such as LevelDB and MongoDB, multiple (modular) query languages, easy to get started, simple to build on top of as a library, and of course open source. Cayley is written in Go, which was a natural choice. As a backend service that depends upon speed and concurrent access, Go seemed like a good fit.”
Straight out of Google I/O this week came some interesting announcements related to Semantic Web technologies and Linked Data. Included in the mix was a cool instructional video series about how to “Build a Small Knowledge Graph.” Part 1 was presented by Jarek Wilkiewicz, Knowledge Developer Advocate at Google (and SemTechBiz speaker).
Wilkiewicz fits a lot into the seven-and-a-half-minute piece, in which he presents a (sadly) hypothetical example of an online music store that he creates with his Google colleague Shawn Simister. During the example, he demonstrates the power and ease of leveraging multiple technologies, including the schema.org vocabulary (particularly the recently announced ‘Actions’), the JSON-LD syntax for expressing the machine-readable data, and the newly launched Cayley, an open source graph database (more on this in the next post in this series).
Standard Analytics, which was a participant at the recent TechStars event in New York City, has a big goal on its mind: To organize the world’s scientific information by building a complete scientific knowledge graph.
The company’s co-founders, Tiffany Bogich and Sebastien Ballesteros, came to the conclusion that someone had to take on the job as a result of their own experience as researchers. A problem they faced, says Bogich, was being able to access all the information behind published results, as well as search and discover across papers. “Our thesis is that if you can expose the moving parts – the data, code, media – and make science more discoverable, you can really advance and accelerate research,” she says.