Twitter Lets Researchers Mine Through Its Data

TwitterIANS Live recently wrote, “[Twitter] has finally given access to its vast database to a selected pool of researchers to study tweets and find answers to a variety of issues. As part of its ambitious data grant programme, Twitter is allowing academic researchers across various fields to ‘go back and study things’ over, with almost a decade of historical data, Washington Post reported. While Harvard Medical School and Boston Children’s Hospital are looking at tweets about food-poisoning cases to find answers to the spread of food-borne illnesses, researchers from the University of California at San Diego are studying whether happy people are likely to post happy images on Twitter.” Read more

FLAX a Winner on the World’s Technology Stage

flax29 October, 2014 — Software designed at the University of Waikato and used all around the world has won a major international competition.

FLAX, which stands for Flexible Learning Acquisition, has taken first place at the Linked Vici Competition, held to acknowledge development in open and linked data for educational purposes.

The win was announced at the 13th International Semantic Web Conference, in Riva del Garda, Italy, with FLAX taking first place from 10 shortlisted candidates. Read more

Retrieving and Using Taxonomy Data from DBpedia

DBpedia logo on a halloween jack-o-lanternDBpedia, as described in the recent semanticweb.com article DBpedia 2014 Announced, is “a crowd-sourced community effort to extract structured information from Wikipedia and make this information available on the Web.” It currently has over 3 billion triples (that is, facts stored using the W3C standard RDF data model) available for use by applications, making it a cornerstone of the semantic web.

A surprising amount of this data is expressed using the SKOS vocabulary, the W3C standard model for taxonomies used by the Library of Congress, the New York Times, and many other organizations to publish their taxonomies and subject headers. (semanticweb.com has covered SKOS many times in the past.) DBpedia has data about over a million SKOS concepts, arranged hierarchically and ready for you to pull down with simple queries so that you can use them in your RDF applications to add value to your own content and other data.

Where is this taxonomy data in DBpedia?

Many people think of DBpedia as mostly storing the fielded “infobox” information that you see in the gray boxes on the right side of Wikipedia pages—for example, the names of the founders and the net income figures that you see on the right side of the Wikipedia page for IBM. If you scroll to the bottom of that page, you’ll also see the categories that have been assigned to IBM in Wikipedia such as “Companies listed on the New York Stock Exchange” and “Computer hardware companies.” The Wikipedia page for Computer hardware companies lists companies that fall into this category, as well as two other interesting sets of information: subcategories (or, in taxonomist parlance, narrower categories) such as “Computer storage companies” and “Fabless semiconductor companies,” and then, at the bottom of the page, categories that are broader than “Computer hardware companies” such as “Computer companies” and “Electronics companies.”

How does DBpedia store this categorization information? The DBpedia page for IBM shows that DBpedia includes triples saying that IBM has Dublin Core subject values such as category:Companies_listed_on_the_New_York_Stock_Exchange and category:Computer_hardware_companies. The DBpedia page for the category Computer_hardware_companies shows that is a SKOS concept with values for the two key properties of a SKOS concept: a preferred label and broader values. The category:Computer_hardware_companies concept is itself the broader value of several other concepts such as category:Fabless_semiconductor_companies. Because it’s the broader value of other concepts and has its own broader values, it can be both a parent node and a child node in a tree of taxonomic terms, so DBpedia has the data that lets you build a taxonomy hierarchy around any of its categories.

Read more

Semantic Web Jobs: Orbis Technologies

OrbisOrbis Technologies is looking for a Software Developer – Cloud and Big Data in Annapolis, MD. According to the post, “Orbis Technologies, located in Annapolis, MD, is a leader in providing cloud computing-based semantic text analytics, using MapReduce, to support entity extraction, relationship identification, and semantic search in a Hadoop cloud-processing environment. We are interested in speaking to talented candidates who desire to further their careers in big data and open source development predominantly in Java, and work with cloud computing including the use of Hadoop, Accumulo, CloudBase, HBase, and core semantic web technologies.” Read more

Smart Recipe Platform Yummly Brings Contextual Recommendations to iOS Apps

YummlyREDWOOD CITY, CA–(Marketwired – Oct 28, 2014) – Yummly (http://www.yummly.com), the leading innovator in recipe search & discovery, announced today, the introduction of contextual recommendations on the iPhone and iPad apps. When users open the app, in addition to personalizing the content to a user’s tastes, Yummly will now tailor to a person’s time, place and patterns.

Yummly’s proprietary Food Genome and patent-pending Food Intelligence technology already blends together to create an unmatched user experience with data-driven features such as personalized recommendations, semantic search, and a smart shopping list. With the new contextual recommendations functionality, it is bringing together more relevant and dynamic content to the users by leveraging a combination of contextual signals such as time of day, day of week, season, location, trends and more. Read more

Just What We’ve All Been Waiting For: Bing Now Offers Emoji Search

queenZach Miners of PC World reports, “Bing now supports searches with emoji, meaning you can insert or paste a range of emoji icons like hearts, smiley faces, food graphics, or any combination thereof, for some interesting, though not always useful, results. It’s a novelty feature, yes, but still fun. And one that could help Bing draw at least some attention away from Google. Google at the moment does not give results for emoji searches, though its auto-complete technology does recognize them. Yahoo, meanwhile, does support emoji searches. Bing’s tool is available in all English markets, the search engine said, offered as an homage to the shorthand’s popularity.” Read more

HTML5: The Party Is Officially On!

w3chtmlWord came from the World Wide Web Consortium (W3C) yesterday that it has published the 5th major revision of HTML, the core language of the web. While HTML5 is already in use by developers (having become a W3C candidate recommendation a couple of years ago), the recommendation for the standard is a lynchpin for the community, as it now formalizes stable guidelines for the development of innovative and cross-platform web sites and applications.

A key feature of HTML5 – the first major new HTML standard in more than a decade – is that it provides the ability to describe the structure of a web document with standard semantics. It uses semantic tags for things like page headers, footers, body, ordered lists, time, and more to better identify an element and how it is being used. Greater use of these tags should improve a browser’s ability to understand content for display across a range of devices and screen sizes without requiring any development rejiggering, and search engines’ ability to more effectively index a page, which could lead to better rankings.

Read more

NEW WEBINAR Announced: Yosemite Project – Part 3

“Transformations for Integrating VA data with FHIR in RDF”

Yosemite Project Part 3: Part 3-Transformations for Integrating VA data with FHIR in RDF SemanticWeb.com recently launched a series of webinars on the topic of “RDF as a Universal Healthcare Exchange Language.”

Part 1 of that series, “The Yosemite Project: An RDF Roadmap for Healthcare Information Interoperability,” is available as a recorded webinar and slide deck.

Part 2,The Ideal Medium for Health Data? A Dive into Lab Tests,” will take place on November 7, 2014 (registration is open as of this writing).

Announcing Part 3:

click here to register now!
TITLE: Transformations for Integrating VA data with FHIR in RDF
DATE: Wednesday, November 12, 2014
TIME: 2 PM Eastern / 11 AM Pacific
PRICE: Free to all attendees
DESCRIPTION: In our series on The Yosemite Project, we explore RDF as a data standard for health data. In this installment, we will hear from Rafael Richards, Physician Informatician, Office of Informatics and Analytics in the Veterans Health Administration (VHA), about “Transformations for Integrating VA data with FHIR in RDF.”

The VistA EHR has its own data model and vocabularies for representing healthcare data. This webinar describes how SPARQL Inference Notation (SPIN) can be used to translate VistA data to the data represented used by FHIR, an emerging interchange standard.

 

Read more

Semantic Web Jobs: IBM

ibmThe IBM Collaborative Discovery Research Team is looking for a Research Scientist in San Jose, CA. The post states, “We are looking for Research Staff Members with experience in building larger software systems and collaborative software development to help build our platform. Candidates in this role generate highly novel ideas (theoretical or experimental) in a specific engineering or scientific discipline and invent and design complex products and processes. This position may be involved in engineering these ideas to an advanced state of feasibility by evaluating ideas and plans and participating in their implementation. The full cycle of innovation to delivery is typically a multiple-year effort.” Read more

University of Amsterdam Researchers Use Neural Networks to Improve Machine Translations

amsterdamLoek Essers of Tech World recently wrote, “Researchers at the University of Amsterdam are using neural networks to help a statistical machine translation systems learn what all human translators know — that the best translation of a word often depends on the context. Machine translation systems such as Google Translate or those at iTranslate4.eu guess how to translate words and phrases based on how often they appear in a large corpus of human-translated texts. Such tools are increasingly important as individuals and businesses seek to access information or buy products and services from other countries where different languages are spoken.” Read more

NEXT PAGE >>