eBay is looking for an Applied Research Engineer in San Jose, CA. According to the post, this position will be the “Technical lead for applying natural language processing, data mining, and image processing to help drive definition of product direction, improve social commerce, and increase social networks. Define, design, and implement complex data processes, data pipeline to improve buyer experience at eBay. As a self-motivated and enthusiastic member of our team, you will work with extremely talented peers in a fun environment pushing the performance bar while solving various problems. Develop innovative solutions that not only meet functionality requirements but meet performance, scalability and reliability requirements while adhering to implementation schedules, development principles and product goals.” Read more
IANS Live recently wrote, “[Twitter] has finally given access to its vast database to a selected pool of researchers to study tweets and find answers to a variety of issues. As part of its ambitious data grant programme, Twitter is allowing academic researchers across various fields to ‘go back and study things’ over, with almost a decade of historical data, Washington Post reported. While Harvard Medical School and Boston Children’s Hospital are looking at tweets about food-poisoning cases to find answers to the spread of food-borne illnesses, researchers from the University of California at San Diego are studying whether happy people are likely to post happy images on Twitter.” Read more
The win was announced at the 13th International Semantic Web Conference, in Riva del Garda, Italy, with FLAX taking first place from 10 shortlisted candidates. Read more
DBpedia, as described in the recent semanticweb.com article DBpedia 2014 Announced, is “a crowd-sourced community effort to extract structured information from Wikipedia and make this information available on the Web.” It currently has over 3 billion triples (that is, facts stored using the W3C standard RDF data model) available for use by applications, making it a cornerstone of the semantic web.
A surprising amount of this data is expressed using the SKOS vocabulary, the W3C standard model for taxonomies used by the Library of Congress, the New York Times, and many other organizations to publish their taxonomies and subject headers. (semanticweb.com has covered SKOS many times in the past.) DBpedia has data about over a million SKOS concepts, arranged hierarchically and ready for you to pull down with simple queries so that you can use them in your RDF applications to add value to your own content and other data.
Where is this taxonomy data in DBpedia?
Many people think of DBpedia as mostly storing the fielded “infobox” information that you see in the gray boxes on the right side of Wikipedia pages—for example, the names of the founders and the net income figures that you see on the right side of the Wikipedia page for IBM. If you scroll to the bottom of that page, you’ll also see the categories that have been assigned to IBM in Wikipedia such as “Companies listed on the New York Stock Exchange” and “Computer hardware companies.” The Wikipedia page for Computer hardware companies lists companies that fall into this category, as well as two other interesting sets of information: subcategories (or, in taxonomist parlance, narrower categories) such as “Computer storage companies” and “Fabless semiconductor companies,” and then, at the bottom of the page, categories that are broader than “Computer hardware companies” such as “Computer companies” and “Electronics companies.”
How does DBpedia store this categorization information? The DBpedia page for IBM shows that DBpedia includes triples saying that IBM has Dublin Core subject values such as
category:Computer_hardware_companies. The DBpedia page for the category
Computer_hardware_companies shows that is a SKOS concept with values for the two key properties of a SKOS concept: a preferred label and broader values. The
category:Computer_hardware_companies concept is itself the broader value of several other concepts such as
category:Fabless_semiconductor_companies. Because it’s the broader value of other concepts and has its own broader values, it can be both a parent node and a child node in a tree of taxonomic terms, so DBpedia has the data that lets you build a taxonomy hierarchy around any of its categories.
Orbis Technologies is looking for a Software Developer – Cloud and Big Data in Annapolis, MD. According to the post, “Orbis Technologies, located in Annapolis, MD, is a leader in providing cloud computing-based semantic text analytics, using MapReduce, to support entity extraction, relationship identification, and semantic search in a Hadoop cloud-processing environment. We are interested in speaking to talented candidates who desire to further their careers in big data and open source development predominantly in Java, and work with cloud computing including the use of Hadoop, Accumulo, CloudBase, HBase, and core semantic web technologies.” Read more
REDWOOD CITY, CA–(Marketwired – Oct 28, 2014) – Yummly (http://www.yummly.com), the leading innovator in recipe search & discovery, announced today, the introduction of contextual recommendations on the iPhone and iPad apps. When users open the app, in addition to personalizing the content to a user’s tastes, Yummly will now tailor to a person’s time, place and patterns.
Yummly’s proprietary Food Genome and patent-pending Food Intelligence technology already blends together to create an unmatched user experience with data-driven features such as personalized recommendations, semantic search, and a smart shopping list. With the new contextual recommendations functionality, it is bringing together more relevant and dynamic content to the users by leveraging a combination of contextual signals such as time of day, day of week, season, location, trends and more. Read more
Zach Miners of PC World reports, “Bing now supports searches with emoji, meaning you can insert or paste a range of emoji icons like hearts, smiley faces, food graphics, or any combination thereof, for some interesting, though not always useful, results. It’s a novelty feature, yes, but still fun. And one that could help Bing draw at least some attention away from Google. Google at the moment does not give results for emoji searches, though its auto-complete technology does recognize them. Yahoo, meanwhile, does support emoji searches. Bing’s tool is available in all English markets, the search engine said, offered as an homage to the shorthand’s popularity.” Read more
Word came from the World Wide Web Consortium (W3C) yesterday that it has published the 5th major revision of HTML, the core language of the web. While HTML5 is already in use by developers (having become a W3C candidate recommendation a couple of years ago), the recommendation for the standard is a lynchpin for the community, as it now formalizes stable guidelines for the development of innovative and cross-platform web sites and applications.
A key feature of HTML5 – the first major new HTML standard in more than a decade – is that it provides the ability to describe the structure of a web document with standard semantics. It uses semantic tags for things like page headers, footers, body, ordered lists, time, and more to better identify an element and how it is being used. Greater use of these tags should improve a browser’s ability to understand content for display across a range of devices and screen sizes without requiring any development rejiggering, and search engines’ ability to more effectively index a page, which could lead to better rankings.
“Transformations for Integrating VA data with FHIR in RDF”
Part 1 of that series, “The Yosemite Project: An RDF Roadmap for Healthcare Information Interoperability,” is available as a recorded webinar and slide deck.
Part 2, “The Ideal Medium for Health Data? A Dive into Lab Tests,” will take place on November 7, 2014 (registration is open as of this writing).
Announcing Part 3:
TITLE: Transformations for Integrating VA data with FHIR in RDF
DATE: Wednesday, November 12, 2014
TIME: 2 PM Eastern / 11 AM Pacific
PRICE: Free to all attendees
DESCRIPTION: In our series on The Yosemite Project, we explore RDF as a data standard for health data. In this installment, we will hear from Rafael Richards, Physician Informatician, Office of Informatics and Analytics in the Veterans Health Administration (VHA), about “Transformations for Integrating VA data with FHIR in RDF.”
The VistA EHR has its own data model and vocabularies for representing healthcare data. This webinar describes how SPARQL Inference Notation (SPIN) can be used to translate VistA data to the data represented used by FHIR, an emerging interchange standard.
The IBM Collaborative Discovery Research Team is looking for a Research Scientist in San Jose, CA. The post states, “We are looking for Research Staff Members with experience in building larger software systems and collaborative software development to help build our platform. Candidates in this role generate highly novel ideas (theoretical or experimental) in a specific engineering or scientific discipline and invent and design complex products and processes. This position may be involved in engineering these ideas to an advanced state of feasibility by evaluating ideas and plans and participating in their implementation. The full cycle of innovation to delivery is typically a multiple-year effort.” Read more
NEXT PAGE >>