The World Wide Web Consortium is looking for a Research Scientist in Cambridge, MA. According to the post, “Computer Science and Artificial Intelligence Laboratory (CSAIL)-World Wide Web Consortium (W3C) (part-time, 50%), to bring social and identity technologies on the Web to their full potential. Will conduct cutting-edge research on social and identity technologies and apply that research to the construction of new standards; provide leadership and management to implement these technologies into Web standards; collaborate across disciplines with other researchers at MIT, outside universities, and across industries to build the expert groups required to build and implement technological standards; and perform other duties as needed.” Read more
According to a new article on Gov.uk, “The UK’s Open Data Institute (ODI) and Taiwan’s Open Data Alliance (ODA) have signed a Letter of Intent today, which will see the two organisations promote and explore the potential open data holds for the public, private and academic sectors in both countries. The Letter was signed by ODI Chairman and Co-Founder Sir Nigel Shadbolt during a visit to Taipei, and Chairman Peng Chi-Ming, from Taiwan’s Open Data Alliance at a high level open data forum which involved Taiwan’s ICT Minister Chang San-Cheng and Chris Wood, Director of the British Trade & Cultural Office in Taiwan.” Read more
Lara O’Reilly of Marketing Week reports, “The publisher first started trialling ‘FT Smart Match’ in 2011. It uses semantic profiling technology from Smartology that goes beyond keyword recognition to recognise the meaning, concepts and categories behind content that appears on FT.com, to help brands appear next to relevant content in real time and avoid appearing next to inappropriate articles. There have been several high profile cases of unfortunate automated ad misplacements online over the years, such as a Red Stripe “Hooray Beer!” ad appearing next to a Reuters story about a child being charged with drink-driving and an advert for Greek holidays juxtaposed next to a Guardian story about violent riots in Athens.” Read more
This week saw Frost & Sullivan award its 2013 Company of the Year to Definiens, a provider of image analysis solutions and data mining solutions for life science, tissue diagnostics, and clinical digital pathology. Definiens’ gaining of the title owes much to its work around tissue datafication that’s leveraging its Definiens Cognition Network Technology, which the company says mimics the human mind’s cognitive powers to reposition knowledge within a semantic network.
“What we do essentially is look at ways to be able to better diagnose cancer and develop therapies,” says Merrilyn Datta, CMO at Definiens. The company looks to extract data from tumor images, historically available as slides from biopsies, datafying the tissues involved to create digital images and then using its Cognition Network Technology to extract all the different relevant objects in that image and correlate them to patient outcomes. “That can be extremely, extremely powerful,” says Datta.
The image analysis technology was developed by Physics Nobel Laureate Gerd Binning, and includes a set of principles aimed at emulating the human mind’s cognitive powers, which are defined by the ability to intuitively aggregate pixels into ‘objects’ and understand the context and relationships between those objects rather than the computer’s normal way of just examining images pixel by pixel. These principles include: context, which is established and utilized through the technology’s creation of a hierarchical network of pixel clusters representing nested structures within the image; navigation, for supporting efficient navigation inside the network in order to enable specific local processing and addressing of specific contexts; and evolution of the network, in which the individual stages of segmentation and classification are alternated and the structures represented within the network are created and constantly improved in a series of loops, whereby each classification can be enhanced with local context and specific expert knowledge.
The Institute for Systems Biology (ISB) is looking for a Software Developer in Seattle, WA, to “build web based tools for analysis of cancer genomic data to advance understanding of tumor biology, accelerate new discoveries, and facilitate personalized therapy as a part of our recent award from the Life Science Discovery Fund (LSDF | http://lsdfa.org). This position will also contribute to our work as a genomic data analysis center for The Cancer Genome Atlas (TCGA | http://cancergenome.nih.gov/).” Read more
NEW YORK, Dec. 10, 2013 /PRNewswire/ – Publishers can finally align their video assets with their editorial text using content relevance technology from veeseo. Proven in Europe with some of the region’s top websites, including AOL, Axel Springer, Handelsblatt, MSN and Spiegel, veeseo adds significant value to site editorial, which results in videos with mid-article placement, higher engagement scores, and clicks that remain on the publisher site. Touchstorm partnered with Hamburg-based veeseo GmbH to bring the technology to the North American market, setting up a new venture, veeseo North America, to run the business. Read more
Colin Jeavons of Search Engine Journal recently wrote, “Millions of people are already using semantic search and they don’t know it. Some of the world’s most popular search engines and social networking sites are using the technology to make it easier to make connections, learn, and explore interests. It’s quietly become a part of our lives, and innovative companies are pushing the technology and industry toward new horizons. So, what changed? Actually, it was users. Last year, 20 percent of Google searches were new, due to the fact that people started typing sentences and paragraphs into search engines, expecting keyword searches to operate like natural language. Today, user demands for answers to their questions are satiated with innovations like Google’s Hummingbird. People are now searching for ‘cheap flights to Miami on January 7th” rather than just ‘cheap flights.’ This change in consumer behavior is a significant milestone.” Read more
The World Wide Web Consortium has headline news today: The Semantic Web, as well as eGovernment, Activities are being merged and superseded by the Data Activity, where Phil Archer serves as Lead. Two new workgroups also have been chartered: CSV on the Web and Data on the Web Best Practices.
What’s driving this? First, Archer explains, the Semantic Web technology stack is now mature, and it’s time to allow those updated standards to be used. With RDF 1.1, the Linked Data Platform, SPARQL 1.1, RDB To RDF Mapping Language (R2RML), OWL 2, and Provenance all done or very close to it, it’s the right time “to take that very successful technology stack and try to implement it in the wider environment,” Archer says, rather than continue tinkering with the standards.
The second reason, he notes, is that a large community exists “that sees Linked Data, let alone the full Semantic Web, as an unnecessarily complicated technology. To many developers, data means JSON — anything else is a problem. During the Open Data on the Web workshop held in London in April, Open Knowledge Foundation co-founder and director Rufus Pollock said that if he suggested to the developers that they learn SPARQL he’d be laughed at – and he’s not alone.” Archer says. “We need to end the religious wars, where they exist, and try to make it easier to work with data in the format that people like to work in.”
The new CSV on the Web Working Group is an important step in that direction, following on the heels of efforts such as R2RML. It’s about providing metadata about CSV files, such as column headings, data types, and annotations, and, with it, making it easily possible to convert CSV into RDF (or other formats), easing data integration. “The working group will define a metadata vocabulary and then a protocol for how to link data to metadata (presumably using HTTP Link headers) or embed the metadata directly. Since the links between data and metadata can work in either direction, the data can come from an API that returns tabular data just as easily as it can a static file,” says Archer. “It doesn’t take much imagination to string together a tool chain that allows you to run SPARQL queries against ’5 Star Data’ that’s actually published as a CSV exported from a spreadsheet.”
Goldman Sachs is looking for a Senior Developer in Jersey City, NJ. The post states, “TIDM is an ecosystem that provides inventory data to the firm with a demonstrated SLA of 99.999% availability and has been deployed globally. We’re looking for software engineers who want to work on cutting edge technologies in an agile environment. Skills would include: Java and Groovy, meta programming, graph databases, RDF databases, ETL technologies, NoSQL technologies and HADOOP/HBase. Ontology experience a plus.” Read more
Jay Myers of BestBuy recently wrote, “Shortly before Black Friday, one of my colleagues approached me with a curious question. ‘Mr. SVP XYZ was talking today about us creating a promo page of ‘stocking stuffers’. Do you think you could produce a list of products that might be ‘stocking stuffers’?’. After some discussion, we agreed that these products would be under $20 and be 5”x5” or smaller to qualify as a stocking stuffer. In a couple hours time we had a SPARQL generated list of 190 products (thank you @bsletten) on a promo page for anyone who searched for the ‘stocking stuffers’ phrase. A classic last minute, rogue (skunkworks?) effort.” Read more
NEXT PAGE >>