Semantic Web Job: Data Engineer

Oxford University Press logoOxford University Press needs a data engineer. The job description states:

“To design and implement XML data structures, devise XML strategies, and through expert content analysis, facilitate the electronic and print production, publication, and licensing of OUP’s books, journals and dictionaries content.
• Analyze OUP and licensed texts, and the requirements for their production, online functionality and presentation, and general electronic use, to:
• Determine appropriate data structures to support those requirements (content and metadata),
• Devise quality control rules and processes,
• Design tools and workflows required by staff in Editorial and Production to produce, capture, and enhance the data.
• Write transformation scripts to convert source data into OUP standard data structures that meets the agreed data quality standards for publishing and licensing.” Read more

Want a music suggestion? Just ask DJ Twitter.

screen shot of a seevl hack in action via Alexandre Passant.Alexandre Passant, founder of seevl, which we have covered before, has hacked together a cool proof of concept. He describes the project as using “Twitter As A Service,” and it leverages Twitter, YouTube, and the seevl API. As Passant describes, “The result is a twitter bot, running under our @seevl handle, which accepts a few (controlled) natural-language queries and replies with an appropriate track, embedded in a Tweet via a YouTube card.”

He continues, “As it’s all Twitter-based, not only you can send messages, but you can have a conversation with your virtual DJ.”

Read more

You Can Help Make Linked Data Core To The Future of Identity, Payment On The Web Platform

ld1At the end of September, The World Wide Web Consortium (W3C) may approve the world’s first Web Payments Steering Group, to explore issues such as navigating around obstacles to seamless payments on the web and ways to better facilitate global transactions while respecting local laws. Identity and digital signatures have a role here, at the same time as they go beyond the realm of payment into privacy and other arenas. At the end of October, there also will be a W3C technical plenary, to discuss identity, graph normalization, digital signatures and payments technologies.

Expect Linked Data to come up in the context of both events, Manu Sporny told attendees at this August’s 10th annual Semantic Technology & Business conference in San Jose during his keynote address, entitled Building Linked Data Into the Core of the Web. “It is the foundational data model to build all this technology off of,” said Sporny, who is the founder and CEO of Digital Bazaar, which develops technology and services to make it easier to buy and sell digital content over the Internet. (See our stories about the company and its technology here.)  He also is founder and chair of the W3C Web Payments Community Group, chair of its RDFa Working Group, and founder, and chair and lead editor of the JSON-LD Community Group.

Read more

Semantic Web Job: Data Curator

logo for CUSP - Center for Urban Science + ProgressThe Center for Urban Science and Progress at New York University is seeking a data curator. Requirements include: “3-5 years experience in a related field, such as applied science, metadata schema design and management, taxonomy management, or equivalent education and experience; relevant experience in research data management as a researcher, research data manager, research data repository manager , or in similar roles. Demonstrated experience in consulting with faculty or researchers regarding technology or metadata creation; Demonstrated understanding of the research and data lifecycle. Demonstrated experience in curating and handling large data sets with particular understanding of requirements for longer term digital archiving. Understanding of database systems, XML, RDF, scientific metadata standards, API development, and related technologies; as well as protocols such as OAI-PMH. Experience with ontologies and metadata issues related to the discovery of academic or data resources. Experience constructing and maintaining a controlled vocabulary. ” Read more

Schema.Org: The Fire’s Been Lit

schemaorgadoptspecificWhy has schema.org made the following strides since its debut in 2011?

  • In a sample of over 12 billion web pages, 21 percent, or 2.5 billion pages, use it to mark up HTML pages, to the tune of more than 15 billion entities and more than 65 billion triples;
  • In that same sample, this works out to six entities and 26 facts per page with schema.org;
  • Just about every major site in every major category, from news to e-commerce (with the exception of Amazon.com), uses it;
  • Its ontology counts some 800 properties and 600 classes.

A lot of it has to do with the focus its proponents have had since the beginning on making it very easy for webmasters and developers to adopt and leverage the collection of shared vocabularies for page markup. At this August’s 10th annual Semantic Technology & Business conference in San Jose, Google Fellow Ramanathan V. Guha, one of the founders of schema.org, shared the progress of the initiative to develop one vocabulary that would be understood by all search engines and how it got to where it is today.

Read more

Analyzing Cognitive Computing

image of neuronsJamie Bisker reported, “Most consumers and professionals of all types have a basic feeling about technological innovation as something positive. It is true that we may bemoan the loss of a favorite aspect of the past, and tend to recall for the most part only favorable situations that strengthen such memories. But, in general, people feel upbeat about the convenience and capabilities that technology can provide. We evoke pleasant feelings from the past and that is our nature. It is also our nature, at a very deep biological level, to anticipate the future.

Jeff Hawkins, in his book On Intelligence, highlights research he both collected and directs about the physiological aspect of how neurons in the brain are connected. He has shown that prediction is basically wired in to a large portion of neural circuitry. Hawkins named this approach as a “memory-prediction framework.” He also makes the case for prediction as being one of the foundations of human intelligence.”

Read more

Smart Cities Are Coming

view of a city public tansportation yard and skyline at night.A recent press release states, “Transforming our cities into the Smart Cities of the future will encompass incorporating technologies and key digital developments all linked by machine-to-machine (M2M) solutions and real-time data analytics which sit under the umbrella term of the Internet of Things. Smart cities however must be underpinned by the appropriate ICT infrastructure based on fibre optic and high-speed wireless technologies, which is well underway in many developed cities around the world. This infrastructure allows for the development of smart communities; supporting connected homes; intelligent transport systems; e-health; e-government and e-education; smart grids and smart energy solutions – just to name a few of the exciting solutions smart cities will incorporate. Many of the technological advancements emerging around the world today can, and will be, applied to smart cities. Artificial Intelligence; Electric Vehicles; Autonomous Vehicles; Mobile applications; Drones; Wearable and Smart devices and so on are just some of the key developments to watch.” Read more

Semantic Web Job: Lead Software Engineer

Asymmetrik logoAsymmetrik seeks a lead software engineer. Responsibilities include: “The Lead Software Engineer will develop and maintain mission-critical information extraction, analysis, and management systems; implement analysis algorithms to evaluate, correlate, and display information; and provide direct and responsive support for urgent analytic needs. In addition, the Lead Software Engineer will participate in architecture and software development activities that may include:

Read more

Deconstructing JSON-LD

Photo of (clockwise from top-left: Aaron Bradley, Greg Kellogg, Phil Archer, Stephane CorlosquetAaron Bradley recently posted a roundtable discussion about JSON-LD which includes: “JSON-LD is everywhere. Okay, perhaps not everywhere, but JSON-LD loomed large at the 2014 Semantic Web Technology and Business Conference in San Jose, where it was on many speakers’ lips, and could be seen in the code examples of many presentations. I’ve read much about the format – and have even provided a thumbnail definition of JSON-LD in these pages – but I wanted to take advantage of the conference to learn more about JSON-LD, and to better understand why this very recently-developed standard has been such a runaway hit with developers. In this quest I could not have been more fortunate than to sit down with Gregg Kellogg, one of the editors of the W3C Recommendation for JSON-LD, to learn more about the format, its promise as a developmental tool, and – particularly important to me as a search marketer – the role in the evolution of schema.org.”

Read more

Big Data Review: What The Surveys Say

bd2Big Data has been getting its fair share of commentary over the last couple months. Surveys from multiple sources have commented on trends and expectations. The Semantic Web Blog provides some highlights here:

  • From Accenture Anayltics’s new Big Success With Big Data report: There remain some gaps in what constitutes Big Data for respondents to its survey: Just 43 percent, for instance, classify unstructured data as part of the package. That option included open text, video and voice. Those are gaps that could be filled leveraging technologies such as machine learning, speech recognition and natural language understanding, but they won’t be unless executives make these sources a focus of Big Data initiatives to start with.
  • From Teradata’s new survey on Big Data Analytics in the UK, France and Germany: Close to 50 percent of respondents in the latter two countries are using three or more data types (from sources ranging from social media, to video, to web blogs, to call center notes, to audio files and the Internet of Things) in their efforts, compared to just 20 percent in the UK.  A much higher percentage of UK businesses (51 percent) are currently using just a single type of new data, such as video data, compared with France and Germany, where only 21 percent are limiting themselves to one type of new data, it notes. Forty-four percent of execs in Germany and 35 percent in France point social media as the source of the new data. About one-third of respondents in each of those countries are investigating video, as well.

Read more

NEXT PAGE >>