Posts Tagged ‘Information Extraction’

Alta Plana Takes The Pulse Of Text Analytics

Seth Grimes, president and principal consultant of Alta Plana Corp. and founding chair of the Sentiment Analysis Symposium, has put together a thorough new report, Text Analytics 2014: User Perspectives on Solutions and Providers. Among the interesting findings of the report is that “growth in text analytics, as a vendor market category, has slackened, even while adoption of text analytics, as a technique, has continued to expand rapidly.”

Grimes explains that in a fragmented market, consisting of everything from text analytics services to solution-embedded technologies, the opportunities for users to practice text analytics are strong, but that text analytics is increasingly not the main focal point of the solutions being used.

Reflecting the diversity of options, respondents listed among their providers a number of open-source offerings such as Apache OpenNLP and GATE, API services such as AlchemyAPI and Semantria, and enterprise software solution and business suite providers like SAP. The word cloud above was generated by Alta Plana at Wordle.net to show how users responded when asked which companies they know to provide text/content analytics functionality. Nearly 50 percent of users are likely to recommend their most important provider.

Read more

NLP Market Set For Growth; HealthCare Among Leading Early Adopter Industries

The natural language processing (NLP) market is moving ahead at a steady clip. According to the recently released report, Natural Language Processing Market – Worldwide Market Forecast and Analysis (2013 – 2018), the sector is estimated to grow from $3,787.3 million in 2013 to $9,858.4 million in 2018. That’s an estimated 21 percent CAGR.
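For readers who want to check the arithmetic, here is a minimal sketch of the standard compound annual growth rate formula applied to the two figures quoted above. The function name `cagr` is just an illustrative choice, and the five-year period is an assumption based on the 2013 and 2018 endpoints; the report's own methodology may differ.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate as a fraction (0.10 means 10 percent)."""
    return (end_value / start_value) ** (1 / years) - 1


# Figures quoted from the report, in millions of US dollars.
# Assumption: five compounding periods between 2013 and 2018.
rate = cagr(3787.3, 9858.4, 2018 - 2013)
print(f"Estimated CAGR: {rate:.1%}")
```

Under that assumption the result comes out at roughly 21 percent, in line with the figure cited.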

The report treats the market as spanning multiple technologies: recognition technologies such as Interactive Voice Response, Optical Character Recognition, and pattern and image recognition; operational technologies such as auto coding and classification and categorization technologies; text analytics and speech analytics technologies; and machine translation, information extraction and question-answer report generation.

Driving the uptake, the report notes, is the need to enhance customer experiences, especially in an age when the smartphone rules and Big Data predominates. The biggest industry adopters of the technology, it says, are healthcare, banking and financial services, and e-commerce, where NLP technology can take in hand the rapid growth in real-time and unstructured customer data and transaction information, analyzing customer needs and then optimizing responses to them, and taking out some of the human labor costs of doing so.

Read more

Edamam Food Knowledge Site Takes To The iPad, Improves Desktop Experience

Edamam, which has built a food ontology for its food knowledge site (which The Semantic Web Blog initially covered here), is adding an iPad version of its app to its existing iPhone and Android versions. The company has also fully relaunched its web site to optimize the experience for desktop users, with improved browsing and search.

Originally, the web site app mirrored the mobile versions. But, says co-founder and CEO Victor Penev, “We realized that people wanted to be able to access recipes and search on the desktop, and they should have a holistic experience from anywhere.” While the company had been more focused on the mobile arena, Penev says building traffic for the website is going to be a priority too. Among the capabilities users should see in the near future is one that will let people save recipes on their iPhone or Android mobile devices and then access them on their iPads or desktops, or vice versa.

Read more

Edamam’s Semantic Smarts Help Serve Up Dinner Plans

Edamam wants to be the one place where all the food knowledge of the world is organized. That’s the goal of co-founder and CEO Victor Penev, who launched the site in April and recently expanded its knowledge base of several hundred major recipe sites to include some smaller blog sites that add variety.

Semantic technology is helping the company reach its goal. “A big problem is that data about food is very messy,” says Penev. “It’s hard to find something, what you find often contradicts other information of what is good for you and what the calories are. So we set out to solve that problem. We played around with different approaches but settled on using semantic technology.”

The confusion arises in part from the fact that recipe sites themselves usually just hire services to calculate nutritional data. But that may lead to mistakes when calculations aren’t undertaken with exactitude — substituting white cream for heavy cream nutritional details changes the whole profile of the recipe, he says.

So, what is that right semantic stuff? One piece of it is that, in conjunction with Ontotext, Edamam built a food ontology. An ontology can be the foundation for a lot of things, such as extracting the knowledge of the chemical composition of a particular recipe and thus inferring its flavor and texture. And Edamam means to grow its own to include various datasets such as chemical data (for flavor and texture), geolocation (for local and seasonal recipes), product data (for e-commerce), and more.

But initially, it’s taken the simple approach, with the core of the ontology focused around classifying ingredients, nutrients and food. “We have started with the simplest ontology and focused on the most common use case — mobile recipe search,” he says.

Read more

Showing at The International American Toy Fair: Tangible, Touchable Semantic Technology

The coolest thing at the 109th International American Toy Fair in New York City this week might have been the Lazer Tag Blaster or the World of Warcraft version of Monopoly. Or, for semantic tech aficionados, it would have been Uma’s semantic Skin multitouch display installation. Even the Power Rangers were getting into it (see photo).

Here is the marriage of semantic technology with interactive signage and multi-touch displays, RFID technology, Intel’s Audience Impression Metrics suite, and social media integration. It is, as Christian Doegl, founder and CEO of uma, puts it, an example “where semantics gets tangible.” And touchable by everyone.

For the Toy Fair, Uma got access to the exhibitor database, itself complete with structured metadata such as company name, location on the floor, and Twitter handle. “From this we can build up a semantic database connecting all different databases to the system,” says Doegl.

Read more

Ten Grand Challenges of IT

Mike Bergman recently shared a list of the top ten challenges facing IT over the last ten years and the amazing strides that have been made in each area. Bergman states that in the last ten years, “a whole slew of Grand Challenges in computing hung out there: tantalizing yet not proven. These areas ranged from information extraction and natural language understanding to speech recognition and automated reasoning. But things have been changing fast, and with a subtle steadiness that has caused it to go largely unremarked. Sure, all of us have been aware of the huge changes on the Web and search engine ubiquity and social networking. But some of the fundamentally hard problems in computing have also gone through some remarkable (but largely unremarked) advances.” Read more

The Semantification of Chemistry


Executive Summary

Chemistry is a central science and the data produced as a consequence is immense. However, much of this data is unstructured, which makes data integration difficult. In this article, we demonstrate how chemical data can be retrieved from reports, scientific theses and papers or patents and discuss how these sources can be processed using natural language processing techniques and named-entity recognisers to produce chemical data and knowledge expressed in RDF.

Read more

In Search of Tags Lost: Combining Social Bookmarking and SemWeb Technologies

— PHILIP DUDCHUK, SERGE MINOR


Executive Summary

We introduce a web application that integrates the core idea of social bookmarking with semantic components, making it possible to enhance search and navigation and to overcome the drawbacks of collaborative tagging.

Read more