By Dan McCreary on June 14, 2011 11:00 AM
There are three trends that I observed at SemTech 2011 in San Francisco last week. First was the increased role of native XML databases used in combination with RDF data stores. Second was the many natural-language processing tools and vendors at the conference. And third was the role of semantic annotations and standards directly in web content. I think these trends are related.
One of the keynote presentations at the SemTech 2011 conference was done by the BBC. They presented their core architecture for managing web content as having two main components: a native XML database(MarkLogic) for content and a RDF triple store for “metadata.” These tools were at the core of their architecture for their web sites.
Another presentation was done by the Mayo Clinic. They also are using MarkLogic for web content and are also using semantic web technologies. Their diagrams show that there are many ways for these systems to interact.
Read more

The
Semantic Tech & Business Conference (SemTechBiz) is coming to San Francisco on June 3-7! Join us for case studies, innovative panels, tutorials, and keynotes that will provide you with practical advice, hands-on guidance, and breakthrough approaches to solving business problems with semantic technology.
Passes go up $200 at the door.
Sign up now and save !
By Dean Allemang on December 1, 2009 8:24 PM
In part I of this two-part series, Dean Allemang & Scott Henninger draw on years of teaching TopQuadrant’s introduction course on the Semantic Web to make some observations on teaching Semantic Web concepts to a wide variety of students.
Read more
By Semantic Universe on June 23, 2009 12:38 PM
The “XSPARQL†specification has been published as a W3C member submission, co-authored by experts of Asemantics S.R.L., DERI Galway, Fundación CTIC, INRIA, Ontotext, OpenLink Software Inc., Profium, Talis Information Ltd., and the University of Innsbruck. This specification defines a merge of SPARQL and XQuery, and has the potential to bring XML and RDF closer together. XSPARQL provides concise and intuitive solutions for mapping between XML and RDF in either direction, addressing both the use cases of GRDDL and SAWSDL.
By Michael M David on April 14, 2009 1:03 AM
Executive Summary
XML Keyword Search is still a popular academic subject. It has not reached or been recognized by XML and Internet commercial products yet. The concepts involved are also very important to the semantic web. The semantics industry today with its work on higher level semantics like ontologies and taxonomies has overlooked the importance of utilizing the semantics of hierarchical structured data like XML. When working with hierarchically structured data, the first level of handling semantic understanding must be recognizing the hierarchical structure and its (lower level) hierarchical semantics. This is then used to eliminate false keyword search results that can show up as matches in hierarchical structures; otherwise they will go undetected to the higher level semantic processing which will also not detect them since they are not concerned with the structure of the data. This will cause unmeaningful results to be returned.
Read more
By Dan McCreary on January 12, 2009 8:08 PM
Entity Extraction is the process of automatically extracting document metadata from unstructured text documents. Extracting key entities such as person names, locations, dates, specialized terms and product terminology from free-form text can empower organizations to not only improve keyword search but also open the door to semantic search, faceted search and document repurposing. This article defines the field of entity extraction, shows some of the technical challenges involved, and shows how RDF can be used to store document annotations. It then shows how new tools such as Apache UIMA are poised to make entity extraction much more cost effective to an organization.
Read more