The Semantic Standards Gap
Jennifer Zaino
SemanticWeb.com Contributor
A number of efforts underway at the Object Management Group, a non-profit standards-setting industry consortium, have important implications for the development of semantic technologies.
The OMG looks at the semantic technology space from a standards perspective, with the idea in mind that, to make this all work, organizations will have to be able to take existing artifacts — whether they are in databases, in text documents, in software, and so on — and at least partway automate the process of using them for developing semantic ontologies.
“If you can’t automate it, you can’t get the traction needed to be useful,” says Elisa F. Kendall, co-chair of the OMG’s ontology group and CEO of Sandpiper Software, Inc. Sandpiper provides context modeling and transformation products that work with enterprise application and data integration tools and services to facilitate collaboration and information sharing among multiple databases, applications, and users in multi-vendor environments.
“The OMG is all about interoperability, and secondly about automation through modeling, and how you can leverage existing artifacts and re-factor them and model them better to get new capabilities out of them, or even to develop new stuff that is structured well and can be reusable because you modeled it that way.”
Kendall, who is also an active member of two working groups at the W3C, Semantic Web Deployment and the new OWL working group, became involved in the OMG ontology group because of Sandpiper’s work in developing a graphical way for people to design ontologies. UML, one of the primary standards from the OMG, is a graphical model notation that has a huge following among software developers, making it a good candidate to leverage for the development of semantic technologies, she says. Thus, UML became a lynchpin of ODM, the OMG’s Ontology Definition Metamodel, its standard for model-driven ontology development. UML is actually a family of metamodels that have the ability to use UML in conjunction with RDF and OWL, as well as topic maps and Common Logic, a first-order logic language with expressive capabilities that OWL by itself lacks.
“Ultimately, all software and service-development processes will have to take a model-driven approach for scalability purposes,” Kendall says, as technologies such as RFID build up the data in corporate vaults. “There is no way to hard code all that. You have to be able to generate software, and common semantics should drive it.”
As Kendall explains it, the Ontology Definition Metamodel helps create a bridge over a gap that the W3C standards don’t address. “They don’t address traditional software engineering. They only address the stuff that is relevant for the web. That’s a big gap,” she says. ODM “lets you use the semantics with UML so that you can integrate them into your general software modeling and software engineering framework.”
Another gap is around ER diagramming, for building databases, which is where OMG’s new IMM (Information Modeling and Management) standard comes in. Using technologies that support this standard, organizations may find a way out of their database nightmares. So many large companies are running thousands of databases that they can’t turn off for operational reasons, but neither do they really understand what’s in them or how they work, and the people who built them are long gone.
“To understand what you have you must extract the semantics,” says Kendall. “Then you have to manipulate those semantics using semantic technologies, to clean them up, make them consistent, understand them, and then use them as a basis for going back to the database world and mapping across your databases,” she says. As a result of cracking this nut, companies potentially could turn off many of these mystery databases and save the millions of dollars they spend annually to maintain them. “People outside of the research community didn’t get that bridge between the database development world and the semantic world until relatively recently,” Kendall says.
Another gap to bridge: the business rules community. Business rules engines, Kendall notes, traditionally have been designed by business analysts trying to automate processes or individuals working on specific applications, like fraud detection in banking. Rarely do they use the same terminology or vocabulary, and often neither database and or IT personnel have a hand in their development. “The bridge between business rules development and other places in the business should be based on the same vocabulary, but because they are silo’d and maintained by different organizations, it’s not,” she says. “Through ODM we can make connections to to emerging business rules standards coming out of the W3C and OMG.” These include PRR (Production Rule Representation) and SBVR (Semantics for Business Vocabulary and Rules).
Buttoning down standards and specifications is just the half of it, though.
“The really important work is still to come,” says Kendall. “It’s the products that come out of these mappings that let you take advantage of semantics in other kinds of software engineering activities, in database alignment, in eliminating redundancy, or other kinds of things like that that will make this real to more people.”
Sandpiper, for instance, currently has a tool for building ontologies in UML, but it’s also working now on a second-generation framework that will incorporate more automation, such as automated integration with reasoning engines for validity and consistency checking, and also integration with back end knowledge bases.

Semantic Tech & Business Conference is returning to San Francisco in June! Join us from June 3-7 for complete coverage of Big Data, Linked Data, Extreme Information Management, and Semantic Web. From breakthrough approaches to solving business problems to the big data implications of fast–evolving technologies, SemTechBiz provides you with an unparalleled interactive experience and delivers tangible business value. We're offering a special early rate when you register by February 17.

Eric Franzon
VP Community
Jennifer Zaino
Contributor
Angela Guess Contributor
semanticweb.com Twitter feed loading...