Archives: September 2007

Read/Write/Request Web, What Is It?

The name Read/Write/Request Web is an extension of Read/Write Web, which often refers to Web 2.0. With Web 2.0, both web readers and web writers can simultaneously read and write to a same web space. Using this type of connection, the Web becomes not only a network of human publications but also a network of humans themselves. In comparison, these connections between web readers and writers were generally disconnected during the pre-Web 2.0 age. Web 1.0 writers basically had no way to know who read their pages, and readers also could hardly share comments with writers directly. This is why Web 2.0 has the nickname Read/Write Web.

Read more

Announcing Semantic Tech & Business Conference - San Francisco 2012

Semantic Tech & Business Conference is returning to San Francisco in June! Join us from June 3-7 for complete coverage of Big Data, Linked Data, Extreme Information Management, and Semantic Web. From breakthrough approaches to solving business problems to the big data implications of fast–evolving technologies, SemTechBiz provides you with an unparalleled interactive experience and delivers tangible business value. We're offering a special early rate when you register by February 17. Sign up now!

Semantic Web Goes to Court

Jennifer Zaino
SemanticWeb.com Contributor

The government has been one of the early sectors to adopt semantic web technologies. As you may know, the federal government had a good deal to do with the semantic web’s development in the first place, with the DARPA Agent Markup Language (DAML), designed by the Defense Research Projects Agency as an extension to XML and RDF.

In addition to the feds, state and local governments aren’t passing the semantic web by, either. Colin Britton, co-founder and CTO of Metatomix, says the company was founded on the idea of having a set of capabilities around a metadata-driven approach to managing information that would have value for dealing with problems related to business integration. Along the way, Metatomix discovered that state and local governments were excellent prospects, because unlike commercial enterprises they can’t always avail themselves of technologies such as data warehouses to solve some data integration problems. Political, privacy, and legal issues are an impediment.

“I would feel uncomfortable if the state of Massachusetts brought together everything they could find about me and put it in one database,” says Britton.

But as soon as someone is accused of doing something wrong, the game changes. This propels the need for what are termed “integrated justice” systems, where data about an individual existing within multiple sources of information can be quickly pulled together for a specific judicial purpose, and an audit trail can be maintained of which sources data was drawn from and in what context.






More From Jupitermedia


The Semantic Web and Your Intranet

The Business Case for Semantic Web

The Semantic Web: Are You Scared Yet?

BSM
and the Semantic Web: Parallel Courses

If you want to comment on
these or any other articles you see on Intranet Journal, we’d like to hear
from you in our IT Management
Forum
. Thanks for reading.

- Tom Dunlap, Managing
Editor
.

“If I did something wrong the judge should know all about me to make an appropriate decision,” Britton says.

Traditionally, the judge relied on that information being gathered on a per case basis by humans, swivel-chairing between multiple systems to pull a report together on an individual. That system has its flaws: Depending on the researcher’s case load, a judge might not get as much granular information on a particular individual, for example. Plus, the job itself is time-consuming, and takes staff resources away from other work they might be doing.

What was wanted — and what semantic technologies can provide — is a way to have a fuller view of an individual by looking at more data sources, in near real time. The state of Florida became the first in the nation to deploy the Judicial Inquiry System based on Metatomix’ semantic technology for integration. The state says this streamlines information from a variety of Florida agencies into a single, central dashboard accessible by judges and other related personnel, while also allowing those agencies to retain control over their individual database content.

Currently, more than 4,000 individuals at agencies in Florida use the technology, Metatomix says, and data also can be pulled into the system from other states, county, and federal agencies, because its solution supports standards for data exchange in the judicial world, such as NIEM (National Information Exchange Model). Metatomix cites success metrics such as reducing the processing time for building a first appearance docket from 45 minutes down to 3.

“But the anecdotal facts — less-hard ROI — are more compelling,” says Britton. Those are the ones in which judges have been able to have a bevy of information at their fingertips to determine the veracity of an individual. “They can go and check vs. taking something on face value when making assessments about bail or child custody or all sorts of things judges must make decisions on,” he says. “More accurate information equals more informed decisions, which reduces the risk of people being put in harm’s way.”

This month, the company released Version 5.0 of its semantic middleware platform. The company says the new version features service oriented architecture (SOA)-enabling technology, new semantic services functionality (including full support of the W3C standard for SPARQL Protocol and RDF Query Language), administrative and performance enhancements, and standards support, to help customers leverage semantic technology to integrate data, uncover and define information relationships, and provide business applications with meaning and actionable insight. The company also announced that it will enable its semantic middleware platform on the recently announced Oracle Database 11g.

Why You Need to Share Metadata

Uche Ogbuji
SemanticWeb.com Contributor

My background as a consultant is in the intersection of enterprise management of structured data, unstructured data, applications, and services. These have all been separate concerns for too long, claimed respectively by enterprise DBAs, content managers, software architects, and business process managers.

More and more, people are understanding how these disciplines need to come together, as evidenced by the emergence of service-oriented architecture — which aligns applications more closely with services — and the effort of enterprise database systems to accommodate unstructured data. I’ve watched this slow convergence, but since it comes too slowly to meet many practical needs in business, I’ve often had to find my own strategies for connecting these worlds. In my experience the sweet spot for such work depends on the nature of the organization, but most often lies in content management (CM).

More From Jupitermedia

The Semantic Web and Your Intranet

The Business Case for Semantic Web

The Semantic Web: Are You Scared Yet?

BSM
and the Semantic Web: Parallel Courses

If you want to comment on
these or any other articles you see on Intranet Journal, we’d like to hear
from you in our IT Management
Forum
. Thanks for reading.

- Tom Dunlap, Managing
Editor
.

The goal of CM is to establish a writing shop, publishing house, and library for an organization. The material produced might be for communication to the outside, Web publishing for example, or it might be for internal knowledge, as in the case of enterprise CM.

The hardest problem in managing such a combination of concerns is maintaining agreements and policies, which is why most CM processes and technology focus on workflow. CM workflow is now a fairly mature science, and it fits comfortably into related business processes. From an information management point of view, however, CM tools are just beginning to seek differentiation by their ready integration into structured databases, and general applications and services.

The key to such integration is in sharing content metadata with other systems. The richer the metadata shared, the more value created in the integrated result. Richness is a matter of how well the metadata is placed in context. Content tagged with a simple string “release” might indicate that it’s written as a press release, or perhaps that it relates to software releases. Rich metadata minimizes ambiguity of such details, and clarifies the relationship between content and each property.

Read more

Microformats: Toward a Semantic Web

Sean Michael Kerner
SemanticWeb.com Contributor


In the beginning of the Internet revolution, being connected was all about
the simple hyperlink. Like magic with one click a Web browser could be
transported anywhere on the Internet that content lives.


But what about context? What about connecting related information and
content in a semantic context? The hyperlink of the early era isn’t
enough.


Enter , which offer the promise of helping Web
content owners enable users to connect the disparate dots that connect
content in a semantic way. To be more precise and borrowing from the
official Microformats.org definition, “Microformats are small bits of HTML
that represent things like people, events, tags, etc. in Web pages.”


Though the term “microformats” may not yet be mainstream, mainstream vendors
have taken notice. Big names like Technorati, Mozilla, IBM, Microsoft,
Google, Digg, and Yahoo among countless others are all at work trying to
make microformats work. By some estimates there are already hundreds of
millions of microformatted pieces of information online.

“At this point, nearly every Web designer that learns about microformats
starts using them,” Tantek Çelik,
chief technologist at Technorati and co-founder of Microformats.org, told
InternetNews.com. “Because microformats require only some HTML
authoring ability, millions of Web authors and designers are able to use
them immediately. This is a much lower barrier to entry than many previous
Web technologies, such as XML and RSS, which require the skills of a
programmer.”

Where are microformats used today?


Technorati uses microformats throughout its products, including its main blog
search portal. Technorati tags are built from the rel-tag
microformat, which enables bloggers to “tag” their individual blog posts with
categories/keywords relevant to the posts in a visible manner. Technorati
also publishes microformats, like hCard on its contact page and on users’
profile pages in support of social-network portability.


The new Digg user profiles support the hCard microformat, as do the new
Google Sharing user profiles. Yahoo’s Flickr photo-sharing site also
extensively makes use of microformats, including hCard, as well as adr and
geo specifications for locations.


Microformats.org currently recognizes nine specifications for microformats:
hCalendar, hCard, rel-license, rel-nofollow, rel-tag, VoteLinks, XFN,
XMDP and XOXO. There are drafts for 11 additional specifications, some of
which are already in wide use: adr, geo hAtom, hResume, hReview,
rel-directory, rel-enclosure, rel-home, rel-payment, robots exclusion
and xFolk.


Though microformats enable semantic Web connections, Mozilla’s User
Experience Designer Alex Faaborg explained that microformats are sometimes
referred to as the lower-case semantic Web, since they are not as complex or as
expressive as RDF (define) and OWL (define).


“While microformats are less formal, they are also easier to author, and the
semantic information is human readable, in addition to being machine
readable,” Faaborg said. “But it isn’t about one approach being better than
the other, as much as each approach being useful in different situations.”

Technically speaking, though microformats and the Semantic Web are now
actually interoperable as the W3C has announced that (Gleaning Resource Descriptions from Dialects of Languages) now extracts data from microformats and make it part of the Semantic Web.


This article was first published on InternetNews.com. To read the full article, click here.

Microformats Hop on Semantic Web ‘Griddle’

Sean Michael Kerner
SemanticWeb.com Contributor

There are a lot of different ways to connect the dots of content and data that comprise the Web. Two of them are microformats (define) and the GRDDL (Gleaning Resource Descriptions from Dialects of Languages) Semantic Web construct.

Both approaches have attempted to make sense of content by making use of metadata descriptors, but they differ somewhat. The World Wide Web Consortium (W3C) is trying to change that.

“Microformats and the Semantic Web always had a lot in common at a high level, but GRDDL fills in a few technical details so that they interoperate at a practical, running-code level,” Dan Connolly, the W3C GRDDL Working Group staff contact, told InternetNews.com.

While microformats use XHTML and HTML, GRDDL (pronounced “griddle”) extracts RDF (Resource Description Framework) data from XML so that it can be transformed or understood by other applications in a mashup or other application settings.

The way that GRDDL will work with microformats will require some effort from microformat authors.

“Microformat authors that want their data to integrate seamlessly with other semantic Web data should use well-formed XHTML and profile URIs,” Connolly explained. “For example, the dbpedia project takes millions of facts from Wikipedia and exports them using URIs and RDF and SPARQL.”

Connolly explained that the HTML specification lets authors use any name they like for conventions for class, as well as rel attribute values for styling and other processing. The conventional microformat names are hcard, hcalendar, summary and description, among others.

HTML has a special “profile” hook to use a URI to take those document-local conventions and share them in the Web. Some microformat documents use the profile hook, but not all do.

GRDDL links the profile documents to XSLT transformations that allow machines to exploit microformats and other syntax conventions.

The GRDDL specification has been under review for nearly a year. Connolly noted that early adopters have been participating in working out the kinks and finishing the test suite.

“At this point, the design has been through the entire W3C process, and it is stable and ready for widespread deployment,” Connolly said.

So what’s next for GRDDL?

“What’s next is to tweak flight itineraries and soccer schedules so that I can stop copying them by hand, one field a time, and start letting the computer manage the data for me,” Connolly said. “That’s what GRDDL and microformats and RDF and URIs are all about.”


This article first appeared on internetnews.com.

Oracle Sees Semantic Tech Solving Business Problems

Jennifer Zaino
SemanticWeb.com Contributor

Oracle is pushing hard on the application of semantic technologies to enterprise business problems. bITa Planet recently caught up with Robert Shimp, vice president of Oracle’s global technology business unit, to discuss what problems Oracle believes semantic technologies can solve for customers — and how.


bITa Planet: How does Oracle support semantic technologies?

Shimp:

Today we not only support the core data standards for the semantic web inside our database, but our application integration architecture, or AIA, offering uses semantic technology. We are already shipping that capability today as an integration capability for our applications. You’ll see more and more semantic technologies weaving its way into products as we go along.

We have a fundamental advantage in that with enterprise applications, you know an awful lot about the business process going on, the people that are using it, and so on, and that means the application can have a lot of metadata that it knows about what’s going on. The fact that we can store that metadata inside the database means we can provide far richer transactions and business processes for customers. It’s that ability to provide both the applications and the database that gives us that advantage.


How did Oracle get out in front on the semantic technology?

Oracle is the leading supplier of database technology for geographic information systems. We’ve been doing that for at least a decade or more. It turns out that maps or geographic information systems use a mathematical model called a network model. In other words, basically a map is all about different points and locations and the distance between points. It turns out that semantic technology uses exactly the same mathematical model to represent the links between concepts and ideas. So as we developed GIS technology over the years, it was relatively easy for the same development team to supply this same technology for semantic technology purposes.


Among the first companies Oracle worked with that were interested in applying semantic technologies to their business problems were those in the pharmaceuticals industry and government. How is the make-up of companies interested in this changing?

We’re now seeing lots of customers in different areas – financial services, pharmaceuticals companies, large manufacturers in the airplane or automotive industries – using semantic technologies for many different enterprise purposes. It boils down to having lots of data sources to reconcile information, like parts numbers to understand real time data flows, or obscure proteins in the drug discovery process. Semantic technology is very powerful for creating inferences about data and organizing the information in new and interesting ways.

The two main business problems that mainstream types of companies deal with are having lots of sources of customer names and addresses and trying to reconcile them out of their CRM and support systems and order processing systems — who is the customer and ensuring I have one customer record. That’s a very typical example. There are multiple databases of sources that were never designed to work together but you must be able to pull data out of them and rationalize the information. Traditionally that’s been done using ETL technologies and data hubs and data warehousing technologies.

The other common business problem is parts – lots of parts and suppliers, with different numbers even if they’re very similar. Trying to keep track of what’s what and what you can order from someone is very complicated. Semantic technology lets you say this part from supplier one is equivalent to this part from the other supplier.

There are even bigger possibilities for semantic technology. Think about business intelligence … What if BI tools could tap into existing data and provide you intelligence in real time. There are huge opportunities there to increase the value of your IT.

Read more

A $3.5 Billion Semantic Distinction

Steven Pinker is well on his way to a best selling book on, of all things: semantics. I haven’t finished it yet, but am finding it fascinating. What is surprising me is how popular it is. Must be a lot more armchair ontologists out there than I thought.
One of the things that sent me to the blog, though, was his quantification of a semantic distinction. It turns out that the insurance policy on the World Trade Center has a maximum payout of $3.5 billion per event. The $3.5 billion dollar semantic question is: did the two plane crashes constitute one event or two?

Stuff_of_thought Slightly longer review

Read more

A $3.5 Billion Semantic Distinction

Steven Pinker is well on his way to a best selling book on, of all things: semantics. I haven’t finished it yet, but am finding it fascinating. What is surprising me, is how popular it is. Must be a lot more armchair ontologists out there than I thought.

Read more

What Is the Resource Description Framework?

By Michael C. Daconta, Leo J. Obrst, and Kevin T. Smith

At the simplest level, the Resource Description Framework (RDF) is an XML-based language to describe resources. While the definition of “resource” can be quite broad, let’s begin with the common understanding of a resource as an electronic file available via the Web. Such a resource is accessed through a Uniform Resource Locator (URL). While XML documents attach metadata to parts of a document, one use of RDF is to create metadata about the document as a stand-alone entity. In other words, instead of marking up the internals of a document, RDF captures metadata about the “externals” of a document, like the author, the creation date, and type. A particularly good use of RDF is to describe resources, which are “opaque” like images or audio files. Figure 1 displays an application, which uses RDF to describe an image resource.

The RDFPic application is a demonstration application developed by the W3C to embed RDF metadata inside JPEG images. The application can work in conjunction with the W3C’s Jigsaw web server to automatically extract the RDF metadata from images stored on the server. As you see in Figure 1, the application loads the image on the right side and allows data entry in a form on the left side. The tabbed panels on the left side allow you to load custom RDF schemas to describe the image. The two built-in schemas available for describing an image are the Dublin Core (www.dublincore.org) elements and a technical schema with metadata properties on the camera used. Besides embedding the metadata in the photo, you can export the RDF annotations to an external file, as shown in Listing 1.

image title

Figure 1. An RDFPic application describing an image. RDFPic is copyrighted by the World Wide Web Consortium. All Rights Reserved. www.w3.org/Consortium/Legal/

image title

Figure 2. The RDF triple.

The first thing you should notice about Listing 1 is the consistent use of namespaces on all elements in the listing. In the root element , four namespaces are declared. The root element specifies this document is an RDF document. An RDF document contains one or more “descriptions” of resources. A description is a set of statements about a resource. The element contains an rdf:about attribute that refers to the resource being described. In Listing 1, the rdf:about attribute points to the URL of a JPEG image called shop1.jpg. The rdf:about attribute is critical to understanding RDF because all resources described in RDF must be denoted via a URI. The child elements of the Description element are all properties of the resource being described. Two properties [to note are] one in the Dublin Core namespaces and one in the technical namespace. The values of those properties are stored as the element content. In summary, this code demonstrates a syntax where we describe a resource, a resource’s properties, and the property values. This three-part model is separate from the RDF syntax. The RDF syntax in Listing 1 is considered to be one (of many) serializations of the RDF model. Now let’s examine the RDF model. The RDF model is often called a “triple” because it has three parts, as described previously. Though described in terms of resource properties in the preceding text, in the knowledge representation community, those three parts are described in terms of the grammatical parts of a sentence: subject, predicate, and object. Figure 2 displays the elements of the tri-part model and the symbology associated with the elements when graphing them.

Editor’s Note: This article is an excerpt of Chapter 5, “Understanding the Resource Description Framework,” in The Semantic Web: A Guide to the Future of XML, Web Services, and Knowledge Management (Wiley Publishing 2003).

This excerpt first appeared on Devx’s Semantic Web Zone. Go there for the full excerpt.

Read more

The Semantic Web and Your Intranet

Paula Gregorowicz
SemanticWeb.com Contributor

Much like in the music industry where there are so-called “overnight sensations” that toiled for years to create their success, web technology seems to sprout the “next big thing” out of nowhere. At least that is how it feels to me with the semantic web.

What is the semantic web, you say? According to Wikipedia, the semantic web is an evolving extension of the World Wide Web in which web content can be expressed not only in natural language, but also in a format that can be read and used by software agents, thus permitting them to find, share, and integrate information more easily. It is a vision of information that is understandable by computers, so that they can perform more of the tedious work involved in finding, sharing, and combining information on the web. The semantic web derives from Tim Berners-Lee’s vision of the Web as a universal medium for data, information, and knowledge exchange.

What does that really mean?

There are basically two types of information — data stored in some sort of database and information stored in unstructured ways (think — desktop applications, audio, video, etc.). Currently web applications can query databases effectively to bring back information stored in databases. If the query matches the data, viola, you have the right results. Unstructured data is more difficult for machines. Sure we have a variety of search technologies that help to manage that informational space, but anyone who follows the developments in the search game knows that there remain many challenges to a strictly machine-based approach. (After all, if it were simple and accurate for machines, why would a company like Mahalo www.mahalo.com even get started?)

With the semantic web, in theory, technology will be able to bridge the gap between all forms of data. The secret is in the metadata and the namespaces that tell the machines what the markup in another document really means.

More From Jupitermedia

The Semantic Web and Your Intranet

The Business Case for
Semantic Web

The Semantic Web: Are You Scared Yet?

BSM
and the Semantic Web: Parallel Courses

If you want to comment on
these or any other articles you see on Intranet Journal, we’d like to hear
from you in our IT Management
Forum
. Thanks for reading.

- Tom Dunlap, Managing
Editor
.


What technology is involved?

The foundational pieces for the semantic web include URI’s (which identify resources) along with XML and namespaces. These drive the engine behind locating, marking up, and interpreting information so that machines can read and understand the data.

The OWL Web Ontology Language provides the construct for defining and instantiating web data models that need to be read and presented to machines rather than humans. What that means is that it provides the vocabulary for describing objects and how they relate.

Resource Description Framework (RDF) provides the metadata model which is a major component of the semantic web. Think of RDF as an XML-based language to describe resources. Using a subject — predicate — object format (called triples) it is used to express relationships between objects. Since the semantic web is all about defining relationships between disparate objects that can be read and interpreted by machines and presented to humans in a meaningful way you can see how RDF will drive the success (or lack thereof) of this new web.

What is the business case for semantic web?

In the article “The Business Case for Semantic Web,” the author talks about how organizations have tons of information but often don’t know how to find it, interpret it, or make meaningful connections with other, related information. This is not news as anyone involved with the web and with complex corporate intranets knows how challenging it can be to find something you need.

Read more

NEXT PAGE >>