Posts Tagged ‘data.gov’

Navigating The World Of Open Data On The Web

At a session discussing open data on the web at the Semantic Technology and Business Conference last week, W3C eGov consultant Phil Archer had this to say: That in his mind and the minds of the semantic web technology business people gathered at the event, “Open data is strongly associated with Linked Data, but the world doesn’t necessarily agree with us.”

What they are thinking about: “JSON and CSVs are the kings,” he said. “If you look at open data portals, CSVs [which get converted to JSON files] outweigh Linked Data by a mile,” he noted. And, he said, religious wars between those who see the world as triples vs. CSVs won’t be good for anyone. “If we keep telling the public sector to aim for 5-star data, vs. CSV 3-star data, we are in danger of the whole open data movement collapsing.”

No one wants that, and to address the big picture of realizing the promise of open data, April saw The Open Data on the Web workshop take place. It was organized by the W3C, the Open Data Institute, founded by Sir Tim Berners-Lee and Professor Nigel Shadbolt, and the Open Knowledge Foundation.

Read more

White House Changing Policies to Open Up More Data…

…and make it machine readable. Jason Miller of Federal News Radio reports, “White House technology leaders are close to issuing a new policy that will change the way agencies release data to the public. Todd Park, the federal chief technology officer, said Friday the new policy is one of several steps to spur the release of more data from agencies. ‘We are going to continue to enlist additional federal agencies in the open data initiatives program as fast track liberators of key existing data sets that could create large scale economics benefit while protecting privacy,’ Park said at the President’s Council of Advisors on Science and Technology meeting in Washington. ‘We also, as per the recently announced Digital Government Strategy just this past summer, with OMB will be releasing policy soon that makes open and computer readable the default status of new data created by the government going forward’.” Read more

Data.gov Moving to Open Source Platform

The team at Nextgov reports, “The team that manages Data.gov is well on its way to making the government data repository open source using a new back-end called the Open Government Platform, officials said during a Web discussion Wednesday. The governments of India and Ghana have already launched beta versions of their data catalogues on the open source platform, said Jeanne Holm who heads the Data.gov team. Government developers from the U.S. and India built the OGPL jointly. They posted it to the code sharing site GitHub where other nations and developers can adopt it as is or amend it to meet their specific needs.” Read more

Good-Bye to 2012: Continuing Our Look Back At The Year In Semantic Tech

Courtesy: Flickr/LadyDragonflyCC <3

Yesterday we began our look back at the year in semantic technology here. Today we continue with more expert commentary on the year in review:

Ivan Herman, W3C Semantic Web Activity Lead:

I would mention two things (among many, of course).

  •  Schema.org had an important effect on semantic technologies. Of course, it is controversial (role of one major vocabulary and its relations to others, the community discussions on the syntax, etc.), but I would rather concentrate on the positive aspects. A few years ago the topic of discussion was whether having ‘structured data’, as it is referred to (I would simply say having RDF in some syntax or other), as part of a Web page makes sense or not. There were fairly passionate discussions about this and many were convinced that doing that would not make any sense, there is no use case for it, authors would not use it and could not deal with it, etc. Well, this discussion is over. Structured data in Web sites is here to stay, it is important, and has become part of the Web landscape. Schema.org’s contribution in this respect is very important; the discussions and disagreements I referred to are minor and transient compared to the success. And 2012 was the year when this issue was finally closed.
  •  On a very different aspect (and motivated by my own personal interest) I see exciting moves in the library and the digital publishing world. Many libraries recognize the power of linked data as adopted by libraries, of the value of standard cataloging techniques well adapted to linked data, of the role of metadata, in the form of linked data, adopted by journals and soon by electronic books… All these will have a profound influence bringing a huge amount of very valuable data onto the Web of Data, linking to sources of accumulated human knowledge. I have witnessed different aspects of this evolution coming to the fore in 2012, and I think this will become very important in the years to come.

Read more

Summary of 11th International Semantic Web Conference

Big Graph Data Panel at ISWC 2012

Big Graph Data Panelists (L to R): Mike Stonebraker, John Giannandrea, Bryan Thompson, Tim Berners- Lee, Frank van Harmelen

Last week, the 11th International Semantic Web Conference (ISWC 2012) took place in Boston. It was an exciting week to learn about the advances of the Semantic Web and current applications.

The first two days, Sunday November 11 and Monday November 12, consisted of 18 workshops and 8 tutorials. The following three days (Tuesday November 13 – Thursday November 15) consisted of keynotes, presentation of academic and in-use papers, the Big Graph Data Panel and industry presentations. It is basically impossible to attend all the interesting presentations. Therefore, I am going to try my best to summarize and offer links to everything that I can.

Read more

Understanding Machine Readability

Jim Hendler and Theresa A. Pardo have created a primer on machine readability for online documents on Data.gov. The pair explain, “Historically, efforts to make government information available to the public have focused on pushing static information about government programs and services to the web. The intended user has been a human who can read, print, and take actions based on reading the material or by engaging in a form-based transaction. In some cases, users were able to query the data or map the results using sophisticated geospatial displays. Access to the data itself, on the other hand, was rarely provided.” Read more

Linked Open Government Data: Dispatch from the Second International Open Government Data Conference

“What we have found with this project is… the capacity to take value out of open data is very limited.”

With the abatement of the media buzz surrounding open data since the first International Open Government Data Conference (IOGDC) was held in November 2011, it would be easy to believe that the task of opening up government data for public consumption is a fait accompli.  Most of the discussion at this year’s IOGDC conference, held July 10-12, centered on the advantages and roadblocks to creating an open data ecosystem within government, and the need to establish the right mix of policies to promote a culture of openness and sharing both within and between government agencies and externally with journalists, civil society, and the public at large.   According to these metrics the open government data movement has much to celebrate:  1,022,787 datasets from 192 catalogs in 24 languages representing 43 countries and international organizations.

The looming questions about the utility of open government data make it clear, however, that the movement is still in its early stages.    Much remains to be done to to provide usable, reliable, machine-readable and valuable government data to the public.

Read more

Datasets Addition Promising Extension For Schema.Org

A call for comments is out for a proposal for a ‘Datasets‘ addition to schema.org, via the W3C’s Web Schemas task force group that is used by the schema.org project to collaborate with the wider community.

The proposal extending schema.org for describing datasets and data catalogs introduces three new types, with associated properties, as follows:

Writing at the Schema.org blog, Dan Brickley calls it a “small but useful vocabulary,” with particular relevance to open government and public sector data.

Read more

EPA to Publish Linked Open Data

Data.gov reports that the EPA is moving ahead with a plan to start publishing linked open data. According to the article written by Michael Pendleton, “Making environmental data available for reuse by others is a core strategy EPA uses to help meet its mission of protecting human health and safeguarding the environment.  EPA has a long tradition of publishing data – notable examples include Toxics Release Inventory (TRI) data, as well as Envirofacts – a data warehouse that provides public access to data extracts from Agency program information systems.  EPA is also helping to support data discovery through the Environmental Data Gateway, a centralized catalog for EPA data sets.” Read more

Closing In On A Million Open Government Data Sets

A million data sets. That’s the number of government data sets out there on the web that we have closed in on.

“The question is, when you have that many, how do you search for them, find them, coordinate activity between governments, bring in NGOs,” says James A. Hendler, Tetherless World Senior Constellation Professor, Department of Computer Science and Cognitive Science Department at Rensselaer Polytechnic Institute, and a principal investigator of its Linking Open Government Data project lives, as well as Internet web expert for data.gov, He also is connected with many other governments’ open data projects. “Semantic web tools organize and link the metadata about these things, making them searchable, explorable and extensible.”

To be more specific, Hendler at SemTech a couple of weeks ago said there are 851,000 open government data sets across 153 catalogues from 30-something countries, with the three biggest representatives, in terms of numbers, at the moment being the U.S., the U.K, and France. Last week, the one million threshold was crossed.

Read more

NEXT PAGE >>