Common Crawl has announced the winners of their first ever Common Crawl Code Contest. According to the site, “We were thrilled by the response to the contest and the many great entries. Several people let us know that they were not able to complete their project in time to submit to the contest. We’re currently working with them to finish the projects outside of the contest and we’ll be showcasing some of those projects in the near future! All entries creatively showcased the immense potential of the Common Crawl data.” Read more
Posts Tagged ‘developers’
Last week we reported here on the progress that Nova Spivack’s #OccupyTwitter petition was making in terms of attracting signatures, and on the petition’s request that Twitter clarify just what its intentions for the developer community are around its API. Many semantic and sentiment analysis applications, of course, depend heavily on the Twitter API.
Well, the end of last week saw a blog post from Michael Sippey of Twitter that provided some more information on the API issue. He wrote:
Benjamin J. Balter recently discussed the need to publish open government data that is developer-friendly. He writes, “Despite increasing public support (as well as a number of executive mandates) publishing public data in a machine-readable format is not as simple as pressing the ‘publish’ button. Why? Equally important as exposing the information itself is fostering a vibrant developer ecosystem around it. By making the publishing agency, not the public, responsible for making information immediately useful, government can lower the barriers associated with consuming its data and introduce additional citizen services at little to no cost to the agency.”
He continues, “Good, clean data may be surprisingly difficult to come by, especially when working with government systems that have been coupled together over decades. Data standards and conventions change, mechanisms of data collection evolve, and the data itself may be interpreted differently as new policies are introduced. Read more
Andrea Di Maio of Gartner recently articulated concerns about open data processing, particularly the divide between data professionals who have the skills to do so and those who do not. Di Maio writes, “Over the last four years open government and open data have been at the forefront of the debate on how governments can become more transparent, participative and efficient. The theory is well known: rather than (or alongside) providing the government’s interpretation or packaging of public data, this data should be made available in raw, open format for people to build their own views and applications… The downside is a deluge of data. People can easily drown in raw open data that is either too much or simply meaningless unless some processing takes place.” Read more
Semantic tech startup Ontodia took the grand prize at New York City’s BigApps 3.0 contest on Tuesday. As covered in this article, Ontodia’s NYCFacets is a Smart Open Data Exchange for the developer community that catalogs all the NYC-related data sources already present in the New York City Open Data Catalogue.
“Now that we’ve gotten this validation, we’ll charge full-steam ahead with our bigger vision for pragmatic Linked Big Open Data in NYC,” says Ontodia co-founder Joel Natividad.
PaySwarm recently announced a new PaySwarm Alpha for developers (read our two part series about PaySwarm here). The article states, “This is a public sandbox — a developer test ground — that implements some of the newest PaySwarm REST API features.” It continues, “The biggest change was replacing the authentication mechanism that we were using for PaySwarm. We had implemented the system last year using the OAuth protocol only to find out that it made things more complicated than they need to be for the use cases that we were trying to cover. Some developers have asked why OAuth didn’t work for PaySwarm when it works for large sites like Facebook and Twitter.” Read more
Jeni Tennison has added to her discussion of microdata and RDFa with a new post on how the two might live in harmony. Tennison writes, “One of the options that the TAG put forward when it asked the W3C to put together task force on embedded data in HTML was the co-existence of RDFa and microdata. If that’s what we’re headed for, what might make things easier for consumers and publishers who have to live in that world? In a situation where there are two competing standards, I think that developers — both on the publication and consumption sides — are going to want to hedge their bets. They will want to avoid being tied to one syntax in case it turns out that that syntax isn’t supported by the majority of publishers/consumers in the long term and they have to switch.” Read more
In addition to the new Semantic News Community Group, the W3C has launched a full range of community and business groups “so that developers, designers, and anyone passionate about the Web has a place to have discussions and publish documents.” The homepage explains, “A W3C Community Group is an open forum, without fees, where Web developers and other stakeholders develop specifications, hold discussions, develop test suites, and connect with W3C’s international community of Web experts. A W3C Business Group gives innovators that want to have an impact on the development of the Web in the near-term a vendor-neutral forum for collaborating with like-minded stakeholders, including W3C Members and non-Members.” Read more