Posts Tagged ‘Giovanni Tummarello’

SindiceTech Relaunch Features SIREn Search System, PivotBrowser Relational Faceted Browser

sindiceLast week news came from SindiceTech about the availability of its SindiceTech Freebase Distribution for the cloud (see our story here). SindiceTech has finalized its separation from the university setting in which it incubated, the former DERI institute, now a part of the Insight Center for Data Analytics, and now is re-launching its activities, with more new solutions and capabilities on the way.

“The first thing was to launch the Knowledge Graph distribution in the cloud,” says CEO Giovanni Tummarello. “The Freebase distribution showcases how it is possible to quickly have a really large Knowledge Graph in one’s own private cloud space.” The distribution comes instrumented with some of the tools SindiceTech has developed to help users both understand and make use of the data, he says, noting that “the idea of the Knowledge Graph is to have a data integration space that makes it very simple to add new information, but all that power is at risk of being lost without the tools to understand what is in the Knowledge Graph.”

Included in the first round of the distribution’s tools for composing queries and understanding the data as a whole are the Data Types Explorer (in both tabular and graph versions), and the Assisted SPARQL Query Editor. The next releases will increase the number of tools and provide updated data. “Among the tools expected is an advanced Knowledge Graph entity search system based on our newly released SIREn search system,” he says.

Read more

Big Data Is Big Focus At SemTechBiz (Part 2)

LOGO: Semantic Technology & Business Conference; June 2-5, 2013, San Francisco, CaliforniaOur discussion of Big Data at SemTechBiz, begun here, continues:

The Enterprise Linked Data Cloud Needs Semantics, And More

Another exploration of Big Data’s intersection with semantic technology will take place at this session, where Dr. Giovanni Tummarello, senior research fellow at DERI and CTO of SindiceTech, will talk about the former becoming an enabler for the latter to be really useful in enterprises. “A lot of people say it’s via Big Data that semantic technologies like RDF will see a coming of age and clear applications in certain industries,” he says. There’s value to adding data first and understanding it later, and to that end, “semantic technologies give you the most agile tool to deal with data you don’t know, where there’s a lot of diversity, and you don’t know what of it particularly will be useful.”

Read more

Dandelion Geo And Linked Data Marketplace Private Beta On The Way

This week Dandelion, which bills itself as the one-stop shop for smart, high-quality Geo and Linked Data from trusted sources, starts its private beta. The service, which promises end users quality, normalized, linked and enriched data for their apps and reports; developers a simple API for any kind of language on any kind of platform; and corporate and government entities a way to publish and profit from their data, comes from SpazioDati.

That company is the creation of four Italian entrepreneurs – CEO Michele Barbera, president Gabriele Antonelli, partnerships director Andrea Di Benedetto, and Luca Pieraccini – who lived first-hand the frustrating experience of trying to find and leverage useful data for the custom web and mobile apps they were developing while running and working in small IT consulting companies. In an attempt to reverse the ratio of finding and cleaning data to actually building apps, says Barbera, the founders began participating in several EU-funded research projects and in the Open Data movement in Europe and Italy, including founding the non-profit Linked Open Data Italy. They also started experimenting with Semantic Web technologies.

“Open Data helps us to find valuable data and to build value-added web and mobile apps,” says Barbera. “So, let’s say that we solved partly the first problem of finding data, but not the second one, normalizing and cleaning data, since it is still very difficult to merge different data sources to put data in context.” Read more

On What Shores Will Semantic Tech Be Better Commercialized?

Courtesy: Flickr/Images of Money

Where is semantic technology better poised to be better commercialized – the U.S. or Europe? With The Semantic Business and Technology conference heading across the ocean to the U.K. next month, it seems a good time to provide some perspective on the question.

At the last SemTech conference in San Francisco, 3RoundStones took first place in the Startup competition with Callimachus Enterprise. During a discussion of some of the product’s winning features, talk turned to some of the differences between how semantic technology has progressed in the States and overseas.

Read more

Unclogging the Data Pipeline

Giovanni Tummarello has written a new article for Sindice discussing how the company “ingested 100M semantic documents in a day.” He writes, “First: build an infrastructure to process millions of documents. Instead of just doing it home-brew… do your big data homework, no shortcuts. Second: unclog some long standing clogged pipe. The [resulting] feeling is that of ‘it all makes sense’ and it happened to us the other day when we started the dataset indexing pipeline with a queue of a dozen large datasets… After doing that, we just sat back and watched the Sindice infrastructure, [which] usually takes 1-2 million documents per day, reason and index 50-100 times as much in the same timeframe, no sweat.” Read more

SindiceTech Helps Enterprises Build Private Linked Data Clouds

Last week The Semantic Web Blog covered the launch of the SindiceTech Assisted SPARQL Editor as an open source project, noting that SparQLed also is part of SindiceTech’s commercial suite for large enterprises building private linked data clouds. This week, we’ll dive a little deeper into SindiceTech and its progress since the founders of the Sindice web of data search engine turned their attention to focusing on the commercial application of its technology as a real-time semantic warehousing infrastructure, which leverages cloud computing for integrating and normalizing the massive amounts of data the enterprise must deal with.

 

As SindiceTech founder and CEO Giovanni Tummarello explains, companies actually approached his team to help them make a reality of their visions to use RDF and SPARQL, as the best knowledge representation and querying technologies available, by providing the missing scalability and stability. Sindice.com was evidence that the technology the team had developed could answer these enterprises’ needs; currently there are about 700 million semantically marked-up web pages indexed in the Sindice.com search engine, with a live updated index of some 80 billion triples daily. Its database is over 5 terabytes.

Read more

SindiceTech Releases SparQLed As Open Source Project To Simplify Writing SPARQL Queries

(Editor’s Note, June 29: The SparQLed project URL now is available here.)

SindiceTech today released SparQLed, the SindiceTech Assisted SPARQL Editor, as an open source project. SindiceTech, a spinoff company from the DERI Institute, commercializes large-scale, Big Data infrastructures for enterprises dealing with semantic data. It has roots in the semantic web index Sindice, which lets users collect, search, and query semantically marked-up web data (see our story here).

SparQLed also is one of the components of the commercial Sindice Suite for helping large enterprises build private linked data clouds. It is designed to give users all the help they need to write SPARQL queries to extract information from interconnected datasets.

“SPARQL is exciting but it’s difficult to develop and work with,” says Giovanni Tummarello, who led the efforts around the Sindice search and analysis engine and is founder and CEO of SindiceTech.

Read more

Bing Brings It On (RDFa, That Is)

The Twittersphere is buzzing about the Semantic Web at last grabbing onto the hearts and minds of the whole web community. It started off with a tweet from Juan Sequeda – a contributor to The Semantic Web Blog and a well-known figure in our area – that reads:

 

 

 

 

A follow-up message explains:

 

 

 

Follow that link and you’ll find yourself at a Bing webmaster help site that indicates Microsoft wants to play nice with whatever markup approach webmasters want to implement – microdata, microformats, or RDFa. The site mark-up overview on the page referenced says that Bing’s “crawlers do not prefer one specification over another. It’s entirely up to you to decide which of the supported specifications best fits your data.

Read more

Sindice Puts The Web of Data At Your Disposal


Sindice
Ltd. launched as a startup company this week, complete with a publicly available beta SPARQL endpoint to its indexed and live-updated dataset of some 12 billion triples. Next week will see Sindice –which began as a joint academic research project among DERI, the Fondazione Bruno Kessler and OpenLink Software to collect, search, query and build applications on top of semantically marked up Web data — deliver formal support for Schema.org.

Sindice, of course, is agnostic when it comes to ingesting semantic markup formats. Supporting new formats is just a matter of syntax adaptation for the service. Whatever format a web site decides to employ — from RDF to RDFa to microformats to microdata — Sindice has coverage of the structured web data and keeps it fresh.

The service opens up vast possibilities for business: As long as a web site structures data in one of these formats, and uses standards like Sitemaps for publishing semantic content, it can become a part of Sindice’s continuously updated repository. And thus it become a datasource for business use, one that also can join with other datasets.

Read more

SemTech 2010 Hosts Web Day 3.0 at the Annual Semantic Technology Conference, Thursday, June 24th – Limited Seating Still Available

David Recordon, Senior Open Programs Manager of Facebook and best-selling Author, David Siegel will keynote the conference.

San Francisco, CA (June 24, 2010) — SemTech 2010 is currently taking place at the San Francisco Hilton in Union Square. Web Day 3.0 will be held Thursday, June 24, and will include a number of informative presentations by industry experts. Attendees can expect to see demos of leading semantic search engines and products, hear experts discuss the technology and its impact in the industry, meet the world’s leading developers of Web 3.0, and understand how Web 3.0 technologies are changing web development.

Presented by co-chairs Tony Shaw and Dave McComb, this year’s program is the most extensive to date. Over 1,200 attendees are expected, including industry experts, researchers, analysts, content providers, developers, and technology enthusiasts.

"Interest in semantic technology is on the rise," Shaw said. "Early numbers indicate a 30% increase in conference attendance. The world wants to understand where the Web is going and semantics play an imperative role in that."

Highlights of Web Day 3.0 will include:

• Keynote by David Siegel, author of Power of Pull, Pull: The Business Shift Behind the Web
• Keynote by David Recordon of Facebook on their new Open Graph protocol
• Real-Time SemanticWeb with Twitter Annotations, Joshua Shinaview of RPI – Tetherless World Constellation and Jans Aasman of Franz
• Building Web 3.0 Web Sites, Mark Windholtz of AgileDNA, John Walker of Semantic Clarity, and Mark Birbeck of Backplane, Ltd.
• Applying Semantic Extensions and New Service to Drupal, Daniel Hladky of Ontos AG
• Comprehensive Validation: Pragmatically Joing the “Web of Data”, the Easy Supported Way, Giovanni Tummarello of Digital Enterprise Research Institute (DERI)
• Drupal and the Semantic Web: from RDF to Whitehouse.gov, Jeff Walpole of Phase2 Technology
• Semantic Web Tools for Video and Social Change, Hannah Eaves of ViewChange.org
• Google Rich Snippets, Kavi Goel and Pravir Gupta of Google
• The Personal Data Locker, David Boardman of Atigeo, David Siegel author of The Power of Pull, Von Wright of AT&T, Drummond Reed of International Card Foundation & Open Identity Exchange, Chris Messina of Google, and Kaliya Hamlin of Identity Commons
• Search Day – featuring a dozen search sessions
• PitchSlam – Startups pitch their ideas to VCs and analysts
• Mergers and Acquisitions panel with VCs, acquirers, and acquired companies
• Exhibits open all day

SemTech represents a full spectrum of business, government, and consumer activity taking place within the field of semantic technology and brings together the most practical and advanced solutions in the industry.

To register for the conference and to see a full schedule, visit http://semtech2010.semanticuniverse.com To register for press credentials, visit http://semtech2010.semanticuniverse.com/press.htm

About SemTech
SemTech is the world's largest conference on semantic technologies. It is the only event focused on the commercialization of semantic technologies, bringing the most recognized names, knowledge, leadership, and innovation in the semantic community together at one location. SemTech is owned and operated by Semantic Universe–the semantic community’s trusted information resource. For more information please visit SemanticUniverse.com, email info@semanticuniverse.com, or call (310) 337-2616.