Posts Tagged ‘data warehouse’

WEBINAR: Using Semantic Technology to Drive Agile Analytics

DATE: May 14, 2014
TIME: 2 PM Eastern / 11 AM Pacific
PRICE: Free to all attendees

This webinar has passed. A recording of the webinar will be made available On Demand within two US business days.


About the Webinar

How do you accelerate data warehousing to meet the demands of the data-driven economy? Semantic technology provides an agile platform to bring data together, focus on data that matters and ultimately derive a target data model that can be easily extended. This webinar will present a semantically-based data federation case study and highlight the semantic components that facilitate agile data federation in the enterprise.

Read more

Semantic Technology May Help NIH In Its HealthCare Advancement Mission

ashoknareOcto Consulting, a technology solutions and management services company for both the intelligence and healthcare sectors, recently published an infographic exploring the intersection among the Semantic Web, Linked Data and Health IT as it relates to accessing and interacting with data from an array of sources in the healthcare chain. “Our point of view is that in healthcare there are multiple data sources and so much data – especially when it comes to clinical trials, pharmaceuticals research and scientific data,” says CTO Ashok Nare. “It’s very possible that each of those data elements is represented in a different format, so how to take them all and connect them to ask questions you aren’t able to ask otherwise. That’s where semantic technologies are extremely useful.”

One health-care sector project in which Octo is putting semantic technologies to use these days is an effort it has underway with the U.S.’s medical research agency, the National Institutes of  Health, whose mission includes providing grants to the scientific community to engage in research and experiments “to enhance health, lengthen life, and reduce illness and disability,”  as its web site explains. Now, not only does the NIH want to understand what it’s funding and how those grants are progressing, but also  “what opportunities it may be missing out on,” Nare explains.

That means continually assessing not only what’s in its portfolio but also what research gaps there are, which requires conducting analysis on more and more data sources and investigating more queries: That could mean more development and expense, without the help of semantic web technologies.

Read more

Part II: At SemTechBiz, Enterprise IT Can Explore Reasons To Go Semantic

We continue our discussion from yesterday of what enterprise IT will learn to love about semantic technologies at the upcoming SemTechBiz conference (the story began here):

Another Score For Data Agility

Looked at from the data warehouse point of view by Thomas Kelly, Practice Director, Enterprise Information Management, for Cognizant Technology Solutions, semantic technology makes it possible to apply Agile development practices to the data warehouse itself. “You can start modeling, work with data, generate analytics and then start tuning based on what you learn,” says Kelly, who will be discussing semantic technology for the data warehousing practitioner at this session. Several semantic technology-based practices can be applied that support iterative, evolutionary improvements with little or no impact to data loading and analytics functions that were built before the refinements were made, he says.

Read more

Search And Next-Gen Big Data Apps

Search is a fundamental, a system building block, and something that should be a critical part of enterprise architectures. That’s what Grant Ingersoll, co-founder and CTO at search, discovery and analytics vendor LucidWorks – which leverages the Apache Lucene/Solr open source search project – told an audience at last week’s GigaOM Structure Data event.

The company late last year launched LucidWorks Big Data for developing Big Data applications, which builds on top of its heritage developing the LucidWorks Search solution. “It’s a platform for organizations and developers to build out next-generation data applications,” Ingersoll said in a conversation with the Semantic Web Blog. Its focus is on tight integration of key Apache open source projects and layering with a REST API, to provide developers single-source access to the stack’s richness for creating applications that provide comprehensive search, discovery and analysis of an organization’s vast content and user interactions.

LucidWorks Big Data is made up of Apache Hadoop; the Apache Mahout machine-learning library; Hive, a data warehouse system for Hadoop that facilitates data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop-compatible file systems; and Apache OpenNLP, a machine-learning based toolkit for the processing of natural language text that supports common NLP tasks.

Read more

Centralized Data Repositories: An Idea Whose Time Has Come, And Gone

Is it time for the enterprise to lose the idea of the holy grail of a central repository of multi-sourced data?

Experts in the semantic web space think that’s the case. At the recent Semantic Web Technology and Business Conference in New York City, a few weighed in on the issue.

Read more

Pharma Becoming Ever-Bigger Fan of Semantic Technology

Courtesy: Flickr/

At the Semantic Technology and Business Conference in New York City last week, attendees got to hear a lot about how semantic technology is influencing various sectors, such as government (see our stories here and here) and media (see this article and this one). Another prominent one on display: pharmaceuticals.

Pharma, for example, was the driving use case for the update to Callimachus that focuses on helping users deal with data that’s external to the framework for data-driven applications, David Wood, CTO of Callimachus project sponsor 3 Round Stones, told The Semantic Web Blog at the event. (To learn more about the update, see our story here.)

A session on Tuesday last week saw Lee Feigenbaum, vp of marketing at Cambridge Semantics, which makes the Anzo express and Anzo Enterprise solutions,  put forth a case for semantic tech as being key to data integration and interoperability in the sector, as well. “Can semantic web technologies break down enterprise data silos just as they break down document silos on the web?” he said. “The answer to the question is, “Of course.” Compared to the web, the data silo challenges of even the largest pharmaceuticals organization is relatively minor.”

Read more

Down with the Data Warehouse! Long Live the Semantic Data Warehouse!

East wall of Courtyard brick work, construction of the McKim BuildingI had a call with a Fortune 100 IT team that is looking at using semantic technology as an alternative to the Data Warehouse.  This is my favorite kind of conversation, since I firmly believe the traditional data warehouse is dead but just doesn’t know it yet.

This is the situation the IT team explained:

We need to aggregate information and present it to the user, so we build a warehouse.  We spend all this time building and designing the warehouse, and when it’s done they need something else.  Unfortunately, it’s not so easy to modify a warehouse once it’s running, so we build another one.  And then another.  The cycle has been repeating itself for years and is not sustainable.

Philadelphia Spectrum demolition: brick by brick

The alternative to warehousing is Data Virtualization (EII, Data Federation…lots of terms for it)…or, at least that’s what they, and many others, see!  Essentially, they have been burned by years of working with an inflexible technology, so are looking to dump the approach all together.

I get this.  If a Durian is the only fruit you’ve ever smelled, you’d think all fruit were really stinky.

Read more

Time for Semantic ETL?

What’s the link between the trends of more and more objects and even commercial transactions on the web being described in a machine-readable, semantic format and the endless streaming of all that data? Revenue-funded startup First Retail, whose principals Anne Jude Hunt and Simon G. Handley will be speaking at the upcoming Semantic Technology Conference in June, thinks the answer is semantic ETL.

Extract, transform, load (ETL) is a widely known concept in the well-charted terrain of the IT world. That’s about transforming a bunch of heterogeneous data to unify it within a data warehouse and get some use out of it.

Semantic ETL, says Hunt, is brought on by the fact that today people want to deal with the growing loads of streaming data while it’s streaming and that “people want intelligent data, machine-readable tags,[they want] to slice and dice it for BI in lots of different ways, so the  traditional data warehouse and relational database approach is just not working for people.” Cleansed and integrated semantic data loaded into distributed, scalable triple stores can come to the rescue.

Read more

The Federated Enterprise (Using Semantic Technology Standards to Federate Information and to Enable Emergent Analytics)

comparing apples to diets
Photo Credits: Apple – FlickR/muffet ; Food Pyramid – FlickR/teacher_caroline_acsp

[Editor’s Note: This article is intended to complement the post, “I’ve got a Federated Bridge to Sell You (A Defense of the Warehouse)” by Rob Gonzalez (Cambridge Semantics). Here, Michael Lang, CEO & Founder of Revelytix, weighs in on the degree to which federating information using semantic technology could compete with the capabilities delivered by data warehouses.]

The term “federation” is used often in the IT domain, but I have never seen it precisely defined, so here goes: A federation is a collection of entities that act according to a set of policies such that all of the entities interoperate and integrate with each other, and as a collection it is viewed as a single entity. Of course this definition is a description of the United States of America (which is a federation) – the road systems are integrated and interoperable, so are the legal systems of the states; and it could be applied equally well to the IT infrastructure of any large enterprise or the World Wide Web (emphasis on could), if we put the right pieces in place to make an IT infrastructure a federation. Read more

I’ve got a Federated Bridge to Sell You (A Defense of the Warehouse)

[Editor’s Note: At the Semantic Web Summit conference in Boston in November, a discussion arose around Federated Data vs. Data Warehousing.  Rob Gonzalez of Cambridge Semantics raised some very interesting points that I asked him to expand on in the post below.  And whether you agree or disagree, we want to hear from you.

Bridge for sale - 1/2 off!The Semantic Web dream of data federation is awesome.  You type in a query, and magical, intelligent agents scurry all over the datasphere, bringing back information to give you a complete, up-to-date, correct answer to your question.  No need for a messy, time-consuming datamart project!  What’s not to love, right?

Eric asked me to write this piece, and so I find myself in the unenviable position of having to tell you, dear reader and Semantic Web fan, that there is no Santa Claus (of data federation).  I’d like to make a case for the continued need for data consolidation in datamarts—yes, even in the Semantic Web-world—to gain real value from your enterprise data. Read more