Big Data For Lean Startups, Or, A Poor Man’s Watson
What do big companies have that most emerging businesses don’t have to help them get value from Big Data? Well, to start with, there’s lots of money and a ton of technology resources.
Never fear. At the upcoming Semantic Tech & Business conference in Berlin, Christopher Testa, CTO of startup WhiteBox Inc., plans to give companies with considerably fewer resources than giants like Google and IBM insight into how to use Big Data as a small, lean startup. His guidance will draw from his own past experiences at Google training AdSense; lessons learned studying the development of IBM’s Watson; and his current efforts to apply Big Data principles to create an expert system for amateur radio operator license exams at his own startup, with limited engineering resources. Most recently Testa was head of engineering at Ad.ly, and that will factor into advice about how to run a data center with free and open source solutions, too.

Semantic Tech & Business Conference returns to San Francisco in June! Join us from June 3-7 for complete coverage of Big Data, Linked Data, Extreme Information Management, and Semantic Web. From breakthrough approaches to solving business problems to the big data implications of fast–evolving technologies, SemTechBiz provides you with an unparalleled interactive experience and delivers tangible business value. We're offering a special early rate when you register by February 17.
Some 1,000 individual organizations compose the Dutch government, each with their own websites. An effort to employ a search engine a few years ago to spider those different and separate web sites to have one single point of access didn’t work as anticipated. The next step to bring some order was to assign all the documents published on those sites a common kernel of metadata fields, which led to building an XML application to enable a structured approach. Linked Data entered the picture about a year and a half ago.
Google this week posted a video about its evolution of search that’s worth the watch. All six minutes of it.
Antidot
About this time last year The Semantic Web Blog
Let’s start with Google, which in the spring closed on its acquisition of ITA Software, a move that was expected to help it get semantics into travel booking thanks to ITA’s
Regator actually has been around as a curated blog directory and search engine for a couple of years, and in and of itself is a perfectly good and pretty fast source for the word on the digital street, along with the other usual suspects (Twitter, Facebook, Google blogs, CNN, etc.) – at least in so far as the big stories go. “But unless it’s a really big story and Twitter explodes eventually, you won’t find those second-tier news stories so easily,” says Scott Lockhart, Regator cofounder and CEO.

Eric Franzon
VP Community
Jennifer Zaino
Contributor
Angela Guess Contributor
semanticweb.com Twitter feed loading...