By Eric Franzon on September 15, 2011 9:00 AM
We sat down with Mike Tung, CEO of Diffbot to learn more about this innovative technology that takes a different approach to deriving meaning from web pages.
SemanticWeb.com: What is Diffbot?
Mike Tung: Diffbot is a technology that allows software applications to interpret web pages the way human beings do–visually. We offer an API to developers that lets them visually extract semantic information from web pages depending on the page type. We’ve observed that the entire web can be classified into roughly 30 structural page types and have trained our visual extraction algorithm on two of those page types so far–frontpage and article pages.
Read more

Semantic Tech & Business Conference is returning to San Francisco in June! Join us from June 3-7 for complete coverage of Big Data, Linked Data, Extreme Information Management, and Semantic Web. From breakthrough approaches to solving business problems to the big data implications of fast–evolving technologies, SemTechBiz provides you with an unparalleled interactive experience and delivers tangible business value.
We're offering a special early rate when you register by February 17.
Sign up now!
announcements,
Community,
Event News,
Events,
How-to,
Industry News,
Learning,
Linked Data,
News,
Open data,
UI
By Eric Franzon on June 15, 2011 6:53 PM
SemanticWeb.com is pleased to announce that we are sponsoring a Web Mining Hack Day, Saturday, June 25, 2011 in Palo Alto, California. More details below after the jump.
Hosted by AOL, and organized by Diffbot and StartX (the Stanford University incubator), the Hack Day promises to be a great opportunity for back-end coders and UI/UX design experts to get together with the goal of building exciting semantic applications. The organizers suggest that participants will be able to:
- Meet and network with other web mining experts, hackers, and students.
- Learn about new semantic technologies and open web APIs.
- See the new the AOL West Coast Headquarters, StartX, Stanford University’s startup accelerator. Have some pizza on us.
- Hack on new ideas and show off your projects.
Read more
By Jennifer Zaino on April 22, 2011 2:20 AM
We reported a couple of months back about how AOL’s purchase of the Huffington Post delivered some more semantic technology to its doorstep. But that’s not the only semantically-related project underway there: Amit Moran, R&D Manager at the AOL Relegence team, is working on a movie sentiment ranking project using Twitter and other social media data. It should see the light of day in the near future on one of its properties.
“We are developing a general infrastructure for sentiment analysis that deals with finding out what people think of entities in a content stream,” Moran told attendees at the recent Sentiment Analysis Symposium in New York City. The use cases for sentiment analytics are wide – what do you think of a restaurant, or a politician, for that matter – so domain specificity becomes important in its actual application. AOL went with movies as a first option – not a surprising choice given its ownership of sites like Moviefone.
Read more
By Angela Guess on February 18, 2011 6:30 PM
In the wake of AOL’s purchase of the Huffington Post, bystanders have reacted with confusion. Stephen Colbert, for example, teased the $315 million purchase by creating The Colbuffington Re-Post and offering his site up for $316 million.
A recent article comments, “In what current chatter has labeled a last and perhaps impotent attempt at salvation for AOL (see their marriage to Time Warner), it appears that this latest venture could yield some unexpected advantages. HuffPo has been attracting record readership.”
Read more
By Jennifer Zaino on February 8, 2011 10:12 AM

What got a bit lost in the news about AOL’s purchase of the Huffington Post was that the deal also gets the Internet content provider some more semantic technology (the company’s past acquisitions include social/semantic matching Q-and-A platform Yedda, for example). In addition to The Huffington Post having been an early adopter of Thomson Reuters Open Calais semantic web service to identify and extract entities, facts and events for localized content initiatives, the site last year acquired Adaptive Semantics and its JuLiA platform for helping publishers discover the leaders in their social graph.
That acquisition now could bear fruit for AOL in influencing community engagement across a whole lot more web sites. AOL properties include community-specific sites such as the Patch network of localized news and events information, the DailyFinance business and investment news site, Autoblog, and many more.
Read more
By Semantic Universe on September 28, 2010 11:35 AM
By David Scott Lewis on January 16, 2008 1:36 PM
Back when I was an industry analyst (VP, E-Business Strategies at the META Group, since acquired by Gartner), I often had to critique emerging markets. Unlike venture capitalists, industry analysts are privy to product roadmaps from publicly-traded companies, including the industry giants (Oracle, SAP, Microsoft, IBM). And unlike i-bankers, they are privy to product roadmaps from start-ups. And as a kicker, some analysts (actually, only those with the largest firms; back then, primarily limited to those analysts with Gartner, Forrester, META and Giga) get a lot of great feedback from CIOs and other end users.
Read more