
Posts Tagged ‘microdata’

Semantic Commerce: Structuring Your Retail Website for the Next Generation Web

Are you wondering why your product pages don’t stand out in search results like those from Amazon (shown below) or other competing e-commerce websites? These expanded results are commonly known as Rich Snippets (as named by Google) and are the result of having your HTML structured correctly with semantic markup. Whether you’re savvy to HTML5 and the latest design trends, or you haven’t updated your website code in years, this article will explain why it’s important to structure your data properly using semantic standards.

Sample of Rich Snippet result

There are a number of ways to structure your data to make it more relevant to search engines, as well as social media sites. As an e-commerce retailer it is important to understand which of these standards you should consider including in your website. You should take some time to ensure you are implementing semantic markup, and doing it correctly. It has the power to better inform potential customers before they land on your site. Customers can see product reviews, pricing and stock information, and even images before clicking through to your website. This can increase click-through rates, improve conversions, and generally support your SEO objectives.

Read more

SemTechBiz is Less Than 3 Weeks Away

The Semantic Tech & Business Conference (SemTechBiz) is coming to San Francisco on June 3-7! Join us for case studies, innovative panels, tutorials, and keynotes that will provide you with practical advice, hands-on guidance, and breakthrough approaches to solving business problems with semantic technology. Passes go up $200 at the door. Sign up now and save!

Catching Up With Yandex: What Russia’s Leading Search Engine Has To Say About Schema.org

Update: Yandex today (April 26th) reported that net income in the first three months of 2012 rose 53 percent from the same period last year to 1.26 billion rubles ($43 million) as text-based advertising revenue rose, according to Bloomberg. Sales gained 51 percent to 5.9 billion rubles.

In November Russian search engine Yandex joined Google, Microsoft Bing, and Yahoo! to collaborate on schema.org. The Semantic Web Blog recently caught up by email with Alexander Shubin, Yandex product manager and head of strategic direction, to discuss this and other developments.

The Semantic Web Blog: Can you update us about how Yandex is doing? We know it’s still leading search traffic in Russia, but do you see more competition there, and how have international expansion plans been proceeding?

Shubin: Yandex is the leader in Russia with 59 to 60 percent market share. Russia is one of the few countries where a local search engine keeps a leading position, in spite of international players’ expansion.

Last year Yandex was launched in Turkey, where we suggest 12 services (including web search) so far. According to our statistics, yandex.com.tr processes more than 1 million queries daily. Turkey is the first non-Russian speaking market for us and we have done a lot of work to deliver services that would be interesting for the local community.  The main target for Yandex in Turkey, where one search engine still keeps 90 percent of search market, is to become the Number 2 player and to deliver more local search results and services than our competitor does.

Turkey is more or less an experiment for us: If we meet our target there, we can potentially do the same on any other non-Russian speaking market. But it is too early to make any conclusions or announcements so far as we have worked in Turkey only half of year. Stay tuned!

Read more

Linked Data on the Web Workshop at WWW 2012

This year was the 5th edition of the Linked Data on the Web Workshop, co-located with the World Wide Web Conference in Lyon, France.

At this workshop, seven issues caught my attention:

1) Media: Yunja Li presented on Synote: Weaving Media Fragments and Linked Data. This is interesting for those who want to link not only to an entire video but also to a part of a video at a specific time interval, and to add metadata about that part.

2) NLP to Linked Data: How can we relate the results of different named entity extraction tools to Linked Data? Giuseppe Rizzo introduced their project, NERD, which addresses this question.
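To make the first item more concrete: the W3C Media Fragments URI syntax lets a link address a time interval of a video by appending `#t=start,end` (in seconds) to its URL. The snippet below is only a minimal sketch of that idea, not Synote's implementation; the function name and the video URL are made up for illustration.

```python
from urllib.parse import urldefrag


def _fmt(seconds):
    """Format seconds without a trailing '.0' for whole numbers."""
    value = float(seconds)
    return str(int(value)) if value.is_integer() else str(value)


def with_time_fragment(url, start, end):
    """Return `url` with a temporal media fragment (#t=start,end), in seconds.

    Hypothetical helper for illustration; any existing fragment is replaced.
    """
    if start < 0 or end <= start:
        raise ValueError("expected 0 <= start < end")
    base, _old_fragment = urldefrag(url)  # strip any existing '#...' part
    return f"{base}#t={_fmt(start)},{_fmt(end)}"


# Hypothetical video URL, used only as an example.
clip = with_time_fragment("http://example.org/lecture.mp4", 90, 135.5)
print(clip)
```

A link built this way points at the interval from 1:30 to 2:15.5 of the video rather than at the whole file, and metadata can then be attached to that fragment URI.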

Read more

Google Announces Updates to Rich Snippets

Google has announced two updates to rich snippets, the enhanced search-result format it introduced in 2009 for pages that use semantic markup.

The first update addresses an issue raised on answers.semanticweb.com in July of 2011. Prior to this update, only some places in the world saw rich snippets in their local results. Now product rich snippets are getting global support, meaning that users worldwide will be able to preview product information in the rich snippet. Here is an example from www.google.fr:

sample of rich snippet from Google France

Read more

Growing Resource: WebDataCommons.org

Following this teaser last week, Dr. Christian Bizer has reported, “We are happy to announce WebDataCommons.org, a joint project of Freie Universität Berlin and the Karlsruhe Institute of Technology to extract all Microformat, Microdata and RDFa data from the Common Crawl web corpus, the largest and most up-to-date web corpus that is currently available to the public. WebDataCommons.org provides the extracted data for download in the form of RDF-quads. In addition, we produce basic statistics about the extracted data.” Read more
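For readers who have not met RDF quads before: a quad is an ordinary subject-predicate-object triple plus a fourth term naming the graph the statement belongs to, which for crawl-derived data would presumably be the page the statement was extracted from. The sketch below splits one N-Quads-style line into its four terms. The sample line is invented, not taken from the WebDataCommons download, and the pattern is a simplification of the full N-Quads grammar.

```python
import re

# Simplified term pattern: an IRI in <...>, a blank node (_:id), or a quoted
# literal with an optional datatype (^^<...>) or language tag (@en).
TERM = r'(<[^>]*>|_:\S+|"(?:[^"\\]|\\.)*"(?:\^\^<[^>]*>|@[A-Za-z0-9-]+)?)'
QUAD = re.compile(rf'^\s*{TERM}\s+{TERM}\s+{TERM}\s+{TERM}\s*\.\s*$')


def parse_quad(line):
    """Split one quad line into (subject, predicate, object, graph)."""
    match = QUAD.match(line)
    if match is None:
        raise ValueError(f"not a recognizable quad: {line!r}")
    return match.groups()


# Invented example line; the URLs are placeholders.
sample = ('_:item1 <http://schema.org/name> "Blue Widget" '
          '<http://example.com/products/widget.html> .')

subject, predicate, obj, graph = parse_quad(sample)
print(graph)  # the fourth term: where the statement came from
```

For real work a full RDF library is the safer choice; the point here is only what the fourth term adds over a plain triple.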

Common Crawl To Add New Data In Amazon Web Services Bucket

The Common Crawl Foundation is on the verge of adding to its Amazon Web Services (AWS) Public Data Set of openly and freely accessible web crawl data. It was back in January that Common Crawl announced the debut of its corpus on AWS (see our story here). Now, a billion new web sites are in the bucket, according to Common Crawl director Lisa Green, adding to the 5 billion web pages already there.

“When are you going to have new data is one of most frequent questions we get,” she says. The answer is that processing is underway now, and she hopes they’ll be ready to go this week.

Read more

Microdata, RDF, or Both?

Roy Tennant recently wrote an opinion piece declaring that microdata, not RDF, will power the semantic web. Needless to say, this stirred up some strong opinions in the comments. Tennant writes, “While RDF is complex, and designed to be implemented as a stand-alone depiction of metadata, it does have an implementation that is designed for embedding in web pages: RDFa. On the other hand, microdata is relatively simple and solely designed to be embedded in web pages. While the metadata cognoscenti are in the RDF camp, Google, Microsoft, and Yahoo! have thrown their lot in with microdata by launching the Schema.org effort. Were I a betting man, I wouldn’t be backing RDF at this point.”

As we reported in November, schema.org has indicated support for both microdata and RDFa. Read more

Introduction to: RDFa

Simply put, RDFa is another syntax for RDF. The interesting aspect of RDFa is that it is embedded in HTML. This means that you can state what things on your HTML page actually mean. For example, you can specify that a certain text is the title of a blog post or it’s the name of a product or it’s the price for a certain product. This is starting to be commonly known as “adding semantic markup”.

Historically, RDFa was specified only for XHTML. Currently, RDFa 1.1 is specified for XHTML and HTML5. Additionally, RDFa 1.1 works for any XML-based language such as SVG. Recently, RDFa Lite was introduced as “a small subset of RDFa consisting of a few attributes that may be applied to most simple to moderate structured data markup tasks.” It is important to note that RDFa is not the only way to add semantics to your webpages. Microdata and Microformats are other options, and I will discuss these later on. As a reminder, you can publish your data as Linked Data through RDFa. Inside your markup, you can link to other URIs or others can link to your HTML+RDFa webpages.
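As a rough illustration of what "stating what things mean" looks like, the sketch below embeds a small, invented blog-post fragment that uses three RDFa Lite attributes (`vocab`, `typeof`, `property`) and pulls the marked-up values back out with Python's standard HTML parser. The markup, the author name, and the `PropertyCollector` class are assumptions made for this example; a real RDFa processor applies many more rules (nesting, `resource`, `content`, prefixes).

```python
from html.parser import HTMLParser

# Invented blog-post markup using RDFa Lite attributes.
PAGE = """
<div vocab="http://schema.org/" typeof="BlogPosting">
  <h1 property="headline">Introduction to RDFa</h1>
  <span property="author">Jane Doe</span>
</div>
"""


class PropertyCollector(HTMLParser):
    """Collect the text of elements that carry a `property` attribute.

    Deliberately minimal: it does not handle nested elements or the
    other attributes a conforming RDFa processor understands.
    """

    def __init__(self):
        super().__init__()
        self.properties = {}
        self._current = None  # property name whose text is being read

    def handle_starttag(self, tag, attrs):
        name = dict(attrs).get("property")
        if name is not None:
            self._current = name
            self.properties[name] = ""

    def handle_data(self, data):
        if self._current is not None:
            self.properties[self._current] += data

    def handle_endtag(self, tag):
        self._current = None


collector = PropertyCollector()
collector.feed(PAGE)
print(collector.properties)
```

The same visible text a reader sees ("Introduction to RDFa") is now also labeled for machines as the headline of a blog posting.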

Why publish RDFa? Read more

Common Crawl Founder Gil Elbaz Speaks About New Relationship With Amazon, Semantic Web Projects Using Its Corpus, And Why Open Web Crawls Matter To Developing Big Data Expertise

The Common Crawl Foundation’s repository of openly and freely accessible web crawl data is about to go live as a Public Data Set on Amazon Web Services. The non-profit Common Crawl is the vision of Gil Elbaz, who founded Applied Semantics (which Google acquired for its AdSense technology) as well as the Factual open data aggregation platform. It counts Nova Spivack, who’s been behind semantic services from Twine to Bottlenose, among its board of directors.

Elbaz’ goal in developing the repository: “You can’t access, let alone download, the Google or the Bing crawl data. So certainly we’re differentiated in being very open and transparent about what we’re crawling and actually making it available to developers,” he says.

“You might ask why is it going to be revolutionary to allow many more engineers and researchers and developers and students access to this data, whereas historically you have to work for one of the big search engines…. The question is, the world has the largest-ever corpus of knowledge out there on the web, and is there more that one can do with it than Google and Microsoft and a handful of other search engines are already doing? And the answer is unquestionably yes.”

Read more

Microdata in HTML5 for SEO

In a post that touts the value of adding semantic markup using HTML5 microdata for SEO benefits, Ben Truyman explains, “Microdata is a component of HTML5 aimed at adding more semantics and contextual information to existing content on a page. By doing so, Microdata provides others, like search engines or browsers, with more information about the contents of a page. This allows them to handle data in new and interesting ways. For example, a product detail page may list out a product’s SKU, pricing, reviews and availability — but there’s no real way for Google’s search engine crawlers to know exactly what that information means. With Microdata, we can explicitly tell Google how much our products cost and what rating our users gave it.” Read more
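To give a feel for the product example in that quote, here is a hedged sketch: an invented product fragment marked up with microdata attributes (`itemscope`, `itemtype`, `itemprop`) and schema.org's Product and Offer types, plus a few lines of Python that read the values back the way a crawler might. The product, SKU, and price are made up, and the `ItempropCollector` class is a simplification that flattens the nested offer into one dictionary; it is not how any search engine actually processes pages.

```python
from html.parser import HTMLParser

# Invented product markup using HTML5 microdata with schema.org types.
PRODUCT_PAGE = """
<div itemscope itemtype="http://schema.org/Product">
  <span itemprop="name">Blue Widget</span>
  <span itemprop="sku">BW-1001</span>
  <div itemprop="offers" itemscope itemtype="http://schema.org/Offer">
    <meta itemprop="priceCurrency" content="USD">
    <span itemprop="price">19.99</span>
    <link itemprop="availability" href="http://schema.org/InStock">
  </div>
</div>
"""


class ItempropCollector(HTMLParser):
    """Collect itemprop values into one flat dict (a simplification)."""

    def __init__(self):
        super().__init__()
        self.properties = {}
        self._current = None  # itemprop whose text is being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        name = attrs.get("itemprop")
        if name is None:
            return
        if tag == "meta":            # value lives in the content attribute
            self.properties[name] = attrs.get("content", "")
        elif tag == "link":          # value lives in the href attribute
            self.properties[name] = attrs.get("href", "")
        elif "itemscope" in attrs:   # nested item: its children hold the values
            return
        else:                        # value is the element's text
            self._current = name
            self.properties[name] = ""

    def handle_data(self, data):
        if self._current is not None:
            self.properties[self._current] += data

    def handle_endtag(self, tag):
        self._current = None


collector = ItempropCollector()
collector.feed(PRODUCT_PAGE)
print(collector.properties)
```

Without the `itemprop` attributes, "19.99" is just a number on the page; with them, a program can tell it is the price of an offer for this product.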
