<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SEO Skeptic</title>
	<atom:link href="http://www.seoskeptic.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.seoskeptic.com</link>
	<description>Fact, Fiction and Opinion in the World of SEO</description>
	<lastBuildDate>Sun, 24 Mar 2013 23:35:58 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.2</generator>
		<item>
		<title>Bing Mounts a Personal Offensive Against Google&#039;s Knowledge Graph</title>
		<link>http://www.seoskeptic.com/bing-mounts-a-personal-offensive-against-googles-knowledge-grap/</link>
		<comments>http://www.seoskeptic.com/bing-mounts-a-personal-offensive-against-googles-knowledge-grap/#comments</comments>
		<pubDate>Thu, 21 Mar 2013 19:09:49 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Bing]]></category>
		<category><![CDATA[Bing Snapshot]]></category>
		<category><![CDATA[Google Knowledge Graph]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1475</guid>
		<description><![CDATA[As reported by Danny Sullivan on Search Engine Land, Bing has announced and rolled out an update to its Satori-fueled Snapshot today. As with Google's Knowledge Graph, Bing's Snapshot (or "Snapshots" &#8211; the label is awkward one) is entity-focused, and the update extends the number of entities being shown, improves and extends the relationships displayed [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/bing-mounts-a-personal-offensive-against-googles-knowledge-grap/" title="Permanent link to Bing Mounts a Personal Offensive Against Google's Knowledge Graph"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/bing-snapshot.png" width="125" height="125" alt="Bing Mounts a Personal Offensive Against Google's Knowledge Graph with Snapshot Improvements" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/bing-mounts-a-personal-offensive-against-googles-knowledge-grap/"></g:plusone></div><p>As reported by <a title="Danny Sullivan on Twitter" href="dannysullivan" target="_blank">Danny Sullivan</a> on <a title="Bing Snapshots Grow To Understand People, Places &amp; Things With Satori Expansion" href="http://searchengineland.com/bing-snapshots-people-places-things-satori-152460" target="_blank">Search Engine Land</a>, Bing has <a title="Understand Your World with Bing" href="http://www.bing.com/blogs/site_blogs/b/search/archive/2013/03/21/satorii.aspx" target="_blank">announced</a> and rolled out an update to its Satori-fueled Snapshot today.</p>
<p>As with Google's Knowledge Graph, Bing's Snapshot (or "Snapshots" &#8211; the label is awkward one) is entity-focused, and the update extends the number of entities being shown, improves and extends the relationships displayed between entities, and displays Bing's prowess with entity disambiguation.</p>
<p>While this is resulting in more detailed and better-connected results being shown for all named entities, the biggest impact is certainly in the display of personal named entities (you know, "people") in Snapshot verticals.  And in this they're arguably now doing a much better job than Google is with their Knowledge Graph (and so my reference to a "personal offensive" in the title). With this latest update Bing is more than ever taking advantage of its social partnerships, and is in general exploiting the availability of social information for (living) people.<br />
<span id="more-1475"></span></p>
<p>This is an interesting &#8211; and I think substantial &#8211; challenge to the Knowledge Graph, which draws its information from a more limited set of ("trusted") sources, and so has a much higher "fame" threshold that a personal entity has to overcome before it can appear in this type of vertical.  It may turn out that the Snapshot barrier to entry is as low as possessing a LinkedIn profile.</p>
<p>Bing appears to have effectively removed this barrier by doing a really good job at disambiguating personal entities:  if they're displaying information from the <em>right</em> Twitter, LinkedIn, Facebook and Klout accounts, then (regardless of the <em>veracity</em> of the information) they're linking to "trusted" social sources insofar as they're accounts that are owned by the same person.</p>
<div id="attachment_1477" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/dan-brickley.png"><img class="size-full wp-image-1477" title="Bing Search Results for the Query &quot;dan brickley&quot;" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/dan-brickley-t.png" alt="Bing Search Results for the Query &quot;dan brickley&quot;" width="500" height="402" /></a><p class="wp-caption-text">Bing search results for the query "dan brickley". While Dan is well-known in the semantic web community, he is not a "famous person" in the sense that John Kerry (the example used in the Bing post) is, and to the best of my knowledge (and basic testing) he's never appeared in a Google Knowledge Graph vertical.<br />(Click to enlarge)</p></div>
<p>Practical takeaways?</p>
<ul>
<li>From a personal branding perspective, LinkedIn is more important than ever.  Aside from the large amount of information that's pulled from LinkedIn into the Snapshot vertical for a personal entity, it also seems to be the location from which the entity's photograph is pulled:  no LinkedIn photo, no Bing Snapshot photo.</li>
</ul>
<ul>
<li>Marketers have long pushed back against Klout, arguing (fairly persuasively) that Klout is a pretty blunt measure of social influence.  That is to say, the message has ended up being "your Klout score doesn't matter."  Guess what?  Your Klout score matters &#8211; maybe not to you, maybe not to those "in the know," but from a raw optics point of view it's now closely associated with your Snapshot in Bing.</li>
</ul>
<p>These takeaways are only "practical," of course, insofar as it impacts those who see such results in Bing:  currently <a title="comScore February 2013 U.S. Search Engine Rankings" href="http://www.comscore.com/Insights/Press_Releases/2013/3/comScore_Releases_February_2013_U.S._Search_Engine_Rankings" target="_blank">about 17% of searchers</a> in the United States, and a negligible number elsewhere in the world.</p>
<p>This limited reach may not be the case for long, however, depending on whether or not Google responds in kind.  This represents &#8211; for the first time, I think &#8211; a <em>real</em> challenge to Google's Knowledge Graph results, and I can't see Google sitting on their hands while Bing is now producing <em>better</em> semantically-fueled and interlinked results for non-celebrity personal entities than them.</p>
<p>And Bing is seemingly doing at least as good a job as Google for "famous" people too (for Google and Bing the working definition of "famous" or "celebrity" seems to be "a person with an established page about them on Wikipedia").</p>
<div id="attachment_1481" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/john-kerry.png"><img class="size-full wp-image-1481" title="Google Knowledge Graph and Bing Snapshot Results for &quot;john kerry&quot;" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/john-kerry-t.png" alt="Google Knowledge Graph and Bing Snapshot Results for &quot;john kerry&quot;" width="500" height="426" /></a><p class="wp-caption-text">Google Knowledge Graph (left) and Bing Snapshot results (right) for "john kerry".<br />(Click to enlarge)</p></div>
<p>Bing might arguably be returning <em>more</em> useful Snapshot results than the equivalent Knowledge Graph results because of the additional social properties (Twitter, LinkedIn, Klout) that are displayed for living celebrities in Snapshot, but not in the Knowledge Graph.</p>
<p>Again, I can't see Google taking this lying down.  Bing's socially-connected search results were at one time derided as being not particularly appetizing icing on the search results cake ("I don't care what movies my friends like").  Now the labors of socially integrating entities are beginning to bear fruit, and Google is starting to appear woefully behind in this realm.  Google is certainly promoting and expanding Google+ with nothing less than frenetic energy, but even though this isn't a walled garden in the Facebook sense, it is <em>one</em> garden:  Bing is now foraging <em></em>far further afield for social produce to include in its organic basket.</p>
<p>All of this is made possible by Bing's Satori graph-based repository that uses RDF and SPARQL, and has been designed to handle a huge number of triples.  Having (at least until now) flown lower under the radar than Google's Knowledge Graph technologies, not a ton has been published on Satori, but here's a couple of articles with interesting insights.</p>
<ul>
<li><a title="How Google and Microsoft taught search to “understand” the Web" href="http://arstechnica.com/information-technology/2012/06/inside-the-architecture-of-googles-knowledge-graph-and-microsofts-satori/">How Google and Microsoft taught search to “understand” the Web</a><br />
<em><a title="Sean Gallagher on Twitter" href="https://twitter.com/thepacketrat">Sean Gallagher</a>, Arts Technica, 6 June 2012</em></li>
<li><a title="Bing Now Knows Much More About People And Places Thanks To LinkedIn And Its Satori Entity Engine" href="http://techcrunch.com/2013/03/21/bing-just-got-a-lot-smarter-now-knows-more-about-people-and-places/">Bing Now Knows Much More about People and Places Thanks to LinkedIn and its Satori Entity Engine</a><br />
<em><a title="Frederic Lardinois on Twitter" href="https://twitter.com/fredericl">Frederic Lardinois</a>, TechCrunch, 21 March 2013</em></li>
<li><a title="Knowledge Graph, Satori, and Unicorn" href="http://www.mediapost.com/publications/article/196386/#axzz2OOisgn1w" target="_blank">Knowledge Graph, Satori, and Unicorn</a><br />
<em><a title="Laurie Sullivan on Twitter" href="https://twitter.com/LaurieSullivan" target="_blank">Laurie Sullivan</a>, 21 March 2013</em></li>
</ul>
<p>[Update, 24 March 2013:  Added the last link above, which also put me onto more Satori information from the Microsoft Trinity project website, which <a title="More on Satori" href="https://plus.google.com/106943062990152739506/posts/KRvj9RGzGPK" target="_blank">I've summarized here.</a>]</p>
<p>Completely independent of search market share wars, it's really great to see Bing putting competitive pressure on Google through improvements to their products rather than shrill and aggressive marketing campaigns.  Search engine users will ultimately be the beneficiaries of more competition in the semantic search results space.</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/bing-mounts-a-personal-offensive-against-googles-knowledge-grap/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/bing-mounts-a-personal-offensive-against-googles-knowledge-grap/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Beyond Rich Snippets: Semantic Web Technologies for Better SEO</title>
		<link>http://www.seoskeptic.com/beyond-rich-snippets-semantic-web-technologies-for-better-seo/</link>
		<comments>http://www.seoskeptic.com/beyond-rich-snippets-semantic-web-technologies-for-better-seo/#comments</comments>
		<pubDate>Fri, 08 Mar 2013 01:28:12 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Bing]]></category>
		<category><![CDATA[Google Knowledge Graph]]></category>
		<category><![CDATA[Google Plus]]></category>
		<category><![CDATA[Rich Snippets]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1352</guid>
		<description><![CDATA[The rallying cry from the search engines to encourage the use of semantic markup has been, so far, centered on rich snippets.  Markup you site with structured data, they say, and your search result snippet might be enhanced with information and graphics that will increase the number of searchers that click through from the results [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/beyond-rich-snippets-semantic-web-technologies-for-better-seo/" title="Permanent link to Beyond Rich Snippets: Semantic Web Technologies for Better SEO"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/beyond-rich-snippets.png" width="125" height="125" alt="Beyond Rich Snippets: Semantic Web Technologies for Better SEO" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/beyond-rich-snippets-semantic-web-technologies-for-better-seo/"></g:plusone></div><p>The rallying cry from the search engines to encourage the use of semantic markup has been, so far, centered on rich snippets.  Markup you site with structured data, they say, and your search result snippet might be enhanced with information and graphics that will increase the number of searchers that click through from the results to your website.</p>
<p>Absolutely true.</p>
<p>But the benefits of employing structured data and associated semantic web technologies extend past rich snippet generation, and that's what I'll be exploring here.<span id="more-1352"></span>This post is an annotated version of slides I presented at <a title="SearchFest 2013" href="http://www.sempdx.org/searchfest/2013-agenda/" target="_blank">SearchFest 2013</a> in Portland, Oregon.  It was a fabulous conference, and I was honored by the invitation from <a title="Matthew Brown on Twitter" href="https://twitter.com/MatthewJBrown" target="_blank">Matthew Brown</a> to appear there, and by the opportunity to present alongside <a title="Jeff Preston on Twitter" href="https://twitter.com/jeffreypreston" target="_blank">Jeff Preston</a> of Disney Interactive &#8211; thanks guys!</p>
<p>So what was that about rich snippets?  Let's start with a little history.</p>
<h3>Building the foundations:  1999-2005</h3>
<p><img class="aligncenter size-full wp-image-1354" title="Semantic Web Syntax and Vocabulary Development Related to Search, 1999-2005" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/syntax-vocabularies-1999-2005.png" alt="Semantic Web Syntax and Vocabulary Development Related to Search, 1999-2005" width="500" height="282" /></p>
<p>The <a title="Dublin Core Metadata Initiative (DCMI)" href="http://dublincore.org/" target="_blank">Dublin Core Metadata Initiative</a> (DCMI, or simply "Dublin Core") was among the first efforts at developing metadata standards for describing both physical and online resources (1995).   <a title="Microformats" href="http://microformats.org/" target="_blank">Microformats</a> (2004) enabled semantic markup of particular types of information by leveraging the attributes of markup tags.</p>
<p>Specific protocols like <a title="The Sitemap Protocol" href="http://www.sitemaps.org/" target="_blank">XML sitemaps</a> (2005) and product feeds like Google Base (2005) were among the earliest efforts by the search engines to better understand and index with structured data.</p>
<p>Underpinning all of these developments was the <a title="Resource Description Framework (RDF)" href="http://en.wikipedia.org/wiki/Resource_Description_Framework" target="_blank">Resource Description Framework</a>, or RDF (1999), which provided the general mechanism for describing resources and modeling information on the web.  <a title="Friend of a Friend (FOAF) Project" href="http://www.foaf-project.org/" target="_blank">FOAF</a> ("Friend of a Friend," 2000) built a simple but powerful ontology for describing persons on this framework.  And <a title="RSS - Rich Site Summary aka RDF Site Summary aka Really Simple Syndication" href="http://en.wikipedia.org/wiki/RSS" target="_blank">RSS</a> (originally "RDF Site Summary," 1999)" &#8211; a feed format for describing frequently updated resources &#8211; rapidly became one of the most widely-employed XML applications on the web.</p>
<h3>Rise of the vocabularies:  2005-2011</h3>
<p><img class="aligncenter size-full wp-image-1355" title="Semantic Web Syntax and Vocabulary Development Related to Search, 2005-2011" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/syntax-vocabularies-2005-2011.png" alt="Semantic Web Syntax and Vocabulary Development Related to Search, 2005-2011" width="500" height="282" /></p>
<p>By mid-decade the search engines (and in particular Google) were producing rich snippets and displaying topical verticals in search results by parsing and structuring the content of specific sites and, to a certain extend, making sense of data provided by microformats.</p>
<p>But it was the introduction of <a title="RDFa (Resource Description Framework in attributes)" href="http://en.wikipedia.org/wiki/RDFa" target="_blank">RDFa</a> (Resource Description Framework in attributes, 2007) and <a title="Microdata" href="http://en.wikipedia.org/wiki/Microdata_%28HTML%29" target="_blank">microdata</a> in HTML5 (2005-2008) that really allowed the search engines to leverage structured data.  These protocols provided a mechanism for marking up the visible content of web pages with structured data (without relying on class attributes), allowing rich metadata from external vocabularies to be embedded in machine-readable code.</p>
<p><a title="GoodRelations" href="http://www.heppnetz.de/projects/goodrelations/" target="_blank">GoodRelations</a> (2008) and <a title="rNews at the International Press Telecommunications Council" href="http://dev.iptc.org/rNews" target="_blank">rNews</a> (2011) leveraged the power of RDFa to build specialist vocabularies for ecommerce and news content, respectively.  The introduction of <a title="schema.org" href="http://schema.org/" target="_blank">schema.org</a> (2011), though, provided webmasters with the first general-purpose set of schemas that were officially sanctioned by the search engines.</p>
<p><a title="Freebase" href="http://www.freebase.com/" target="_blank">Freebase</a> (2007) developed as a large collaborative knowledge base of metadata, and <a title="DBpedia" href="http://dbpedia.org/About" target="_blank">DBpedia</a> (2007) evolved to extract structured content from Wikipedia.  The <a title="Open Graph Protocol" href="http://ogp.me/" target="_blank">Open Graph protocol</a> (2010) grew out of the Facebook Platform, and enabled developers to integrate web pages into the social graph (a graph of the relationships between internet users).  All of these initiatives were to play a role in the semantic evolution of search engines in the years ahead.</p>
<h3>Consolidation and growth:  2011-present</h3>
<p><img class="aligncenter size-full wp-image-1357" title="Semantic Web Syntax and Vocabulary Development Related to Search, 2011-2013" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/syntax-vocabularies-2011-20131.png" alt="Semantic Web Syntax and Vocabulary Development Related to Search, 2011-2013" width="500" height="282" /></p>
<p>Since 2011 the schema.org vocabulary has continued to grow, both by the addition of extensions built for schema.org, and by incorporating types and properties from rNews and GoodRelations.  This has meant that a increasingly richer, search engine-sanctioned vocabulary is available to webmasters, even if not all schema.org types and properties currently result in the production of rich snippets in the search results, or other clearly demonstrable benefits for search visibility.</p>
<p>Google introduced its <a title="Google's Knowledge Graph" href="http://www.google.com/insidesearch/features/search/knowledge.html" target="_blank">Knowledge Graph</a> in 2012, which draws on a number of structured data sources to populate this knowledge base that currently resides alongside Google's main search results.  Google's 2010 acquisition of Metaweb (developers of Freebase) turned out to be fundamental to the Knowledge Graph.  Bing's own version of the Knowledge Graph, <a title="Bing Snapshots" href="http://www.bing.com/blogs/site_blogs/b/search/archive/2012/12/10/find-famous-people-and-places-in-a-snap.aspx" target="_blank">Bing Snapshots</a>, references many of the same sources as the Knowledge Graph.</p>
<p>Just as Google leaned on structured data sources to build its knowledge graph, Facebook Platform technologies facilitated the release of <a title="Facebook Graph Search" href="https://www.facebook.com/about/graphsearch" target="_blank">Facebook Graph Search</a> in 2013.</p>
<p>So the search engines are using structured data to improve their search results and to build complementary products, while the power of the social graph is allowing sites like Facebook to challenge the search engines with their own, socially-informed search products.</p>
<h3>Specialty search and structured data</h3>
<p><img class="aligncenter size-full wp-image-1362" title="Google Recipe Search Results" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/recipe-specialty-search-in-google.png" alt="Google Recipe Search Results" width="500" height="282" /></p>
<p>Along with reviews, one of the earliest types of rich snippets to appear in search results based on structured data markup were recipes.  Building on this, and starting with recipes, Google has constructed a number of specialty search engines (or, depending on how you look at, subsets of its main results) that allow searches restricted to be restricted to specific topical domains.</p>
<p>Structured data markup is the price of admission to obtain a presence in these specialty engines, as the underlying markup for pages that appear in recipe search readily attest.</p>
<h3>Chicken?  Egg?</h3>
<p><img class="aligncenter size-full wp-image-1363" title="Recipe Results in Google Web Search" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/recipes-in-google-web-search.png" alt="Recipe Results in Google Web Search" width="500" height="282" /></p>
<p>The fundamental piece of information that recipe markup provides to the search engines is the formal declaration "this resource is a recipe."</p>
<p>Does this information help the search engines return relevant resources when they judge that the intent for a particular query might be a recipe, even if a keyword trigger like "recipe" does not appear in the query?  Judging from the similarity of the web search results for "chicken cacciatore" this might well be the case.</p>
<p>Even the image vertical for this query is dominated by results which might be informed by the use of structured data, the exception being a highly-optimized page containing only a highly-optimized picture of chicken cacciatore.</p>
<h3>Application search and recipe lessons learned</h3>
<p><img class="aligncenter size-full wp-image-1364" title="Results for Image Editing Software in Google Application Search" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/image-editing-software-in-applications-search.png" alt="Results for Image Editing Software in Google Application Search" width="500" height="333" /></p>
<p>As with recipe search, Google's application search relies on structured data markup to return results about, and only about, software applications.  Obviously sites with one or more pages dedicated to software have the ability to improve their search visibility by using application markup.</p>
<h3>[SIDEBAR] Google and the iTunes store</h3>
<p><img class="aligncenter size-full wp-image-1368" title="An iOS App that Appeared in Google Application Search" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/itunes-store1.png" alt="An iOS App that Appeared in Google Application Search" width="500" height="222" /></p>
<p>If the price of admission to a Google specialty search engine is structured data, what is a page from the iTunes store &#8211; which does not contain schema.org/MobileApplication markup &#8211; doing in the applications search results?</p>
<p>Google knows that any iTunes store app is a mobile application.  And as the iTunes store is built on standard templates, it's straightforward for Google to parse and structure the data it finds there.</p>
<p>The lesson here is that when Google considers a source of information to be important enough to provide value to its search results, it will go to exceptional lengths to parse and structure unstructured or semi-structured data.  It has previously done this with Yelp and pages with BazaarVoice reviews to generate review rich snippets in the SERPs, and has returned product and review rich snippets for Amazon, which is devoid of any Google-sanctioned structured data.  The exceptional lengths to which Google will go to structure data provides a pretty strong clue about the value it places in it.</p>
<h3>Web search and recipe lessons learned</h3>
<p><img class="aligncenter size-full wp-image-1370" title="Results for Image Editing Software in Google Web Search" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/image-editing-software-in-web-search.png" alt="Results for Image Editing Software in Google Web Search" width="500" height="361" /></p>
<p>Google application search is newer than recipe search, and the application schema is much newer than hRecipe, data-vocabulary.org/Recipe or schema.org/Recipe.  Accordingly, adoption of application semantic markup has lagged behind that of recipe markup.</p>
<p>This relatively poor adoption means there's a competitive advantage available for those that choose to markup software pages.  Aside from the fact that this markup might generate rich snippets in the main SERPs, and that it is required to appear in application search, there is the possibility of the sort of knock-on effect from specialty search observed in the web recipe results.  And if Google determines that a particular query's intent is software-related, there's no clearer signal that can be provided to Google here than saying unambiguously in the code that "this page is about a software application."</p>
<h3>Web search and the Knowledge Graph</h3>
<p><img class="aligncenter size-full wp-image-1373" title="An Events Vertical in the Google Knowledge Graph for the Query Seattle" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/knowledge-graph-events-vertical-for-seattle.png" alt="An Events Vertical in the Google Knowledge Graph for the Query Seattle" width="500" height="282" /></p>
<p>Google's Knowledge Graph has typically contained information derived from major trustworthy structured (or structure-able) sources like Wikipedia or IMDb.  Recently, though, results for events have begun to appear for city queries.</p>
<h3>Clicking through from the Knowledge Graph</h3>
<p><img class="aligncenter size-full wp-image-1374" title="Results for a Seattle Event from a Click on the Linked Google Knowledge Graph Vertical" src="http://www.seoskeptic.com/wp-content/uploads/2013/02/seattle-event-results-from-knowledge-graph-click-through.png" alt="Results for a Seattle Event from a Click on the Linked Google Knowledge Graph Vertical" width="500" height="309" /></p>
<p>While there is no specialty events engine where semantic markup is the price of admission, the results displayed when clicking through on Knowledge Graph event results suggests that Google favors structured data sources for these results, and might be relying on them in some form to populate the Knowledge Graph events vertical in the first place.</p>
<h3>[SIDEBAR] The Google Data Highlighter</h3>
<p><img class="aligncenter size-full wp-image-1396" title="The Google Data Highlighter in Action" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/google-data-highlighter.png" alt="The Google Data Highlighter in Action" width="500" height="262" /></p>
<p>Around the time that the Knowledge Graph events vertical started to appear, Google introduced the <a title="Google's Data Highlighter" href="http://googlewebmastercentral.blogspot.ca/2012/12/introducing-data-highlighter-for-event.html" target="_blank">Data Highlighter</a>, which allows webmasters to visually match visible website information with properties supported by the Highlighter.  In other words, it provides a mechanism to provide Google with structured information about a resource without marking it up in the code.</p>
<p>The first (and at time of writing, only) content type supported by the Highlighter?  Events.</p>
<p>If the Knowledge Graph events vertical is in some way predicated on the presence structured events data, this suggests that Google created the Highlighter to help grow the amount of structured data available to it about events.  It stands to reason that Google doesn't want to throw out the baby with the bathwater by restricting Knowledge Graph event listings to sites that have marked up their events, and has introduced the Highlighter to ensure that unstructured pages about events still have a presence in the events vertical and their linked results.</p>
<p>It also stands to reason that webmasters concerned with search visibility will want to pay attention to any new types supported by the Highlighter, even if they don't actually employ the Highlighter; if, say, software applications are supported by the Highlighter, webmasters with the wherewithal to do so would be well advised to markup their software pages.</p>
<h3>[SIDEBAR] An event thought experiment</h3>
<p><img class="aligncenter size-full wp-image-1398" title="SEMpdx's SearchFest 2013 Was Not Featured in the Knowledge Graph Events Vertical for the Query Portland" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/searchfest-2013.png" alt="SEMpdx's SearchFest 2013 Was Not Featured in the Knowledge Graph Events Vertical for the Query Portland" width="500" height="249" /></p>
<p>The conference at which these slides were presented, SearchFest, did not appear in a Knowledge Graph search vertical for a "portland" query.  The SearchFest site did not feature event markup.  While there are doubtlessly a number of factors that go into Google's determination of what should appear in the Knowledge Graph events vertical (such as, perhaps, the number of times an event is cited, or the authority of the sites on which a given event appears), might SearchFest have appeared in the "portland" events vertical if the SearchFest site had carried events markup?</p>
<p>I subsequently learned that a SearchFest organizer <em>had</em> used the Data Highlighter to provide event data about the conference, but &#8211; aside from not appearing in the events vertical &#8211; the effort did not result in a rich snippet.  With so many unknowns and variables its hard to say whether this is due to the lack of events markup or any number of other factors.</p>
<p>I can report, however, in regard to the slide above used to demonstrate the Google Data Highlighter, that some two weeks after using the Highlighter that this events rich snippet has started to appear for the query "paste 2013":</p>
<p><img class="aligncenter size-full wp-image-1445" title="PASTE 2013 - An Events Rich Snippet from a Google Highlighter-Processed Page without Events Markup" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/paste-2013.png" alt="PASTE 2013 - An Events Rich Snippet from a Google Highlighter-Processed Page without Events Markup" width="518" height="88" /></p>
<h3>No rich snippets and no specialty search?</h3>
<p>There are myriad schema.org types that currently do not produce rich snippets, have no specialty search engine associated with them, and have no visibility in the Knowledge Graph.  What's the point of adding schema.org RDFa or microdata markup for these types?</p>
<p>The history of recipes, applications, events, products and reviews suggests both that pages with semantic markup can suddenly make an enhanced appearance in the search results, and that the types most likely to appear there are related to the types of information that people frequently search or browse for on large, well-used, topically-specific sites.</p>
<p>So marking up pages with structured data today may prove to be a competitive advantage tomorrow, and that return on investment may come sooner rather than later for certain types of information.  Absent a crystal ball, one can still make educated guesses about what some of the more important types might be.</p>
<ul>
<li><a title="schema.org/JobPosting" href="http://schema.org/JobPosting" target="_blank">schema.org/JobPosting</a><br />
Already in use at the <a title="National Resource Directory" href="https://www.nrd.gov/" target="_blank">US National Resource Directory</a>, bringing job postings directly to the SERPs would obviously be useful to searchers (however much a full-fledged Google or Bing jobs specialty engine might present perhaps too much of a politically-charged challenge to the likes of Monster or LinkedIn).</li>
<li><a title="Organization" href="http://schema.org/Organization" target="_blank">schema.org/Organization</a>, <a title="Person" href="http://schema.org/Person" target="_blank">Person</a>, <a title="Place" href="http://schema.org/Place" target="_blank">Place</a>, etc.<br />
Named entities are the life's blood of the Knowledge Graph and Bing Snapshots.  And even if marking up named entities don't end up producing fireworks in the SERPs, I'd argue that <strong>any structured information provided to the search engines about entities helps them better understand those entities</strong>.</li>
<li><a title="Product" href="http://schema.org/Product" target="_blank">schema.org/Product</a>, <a title="schema.org/Offer" href="http://schema.org/Offer" target="_blank">schema.org/Offer</a><br />
At this point its remarkable to me how <em>little</em> the search engines have done with ecommerce information in the web results.  Even if this is a result of a deliberate effort to protect product search (either to retain its monetary potential, or to promote the submission of account-linked and verified product feeds), sooner or later somebody is going to make hay of structured product and offer information.</li>
</ul>
<h3>[SIDEBAR] No feeds required</h3>
<p><img class="aligncenter size-full wp-image-1399" title="Structured Data Provides the Same Product Attributes - and more - Encoded in Product Feeds, Such As These Samsung Television Attributes" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/samsung-televisions-shopping-results.png" alt="Structured Data Provides the Same Product Attributes - and more - Encoded in Product Feeds, Such As These Samsung Television Attributes" width="500" height="198" /></p>
<p>Not that long ago, the only way to provide reliable, detailed product and offer information to search engines (and other data consumers) was through XML product feeds.</p>
<p>Ecommerce-related structured data has changed this.  Using schema.org, not only can ecommerce sites expose the same information in markup that is supplied in feeds, but &#8211; with GoodRelations integration &#8211; websites can now markup<em></em> <em>more detailed information</em> about their products, offers and services than are supported by product feeds.</p>
<p>So there might be a future where product feeds might not be required for sites to appear in product search (data cleanliness and veracity being the chief impediment to that right now).  Might Bing &#8211; already trying to gain favor with consumers by publicly attacking Google for moving to paid product listings &#8211; circumvent the restrictiveness of product feeds altogether?  Or might some other player do an end run around them both by creating their own product search engine?</p>
<h3>[SIDEBAR] Structured markup and Google CSE</h3>
<p><img class="aligncenter size-full wp-image-1401" title="A Google Custom Search Engine - Google Site Search - Allows Structured Data to Be Used for Filtering, Sorting and Biasing Results" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/google-custom-search-engine.png" alt="A Google Custom Search Engine - Google Site Search - Allows Structured Data to Be Used for Filtering, Sorting and Biasing Results" width="500" height="281" /></p>
<p>Sites employing <a title="Google Site Search" href="http://www.google.com/enterprise/search/products_gss.html" target="_blank">Google Site Search</a> (the paid version of <a title="Google Custom Search Engine" href="http://www.google.com/cse/" target="_blank">Google Custom Search</a>) can leverage almost any sort of structured data in their search results.  Certain data can be used for filtering results; other types of data can be used in the sorting or biasing (ranking) of Site Search results.</p>
<h3>No schema.org types that are relevant to you?</h3>
<p><img class="aligncenter size-full wp-image-1402" title="schema.org Includes an Extension Mechanism to Extend the Vocabulary" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/extension-mechanism.png" alt="schema.org Includes an Extension Mechanism to Extend the Vocabulary" width="500" height="282" /></p>
<p>schema.org is a large general-purpose vocabulary, but is not comprehensive. There are many types of information found on websites that cannot describe be described using schema.org.</p>
<p>If a website owner judges that there's potential value in doing so, they can create<a title="schema.org Extension Mechanism" href="http://schema.org/docs/extension.html" target="_blank"> an extension to schema.org</a> for their content.  For example, there's no type for video games in the current schema, and so a site dedicated to video games might build an extension in order to be able to provide detailed semantic information about video games listed in their (publicly-consumable) code.</p>
<p>One could build such an extension, employ it one's site and make internal use of the types and properties developed.  One might further hope that other video game sites use the extension, and so extend its utility.  For a search marketer, however, ultimately the wisdom decision to spend time and money on developing an extension has to hinge on improved search visibility.</p>
<h3>Different approaches to schema.org extensions</h3>
<p><img class="aligncenter size-full wp-image-1404" title="Semantic Web Developers and SEOs Take Different Approaches to schema.org Extensions" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/vocabulary-reuse-and-extension.png" alt="Semantic Web Developers and SEOs Take Different Approaches to schema.org Extensions" width="500" height="282" /></p>
<p>There are countless well-developed formal vocabularies and ontologies available on the web, covering a large number of specialist domains.  Rather than extend schema.org by adding more types and properties to it, semantic web developers are more inclined to <a title="Schema.org markup for external lists - 11 May 2012 blog post" href="http://blog.schema.org/2012/05/schemaorg-markup-for-external-lists.html" target="_blank">link schema.org markup</a> to controlled vocabularies, standards and datasets where they exist.  This makes a lot of sense, as it keeps the maintenance and development of these vocabularies singularly in the hands of the specialists in the domains to which they are related.</p>
<p>For search marketers, though, types and properties formally integrated into schema.org have a much better chance of eventually surfacing in search engine results.  And while there still might be search benefits for a site employing external lists &#8211; insofar as the search engines might derive a better understanding of that site's content &#8211; the nebulous and unproven nature of this value proposition is unlikely to result in resources being committed to vocabulary building (at least where the impetuous is SEO).</p>
<p>Accordingly, search marketers with a terrific use case for an extension might find the best route is to lobby for its inclusion in schema.org directly.  This is <a title="The intial proposal for health and medical extensions to schema.org" href="http://lists.w3.org/Archives/Public/public-vocabs/2012May/0057.html" target="_blank">more-or-less</a> <a title="Documentation related to health/medical types in schema.org, including the history of the extension" href="http://schema.org/docs/meddocs.html" target="_blank">how</a> MedicalEntity and its sub-types ended up in schema.org.</p>
<p>Proposed extensions to schema.org will have the greatest chance of success if the following steps are followed:</p>
<ul>
<li>Conduct research on vocabularies and datasets that may have already been developed in your topical domain, allowing you to build on them and to avoid unnecessary duplication of effort</li>
<li>World collaboratively with other leaders in your industry to develop and promote your extension</li>
<li>Make your proposed extension publicly available and solicit feedback from interested parties, refining and improving your extension based on this feedback</li>
<li>Formally propose adoption of your extension, citing need, use cases and the benefits to site owners, data consumers (both human and machine varieties) and search engines</li>
</ul>
<p>The proposal from Jindrich Mynarz for a <a title="A proposed JobMarket extension to schema.org" href="www..w3.org/wiki/WebSchemas/JobMarket" target="_blank">JobMarket extension to schema.org</a>, featured below, is an excellent example of a well thought-out and executed extension strategy.</p>
<div id="attachment_1406" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/extension-proposal.png"><img class="size-full wp-image-1406" title="A Well Thought-Out Extension Proposal" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/extension-proposal-t.png" alt="A Well Thought-Out Extension Proposal" width="500" height="280" /></a><p class="wp-caption-text">An example of a well thought-out extension proposal<br />(Click to enlarge)</p></div>
<h3>Structured data for structure’s sake</h3>
<div id="attachment_1409" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/breadcrumbs.png"><img class="size-full wp-image-1409" title="Even Breadcrumb Markup Can Provide Search Engines with Valuable Structural Clues" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/breadcrumbs-t.png" alt="Even Breadcrumb Markup Can Provide Search Engines with Valuable Structural Clues" width="500" height="281" /></a><p class="wp-caption-text">Even breadcrumb markup can provide the search engines with valuable clues about a website's structure<br />(Click to enlarge)</p></div>
<p>As I've already alluded to more than once, adding relevant, consistent, valid structured data to your site helps the search engines better understand the content and structure of your site, regardless of any special visibility you may or may not receive in the SERPs, such as rich snippets.  Let me reiterate this with some bold text for extra emphasis:  <strong>structured data helps the search engines better understand your site</strong>.</p>
<p>A <a title="Rich snippets - Breadcrumbs (Google Webmaster Tools Help)" href="http://support.google.com/webmasters/bin/answer.py?hl=en&amp;answer=185417" target="_blank">breadcrumb rich snippet</a> in the SERPs is hardly going to cause a doubling of your click-through rate from search.  But what if the underlying code helps the search engines more clearly understand the hierarchy of your site and the relationship between pages?  In the absence of the breadcrumbs I marked up with structured data, might Google nonetheless generate the ideal breadcrumbs, sitelinks and mini-sitelinks in the SERPs featured above?  Perhaps.  But given the results I don't feel in any way that this was wasted effort.</p>
<p>In delivering the SearchFest closing keynote, Bing's <a title="Duane Forrester of Bing on Twitter" href="https://twitter.com/DuaneForrester" target="_blank">Duane Forrester</a> urged webmasters to employ structured data, and said that doing so would not immediately boost your rankings, but would <em>pay off in the long run</em>.  Why?  Because <strong>structured data helps the search engines better understand your site</strong>.</p>
<h3>Structured data for greater data fidelity</h3>
<div id="attachment_1412" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/data-reuse.png"><img class="size-full wp-image-1412" title="Consistent Data Reuse" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/data-reuse-t.png" alt="Consistent Data Reuse" width="500" height="282" /></a><p class="wp-caption-text">Providing consistent data values helps search engines and other data consumers<br />(Click to enlarge)</p></div>
<p>Well-executed structured data can help provide a more consistent and positive experience for website users, whether their exposure to your content is through search results, social media or third-party applications.</p>
<p>Employing structured data can help developers and optimization specialists see what underlies a resource through a data lens &#8211; that is, as related data points and data values, rather than disconnected widgets and disparate pieces of information.  For example, a page's "like" button seen through this lens is not generically a widget for Facebook, but a mechanism to provide Facebook users and Open Graph consumers with precisely crafted information.</p>
<h3>Data fidelity as a trust-building measure</h3>
<div id="attachment_1414" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/product-data-fidelity.png"><img class="size-full wp-image-1414" title="Product Data Fidelity" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/product-data-fidelity-t.png" alt="Product Data Fidelity" width="500" height="267" /></a><p class="wp-caption-text">Data fidelity between a product feed, visible page content and structured data markup<br />(Click to enlarge)</p></div>
<p>Consistency of data is important for search engines.  In particular, as this principle applies to structured data, search engines will trust your content more if it can see that your visible content aligns with the data provided in your markup.</p>
<p>This is, in fact, almost certainly one of the reasons that the search engines put their weight behind schema.org, which was literally designed for attribute-based markup, rather than opting for Open Graph-like invisible metadata (remember &lt;meta&gt; keywords?).  And this is also the reason Google makes a point of advising against marking up non-visible content except in situations where a very precise data type is required but is not available on a page, such as a numeric representation of review star ratings, or event durations in ISO 8601 date format.</p>
<p>An additional point of data reference for the search engines are XML product feeds.  As in the case of the toaster oven pictured above, the search engines will accord greater trust to ecommerce pages if the product feed, markup and visible content are in sync.</p>
<p>If it looks like a duck, quacks like a duck and walks like a duck Google might feel reasonably confident that what they're looking at is a duck; if it looks like a duck, honks like a goose and does the Harlem Shuffle &#8230; not so much.</p>
<p>The inspiration for the slide above was from an excellent piece by <a title="Barbara Starr on Twitter" href="https://twitter.com/BarbaraStarr">Barbara Starr</a> on <a title="How Leveraging Data Quality In Google Shopping Can Increase Product Sales" href="http://searchengineland.com/how-leveraging-data-quality-in-google-shopping-can-increase-product-sales-148189" target="_blank">leveraging data quality in Google Shopping</a>, in which she says:</p>
<blockquote><p>Paid inclusion is now a fact of life in Google Shopping, and Google made a simultaneous mandate requiring clean, relevant, rich data inputs to product feeds, data that is verifiable by cross referencing other sources. That means the following data must all be the same and in sync:</p>
<ul>
<li>Data on the webpage visible to humans</li>
<li>On-page semantic markup</li>
<li>Data in the product feed</li>
</ul>
<p>Google’s mandate is strongly reminiscent of Semantic Web philosophy for dealing with data quality and provenance. As is the fact that Google uses some forms of Rich Snippets to expand its Knowledge Graph.</p></blockquote>
<p>Provenance &#8211; knowing precisely from where a particular piece of data originated &#8211; is an important semantic web concept, and the W3C has been developing <a title="PROV Overview on W3C" href="http://www.w3.org/TR/prov-overview/" target="_blank">the PROV model</a> to support the "inter-operable interchange of provenance information" on the web and elsewhere.</p>
<p>Where an SEO, in trying to assess whether a particular document is trustworthy, might look at the absence or presence of spam signals, a semantic web type is more liable to first ask the question, "where did it come from?"</p>
<h3><a name="profiles"></a>[SIDEBAR] Provenance and profiles</h3>
<p><img class="aligncenter size-full wp-image-1416" title="Personal and Corporate Named Entities for Google+, Bing and the Equivalent schema.org Type" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/verified-entities.png" alt="Personal and Corporate Named Entities for Google+, Bing and the Equivalent schema.org Type" width="500" height="246" /></p>
<p>Initial assessments of Google+ have tended to focus on its traction to date as a social network:  how many users does it have, how engaged are its users, how much information is shared over the platform, and so on.</p>
<p>This glosses over what I think is the true strength and ultimately the great potential of Google+, namely, as a interlinked network of <em>verified, canonical, named entities</em>.  The role that Google+ could potentially play in determining provenance is obvious.  The combination of a Google+ Profile or Page and structured data means Google can connect the dots between a web resource and the entity that produced that resource.</p>
<h3>[SIDEBAR] Provenance and a/Authorship</h3>
<div id="attachment_1418" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/verified-personal-entity.png"><img class="size-full wp-image-1418" title="Verified Personal Entities and Linked Data" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/verified-personal-entity-t.png" alt="Verified Personal Entities and Linked Data" width="500" height="264" /></a><p class="wp-caption-text">A possible linked data use of a verified personal entity profile<br />(Click to enlarge)</p></div>
<p>Much has been said about the role of authorship since the introduction of author rich snippets in Google search results, which &#8211; of course &#8211; are predicated on the existence of a Google+ Profile.</p>
<p>What I find interesting about these linkages is less the role that authorship ("Author Rank") might play in the display and ranking of search results (which might, indeed, turn out to be quite important), but the potential that the combination of structured data and verified Google+ Profiles and Pages might have in supporting complex queries based on linked data.  I could, for example, state on a page I authored (and which Google can verify I authored) that I work for InfoMine, even though that information might not appear on my Google profile.  From that Google could, with a high degree of confidence, include me in the results for the query "people that work for InfoMine."</p>
<h3>Using semantic tools to improve site structure</h3>
<div id="attachment_1420" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/ibm.png"><img class="size-full wp-image-1420" title="Entity Disambiguation - IBM vs. International Business Machines" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/ibm-t.png" alt="Entity Disambiguation - IBM vs. International Business Machines" width="500" height="281" /></a><p class="wp-caption-text">Entity disambiguation: is the name of this Company "IBM" or "International Business Machines"?<br />(Click to enlarge)</p></div>
<p>As much as adding structured data may help improve your site's visibility in search, semantic web technologies can be leveraged to help improve your site in general ways, even if these improvements don't result in the creation or manipulation of structured data on your site.</p>
<p>Extracting and disambiguating entities using APIs is one such use of semantic web technologies.  Categorizing topically identical content in two or more places because of entity variants may result in duplicate content, keyword cannibalization, or both.  On a news site, for examples, news stories pertaining to IBM might be listed either on a page about "IBM," a page about "International Business Machines," or both.  By using an API to identify stories about IBM, all these stories are associated with a single resource page, regardless of whether "IBM," "International Business Machines" or some other variant on the name appears in the content itself &#8211; as in this example of the IBM page on Wikipedia put through <a title="Calais Document Viewer" href="http://viewer.opencalais.com/" target="_blank">Calais</a>.</p>
<div id="attachment_1422" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/calais.png"><img class="size-full wp-image-1422" title="Entity Extraction and Disambiguation - Calais Example" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/calais-t.png" alt="Entity Extraction and Disambiguation - Calais Example" width="500" height="277" /></a><p class="wp-caption-text">Entity extraction and disambiguation: sample Calais output<br />(Click to enlarge)</p></div>
<p>Rich applications can be built on these technologies to help produce well-organized, meaningfully-linked, content-rich pages &#8211; all attributes that curry favor with the search engines regardless of the presence or absence of structured data.  <a title="Zemanta" href="http://www.zemanta.com/" target="_blank">Zemanta</a>, for example, has used semantic web technologies to provide bloggers with tags, multimedia resources, and relevant inline links based on a post's content.</p>
<div id="attachment_1424" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.seoskeptic.com/wp-content/uploads/2013/03/zemanta.png"><img class="size-full wp-image-1424" title="Entity Extraction and Disambiguation - Zemanta Example" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/zemanta-t.png" alt="Entity Extraction and Disambiguation - Zemanta Example" width="500" height="294" /></a><p class="wp-caption-text">Entity extraction and disambiguation: sample Zemanta output<br />(Click to enlarge)</p></div>
<h3>[SIDEBAR] Linked (Open) Data and search</h3>
<p><img class="aligncenter size-full wp-image-1428" title="Tim Berners-Lee's Four Rules of Linked Data as Applied to SEO" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/linked-data.png" alt="Tim Berners-Lee's Four Rules of Linked Data as Applied to SEO" width="500" height="250" /></p>
<p>While the <a title="Linked Data - by Tim Berners-Lee" href="http://www.w3.org/DesignIssues/LinkedData.html" target="_blank">principles of linked data</a> may not be directly applicable to SEO, there's certainly analogies that can be made between best practices in the two realms.  While I won't dwell on this, most SEOs would benefit from getting better acquainted with linked data, and understanding its importance in the semantic web world.  No time?  Purchase the <a title="Linked Data Mug" href="http://www.cafepress.com/+mug,480759174" target="_blank">linked data coffee cup</a> and keep it at your desk as a reminder.</p>
<h3>Structured data and semantic architecture</h3>
<p><img class="aligncenter size-full wp-image-1429" title="The BBC World Cup 2010 Website Employed Dynamic Semantic Publishing" src="http://www.seoskeptic.com/wp-content/uploads/2013/03/bbc-semantic-architecture.png" alt="The BBC World Cup 2010 Website Employed Dynamic Semantic Publishing" width="500" height="288" /></p>
<p>"Once we get the site built we'll SEO it."  Any seasoned search marketer has probably heard a variation on this declaration at least once in their career, and knows the folly the statement engenders.  While it's perfectly possible to markup a site with structured data after it's been built, a site that's constructed with an eye to semantic structure will fare better in the long run than one that's not.</p>
<p>The BBC World Cup 2010 website did not have structured data <em>applied</em> to it, but was actually <em>assembled</em> with the aid of semantic web technologies.  The result is a rock-solid resource that at once reliably meets human visitors' needs, and at the same time provides search engines with explicit, utterly unambiguous data (the BBC <a title="A Bundle of Links about Semantic Web Technologies at the BBC" href="http://bitly.com/bundles/aaranged/1" target="_blank">has taken a lead role</a> this sort of semantic architecture).</p>
<p>Fine for the megalith that is the BBC, but impractical for your site?  Perhaps.  But in the future expect more platforms and tools to emerge that can make this sort of architecture more accessible.  From a data perspective, after all, what is a standard ecommerce site if not a collection of entities with associated properties, all of which may be fully described using existing syntaxes and vocabularies?</p>
<p>&nbsp;</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/beyond-rich-snippets-semantic-web-technologies-for-better-seo/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/beyond-rich-snippets-semantic-web-technologies-for-better-seo/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>New York Times Prototypes a Linked Data Search Engine</title>
		<link>http://www.seoskeptic.com/new-york-times-prototypes-a-linked-data-search-engine/</link>
		<comments>http://www.seoskeptic.com/new-york-times-prototypes-a-linked-data-search-engine/#comments</comments>
		<pubDate>Mon, 04 Mar 2013 19:38:38 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[News Media]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[New York Times]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1376</guid>
		<description><![CDATA[On Beet.TV, Andy Plesser recently featured a short but fascinating video of Michael Zimbalist, Vice President of Research and Development Operations at the New York Times, talking with Joanna O'Connell of Forrester about a prototype linked data search engine being developed by the Times. Zimbalist begins by talking about the great asset that is the [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/new-york-times-prototypes-a-linked-data-search-engine/" title="Permanent link to New York Times Prototypes a Linked Data Search Engine"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/zimbalist-new-york-times.png" width="125" height="125" alt="New York Times Prototypes a Linked Data Search Engine" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/new-york-times-prototypes-a-linked-data-search-engine/"></g:plusone></div><p>On <a href="http://www.beet.tv/2013/03/timeslinked.html" title="Beet.TV" target="_blank">Beet.TV</a>, <a href="https://twitter.com/Beet_TV" title="Andy Plesser of Beet.TV on Twitter" target="_blank">Andy Plesser</a> recently featured a short but fascinating video of <a href="https://twitter.com/zimbalist" title="The New York Times' Micael Zimbalist on Twitter" target="_blank">Michael Zimbalist</a>, Vice President of <a href="http://nytlabs.com/" title="The New York Times Company Research and Development Lab" target="_blank">Research and Development Operations</a> at the <em>New York Times</em>, talking with <a href="https://twitter.com/joannaoconnell" title="Forrester Principal Analyst Joanna O'Connell on Twitter" target="_blank">Joanna O'Connell</a> of Forrester about a prototype linked data search engine being developed by the <em>Times</em>.<br />
<span id="more-1376"></span><br />
Zimbalist begins by talking about the great asset that is the <a href="http://en.wikipedia.org/wiki/New_York_Times_Index" title="The New York Times Index" target="_blank">New York Times Index</a>, and the relationship between the Index's metadata and linked data.</p>
<blockquote><p>For almost the entire life of the newspaper &#8230; we've been annotating that.  We've had the Times Index &#8211; you've probably seen it at the library, these big fat red books so you could go and look up the Index and find what issue of the Times and what section there was a story about that.  So that really provides the basis for some exceptionally rich metadata that's been consistent over the history of the organization.</p>
<p>So we're able to look at the text of articles as data, as very unstructured data, and begin to put some structure around it.  And we do it in a lot of really different ways.  So we have our metadata and our index now rationalized.  So all the terms in the index match up to all the terms in our metadata, and we've connected that to this linked data cloud.  There's this linked data movement that's trying to treat the entire web as this database of information.</p></blockquote>
<p>Zimbalist goes on to talk about entities and how, for example, the <em>New York Times</em> may have one way of referencing the named entity "Barack Obama" and Amazon another.</p>
<blockquote><p>And through this linked data movement we're able to say that their Barack Obama is the same as our Barack Obama and create these new editorial products that fuse these different bits of information.</p>
<p>So, for example, one the things we have built as a tool, just as a prototye example of the kind of services we might be able to deliver in the future, is a search engine where you can type in a college or university and we will go look through the articles that we've written and surface any articles that mention alumni or alumnae from that university even though we might not have mentioned in the article that that person attended that school.  Because we're able to link up the person's name with a larger database of college graduates.</p></blockquote>
<p>This is a great example of a leveraging linked data to create a product that is obviously useful in tying together disparate pieces of information that aren't available from a single source.</p>
<p>In response to a question from O'Connell, Zimbalist agreed that "normalization &#8230; does matter," but said that the bigger challenge was broad-based indexation of content.</p>
<blockquote><p>I feel like the bigger problem is getting all this stuff indexed.  Rationalizing among the indices seems like a more solvable problem than getting everybody who's publishing on the Internet to adequately index their content, but I don't know.</p></blockquote>
<p>Some definite food for thought here for data producers.  That is, how useful is that data outside your organization if it is not exposed?  And while Zimbalist does not explain the difference between "adequate" and "inadequate" indexation, any student of the semantic web could reasonably infer that "adequately" indexed data is ultimately some form of structured data.</p>
<p>Members of the "linked data movement" to which Zimbalist refers would likely also argue that maximum benefit is derived from data that is not only <em>structured</em> but <em>open</em>.  In discussing the prototype engine, O'Connell makes an oblique reference to this in terms of the possible data ownership issues.</p>
<blockquote><p>Well, I was just thinking about monetization challenges that it might cause, or rights challenges &#8230; how does that work when you have everything linked together like that doesn't it create a who owns what and who gets paid for what question?</p></blockquote>
<p>While Zimbalist concurs that this is "an interesting question," one doesn't get the impression that, as O'Connell suggests, that this is a question that the <em>Times</em> thinks about "every day."  To me, in fact, the prototype engine is something of a poster child for how useful applications may be built when structured data is made freely accessible to all data consumers.  The degree to which an application harnessing these data may be monetized is dependent on that data existing in a consumable format in the first place; that is, linked open data is itself a condition for creating products that have a monetization potentional.</p>
<p>Watch the entire interview:</p>
<p><iframe src="http://blip.tv/play/goRrg47WcQI.x?p=1" width="475" height="286" frameborder="0" allowfullscreen></iframe><embed type="application/x-shockwave-flash" src="http://a.blip.tv/api.swf#goRrg47WcQI" style="display:none"></embed></p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/new-york-times-prototypes-a-linked-data-search-engine/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/new-york-times-prototypes-a-linked-data-search-engine/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google Data Highlighter: Markup-Free Structured Data for Google</title>
		<link>http://www.seoskeptic.com/google-data-highlighter-markup-free-structured-data-for-google/</link>
		<comments>http://www.seoskeptic.com/google-data-highlighter-markup-free-structured-data-for-google/#comments</comments>
		<pubDate>Wed, 12 Dec 2012 20:52:47 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Data Highlighter]]></category>
		<category><![CDATA[Google Webmaster Tools]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1301</guid>
		<description><![CDATA[Google announced today a new, potentially powerful addition to its Webmaster Tools options today:  the Data Highlighter. The Data Highlighter allows webmasters to select text on a web page and associate it with properties for a particular data type.  At this time the only supported data type is for events, but (perhaps based on lessons [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/google-data-highlighter-markup-free-structured-data-for-google/" title="Permanent link to Google Data Highlighter: Markup-Free Structured Data for Google"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/data-highlighter.png" width="125" height="125" alt="Google Data Highlighter - Markup-Free Structured Data for Google" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/google-data-highlighter-markup-free-structured-data-for-google/"></g:plusone></div><p>Google <a title="Official Google Webmaster Central Blog: Introducing Data Highlighter for event data" href="http://googlewebmastercentral.blogspot.com/2012/12/introducing-data-highlighter-for-event.html" target="_blank">announced today</a> a new, potentially powerful addition to its Webmaster Tools options today:  the Data Highlighter.</p>
<p>The Data Highlighter allows webmasters to select text on a web page and associate it with properties for a particular data type.  At this time the only supported data type is for events, but (perhaps based on lessons learned from this initial rollout) expect more to follow.  Indeed, in the <a title="Data Highlighter help pages on Google" href="https://support.google.com/webmasters/bin/topic.py?hl=en&amp;topic=2692946&amp;parent=21997&amp;ctx=topic" target="_blank">Data Highlighter help section</a> on Google, there is already <a title="Google Help: Data types supported by Data Highlighter" href="https://support.google.com/webmasters/bin/topic.py?hl=en&amp;topic=2774098&amp;parent=2692946&amp;ctx=topic" target="_blank">a page</a> listing "data types supported by Data Highlighter."</p>
<p>While marking up &#8211; I mean "highlighting" &#8211; a single page with data related to an event without needing to add any additional code to a site is pretty useful in itself, what makes the Data Highlighter a potentially important leap forward in the the provision of structured information to Google is the ability to highlight <em>patterns</em> that pertain similar pages on a site.  In the sphere of events, this means that if your site has multiple, similarly structured pages containing event data, you can highlight event properties on a single page and have Google apply the same logic to pages that have "a consistent format."</p>
<p><span id="more-1301"></span></p>
<p>The utility of this move is obvious:  it allows webmasters to inform Google of structured information on a site without the need to add additional markup to a page, or even without knowledge of any vocabularies that pertain to the information at hand.   This in turn facilitates the production of rich snippets in Google search results, and provides Google with <em>de facto</em> structured data that it can use in all sorts of situations.</p>
<p>Put another way, if yesterday I wanted to reliably provide Google with structured information about an upcoming event listed on my website, I would need to know a fair amount about <a title="The schema.org Event data type" href="http://schema.org/Event" target="_blank">schema.org/Events</a> and its properties, encode that information using either microdata or RDFa, and publish or republish the page.  Today, I can trot on over to Webmaster Tools, spend a few minutes highlighting some text and press "publish."</p>
<p>Conceptually, I personally find the Data Highlighter fascinating because the tool replicates for webmasters what Google has been doing behind the scenes for many, many years:  making sense of unstructured and semi-structured data by uncovering consistent patterns on web pages.  Now everyday webmasters have the ability to "train" Google about the presence of consistently-formatted on their site.</p>
<p>Before I get to a walk-through of the Highlighter, a couple of quick initial thoughts about some of the Highlighter features and benefits.</p>
<ul>
<li>The Data Highlighter explicitly names required properties for events.  As schema.org is parser-agnostic, previously the only way webmasters could learn what properties Google required for any given data type was to run a page or some code through the Structured Data Testing Tool and try to decipher the resulting error messages.</li>
<li>While the event properties available in the Data Highlighter are clearly based on schema.org/Events it is apparently not a faithful one-to-one mapping (they say that the "information that Data Highlighter can extract for events is slightly different from the event data that you can specify with HTML markup").  Though in their enumeration of the differences between the two, the only thing that appears radically different is the date formats that Data Highlighter allows.  This makes sense as it's unlikely that many websites reliably publish dates on-page in ISO 8601 format (when encoded in microdata or RDFa date information is almost always placed in a non-displaying &lt;meta&gt; tag).</li>
<li>Knowledge Graph sources are being extended.  That is, in their video and documentation Google specifically references the Knowledge Graph, and even provides a graphic of how successfully-extracted event information will appear in a Knowledge Graph vertical.  This may not mean a change in Google's Knowledge Graph aim &#8211; in the <a title="Jon Mitchell's interview with Emily Moxley about the Knowledge Graph, 26 July 2012" href="http://readwrite.com/2012/07/26/how-google-organizes-the-world-qa-with-the-manager-of-knowledge-graph" target="_blank">words</a> of Google's <a title="Emily Moxley's Google+ profile" href="https://plus.google.com/113142037585587988915" target="_blank">Emily Moxley</a> &#8211; to "purposely try to show things that are definitely true," but it does represent a vast expansion of what sources Google might consider to be "definitely true."</li>
<li>From a trust and proof point of view, "data highlighting" may circumvent the ability of websites to game Google with inaccurate structured data.  While the tool allows for the addition of "missing data" it is still largely predicated on the highlighting of existing, visible content on a website (and one would certainly think that Google would have a higher confidence in highlighted, visible on-page data than that supplied through the "missing tags" interface).  Which may be why (to my point above) Google is willing to push Data Highlighter-supplied information to the Knowledge Graph.</li>
</ul>
<h3>Adding event information to using the Data Highlighter</h3>
<p>To access the data highlighter select "Data Highlighter" in the "Optimization" section of Google Webmaster Tools.</p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-13.jpg"><img class="aligncenter size-full wp-image-1312" title="Google Data Highlighter - Initial Webmaster Tools Interface" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-13.jpg" alt="Google Data Highlighter - Initial Webmaster Tools Interface" width="504" height="294" /></a></p>
<p>When you click on the big blue "Start Highlighting" button you're required to enter the URL of a page, and to select whether or not your want to tag just the page in question, or the page "and others like it."  Here I selected the single page option.</p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-12.jpg"><img class="aligncenter size-full wp-image-1314" title="Google Data Highlighter - URL Input and Tagging Options" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-12.jpg" alt="Google Data Highlighter - URL Input and Tagging Options" width="504" height="294" /></a></p>
<p>Once the page is loaded and a portion of text is highlighted, you assign it to a predefined tag (equivalent to a schema.org property).  As you'll see, I used what isn't really an event page per se (and it happened in the past), but had most of the elements of a "proper" event page.</p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-11.jpg"><img class="aligncenter size-full wp-image-1317" title="Google Data Highlighter - Associating On-Page Data with a Tag" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-11.jpg" alt="Google Data Highlighter - Associating On-Page Data with a Tag" width="504" height="294" /></a></p>
<p>Once a piece of information has been tagged, a label appears next to the highlighted text, and the tag content appears in a sidebar.  Sidebar items that Google regards as problematic appear with an "attention" icon next to them.</p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-10.jpg"><img class="aligncenter size-full wp-image-1319" title="Google Data Highlighter - Display Once Data Has Been Assigned to a Tag" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-10.jpg" alt="Google Data Highlighter - Display Once Data Has Been Assigned to a Tag" width="504" height="294" /></a></p>
<p>The Data Highlighter will allow data to be selected from any portion of the page.  Here I've selected my name to populate the "Performer" tag by using the anchor text of my linked WordPress author byline.</p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-5.jpg"><img class="aligncenter size-full wp-image-1322" title="Google Data Highlighter - Example Performer Data Selection" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-5.jpg" alt="Google Data Highlighter - Example Performer Data Selection" width="504" height="294" /></a></p>
<p>Data that isn't visually present on the page can be added to a tag by selecting "Add missing tags" from the settings icon at the upper right, and then following the prompts.</p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-4.jpg"><img class="aligncenter size-full wp-image-1325" title="Google Data Highlighter - Intitial Interface for Adding a Missing Tag" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-4.jpg" alt="Google Data Highlighter - Intitial Interface for Adding a Missing Tag" width="504" height="294" /></a></p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-3.jpg"><img class="aligncenter size-full wp-image-1327" title="Google Data Highlighter - Adding Missing Tag Data" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-3.jpg" alt="Google Data Highlighter - Adding Missing Tag Data" width="504" height="294" /></a></p>
<p>Once you click "Publish" information about the page or pages you've tagged using the Data Highlighter appear in the Data Highligther section of Webmaster Tools.  What data "will become available" once Google has recrawled the site remains to be seen (for example, whether or not successfully published and crawled Highlighter-entered data will appear in the Structured Data report).</p>
<p><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-15.jpg"><img class="aligncenter size-full wp-image-1329" title="Google Data Highlighter - Interface Showing Published Pages" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-15.jpg" alt="Google Data Highlighter - Interface Showing Published Pages" width="504" height="294" /></a></p>
<p>The interface for tagging multiple, pattern-based pages is similar, but includes a progress meter and additional steps (I haven't fully explored this yet).</p>
<p style="text-align: center;"><a href="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-2.jpg"><img class="aligncenter size-full wp-image-1331" title="Google Data Highlighter - Intitial Interface for Tagging Multiple, Pattern-Based Pages" src="http://www.seoskeptic.com/wp-content/uploads/2012/12/data-highlighter-t-2.jpg" alt="Google Data Highlighter - Intitial Interface for Tagging Multiple, Pattern-Based Pages" width="504" height="294" /></a></p>
<p>At first blush the highlighting interface is intuitive and seems to work well.  Now it's a waiting game to see when and if rich snippets appear for marked up events, how long it takes for rich snippets or Webmaster Tools data about an event to appear, and whether or not an event tagged through the Highlighter makes its way to the Knowledge Graph.</p>
<p>All of this unlikely with my example event that occurred in the past, but I have many other (and arguably more important) pages to tag with event properties.  I'll keep you posted.</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/google-data-highlighter-markup-free-structured-data-for-google/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/google-data-highlighter-markup-free-structured-data-for-google/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Fall of the Machine Part II: Google&#039;s &quot;Disavow Links&quot; Tool</title>
		<link>http://www.seoskeptic.com/the-fall-of-the-machine-part-ii-googles-disavow-links-tool/</link>
		<comments>http://www.seoskeptic.com/the-fall-of-the-machine-part-ii-googles-disavow-links-tool/#comments</comments>
		<pubDate>Tue, 16 Oct 2012 23:35:26 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Webmaster Tools]]></category>
		<category><![CDATA[Links and Link Building]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1252</guid>
		<description><![CDATA[Google has released a tool which allows webmasters to disavow links pointing to their site. Humans again have kicked the machine.]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/the-fall-of-the-machine-part-ii-googles-disavow-links-tool/" title="Permanent link to Fall of the Machine Part II: Google's "Disavow Links" Tool"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/disavow-links.png" width="125" height="125" alt="Fall of the Machine Part II - The Disavow Links Tool from Google" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/the-fall-of-the-machine-part-ii-googles-disavow-links-tool/"></g:plusone></div><p>In May of 2009 I wrote <a title="Paid Link Reporting and the Fall of the Machine" href="http://www.seoskeptic.com/paid-link-reporting-and-the-fall-of-the-machine/">a post on Google's paid link reporting mechanism</a>, opining that the introduction of a manual link review process undermined Google's traditional and successful reliance on machine-based algorithms to rank web pages.</p>
<p>Fast forward to 16 October 2012, when suddenly <a href="http://searchengineland.com/google-launches-disavow-links-tool-136826" target="_blank">the</a> <a href="http://searchenginewatch.com/article/2217602/Google-Disavow-Links-Tool-Now-Available" target="_blank">airwaves</a> <a href="http://www.seobodybuilder.com/seo/googles-new-disavow-link-tool/" target="_blank">are</a> <a href="http://www.webpronews.com/googles-link-disavow-google-answers-domain-related-questions-2012-10" target="_blank">buzzing</a> with Matt Cutts' announcement of Google's "<a title="The Official Google Webmaster Central Blog Post on the Disavow Links Tool" href="http://googlewebmastercentral.blogspot.ca/2012/10/a-new-tool-to-disavow-links.html" target="_blank">disavow links tool</a>" at PubCon.</p>
<p>To a certain extent this is a déjà vu moment. Like the paid links reporting tool it is at once an admission that Google's mighty machines aren't up to the task of assessing the native value of hyperlinks without some degree of human intervention. And as with paid link reporting, it also "formally makes the SEO industry necessary" as an auxiliary force to police Google's <a href="http://support.google.com/webmasters/bin/answer.py?hl=en&amp;answer=35769" target="_blank">webmaster guidelines</a>.<br />
<span id="more-1252"></span><br />
However, the possible necessity of link disavowal seems more nefarious to me in two respects.</p>
<p>First, while paid link reporting ultimately put the onus on those abusing Google's reliance on PageRank to clean up their acts (at least those identified through the paid link reporting tool), link disavowal means puts webmasters on notice that they're now on the hook for behavior by others <em>over which they have no control</em>. To use a legal analogy, the presumption of innocence is no more.</p>
<p>And a mechanism for reporting "bad links" does not place webmasters in control, except insofar as its an explicit admission that competitors <em>can</em> influence your search visibility (simultaneously placing a nail in the coffin of the "is there such a thing as 'negative SEO' debate &#8211; cased closed). Yes, a webmaster now can combat a competitor's malfeasance &#8211; <em>if</em> they opt into Google Webmaster Tools so they may receive "unnatural link" notifications, and <em>if</em> they have the time, knowledge and motivation to disavow linkspam.</p>
<p>(Google's announcement sports a marvelously ambiguous adverb in it's second sentence. "If you haven't gotten this ['unnatural links'] notification, this tool <em>generally</em> isn't something you need to worry about." Emphasis mine.)</p>
<p>In short, where one could previously garner something like a competitive advantage by <em>optionally</em> ratting out nefarious competitors, now one is seemingly <em>required</em> to protect one's reputation for purity by formally distancing oneself from slanderers.</p>
<p>Which leads me to my second point &#8211; namely that this sort of esoteric search engine optimization activity favors sites with bigger budgets and well-informed marketers. A small business with a site that offers "great content" to visitors ("great content" having become almost a sacred mantra in Matt Cutts' missives) may now, apparently, fall prey to the evil intentions of unscrupulous competitors unless they employ professional help to keep the vampires at bay.</p>
<p>To a large extent much of the concern over "negative SEO" is probably a tempest in a teapot, and really most webmasters shouldn't worry their pretty little heads over "unnatural links." As the announcement post says, the "<em>[v]ast, vast majority</em> of sites do not need to use this tool," that's "<em>primary</em> purpose is to help clean up if you've hired a bad SEO or made mistakes in your own link-building" and that one "<em>[t]ypically</em>" does not have to clean up links that a webmaster didn't create. Still, the profusion of qualifiers employed (emphasis, again, mine) leaves Google a lot of wiggle room to disavow their own advice.</p>
<p>The fact of the matter is that bad links, like good content, ultimately have a human behind the wheel. And one thing I've always considered a primary reason for Google's success and strategic brilliance is to remove humans from the process of ranking websites (keep in mind, for example, that the primary purpose of Google's "quality raters" is not to circumvent the algorithm, but to help hone it). There are probably in the neighborhood of <a href="http://www.worldwidewebsize.com/" target="_blank">40 to 50 billion pages</a> in Google's index: any significant human intercessory process to determine the relative value of these pages obviously doesn't scale.</p>
<p>So &#8211; to me at least &#8211; further crowd-sourcing of Google quality guidelines enforcement further undermines the foundation of their algorithmic success. All of which may be simply to say that the age of the link-as-an-impartial-vote may soon come to an end (link fraud, unlike voter fraud, is an <em>actual</em> problem). And maybe what rises from the ashes may once again allow Google to return to its mechanical roots, at least until pesky humans again upset the applecart.</p>
<p>Ready to disavow your social connections, y'all?</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/the-fall-of-the-machine-part-ii-googles-disavow-links-tool/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/the-fall-of-the-machine-part-ii-googles-disavow-links-tool/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The New Structured Data Report in Google Webmaster Tools</title>
		<link>http://www.seoskeptic.com/the-new-structured-data-report-in-google-webmaster-tools/</link>
		<comments>http://www.seoskeptic.com/the-new-structured-data-report-in-google-webmaster-tools/#comments</comments>
		<pubDate>Wed, 01 Aug 2012 21:06:48 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Webmaster Tools]]></category>
		<category><![CDATA[Structured Markup]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1225</guid>
		<description><![CDATA[Google has just announced a welcome new addition to the information available in Webmaster Tools:  the Structured Data Report. While Google has long been consuming structured data markup, until now there's been no mechanism to verify that Google had actually consumed the structured markup that webmasters had been providing.  With the Structured Data Report, Google [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/the-new-structured-data-report-in-google-webmaster-tools/" title="Permanent link to The New Structured Data Report in Google Webmaster Tools"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/google-webmaster-tools-structured-data.png" width="125" height="125" alt="Google Webmaster Tools Structured Data Report" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/the-new-structured-data-report-in-google-webmaster-tools/"></g:plusone></div><p>Google has <a title="Official Google Post on the Structured Data Report" href="http://googlewebmastercentral.blogspot.ca/2012/07/introducing-structured-data-dashboard.html" target="_blank">just announced</a> a welcome new addition to the information available in Webmaster Tools:  the Structured Data Report.</p>
<p>While Google has long been consuming structured data markup, until now there's been no mechanism to verify that Google had actually consumed the structured markup that webmasters had been providing.  With the Structured Data Report, Google is now reports on key pieces of information about structured data present on a website:</p>
<ol>
<li>The type and corresponding schema of structured data Google has discovered on a website.</li>
<li>The total number of occurrences of that structured data type Google has discovered.</li>
<li>The total number of pages containing each type listed.</li>
<li>The specific pages on which each structured data type is present.</li>
<li>Information about the properties present on each page for each structured data type.</li>
</ol>
<p><span id="more-1225"></span></p>
<p><strong>Structured Data Type, Schema, Items and Pages</strong></p>
<p>The main report gives an overview of structured data for a given domain, and shows the presence of different items and pages over time (all Google Webmaster Tools reports are live data from <a title="SEO Cooking" href="http://seocooking.com/" target="_blank">seocooking.com</a>).</p>
<div id="attachment_1228" class="wp-caption alignright" style="width: 516px">
	<img class="size-full wp-image-1228" title="Google Webmaster Tools Structured Data Overview Report" src="http://www.seoskeptic.com/wp-content/uploads/2012/08/main-structured-data-report.png" alt="Google Webmaster Tools Structured Data Overview Report" width="516" height="417"></a></p>
<p class="wp-caption-text">Google Webmaster Tools Structured Data Overview Report</p>
</div>
<p>One of the interesting things about this overview report is that webmasters will be able to see the lag time (if any) between page indexing and the appearance of structured data on a page as reported by Webmaster Tools.  I haven't yet created new content containing structured data, but I look forward to finding out whether the indexing of structured data occurs at the same time as page indexing, or if structured data is separately indexed.</p>
<p>While this is likely simultaneous, one will nonetheless be able to see how long it takes Google to index structured data once it has been added to a page, and whether or not the addition or removal of structured data without visible content changes is picked up by Google.</p>
<p>While downloads are always nice, the main utility I see for the data export is to calculate the data type distribution &#8211; really only useful for sites with a large number of types and pages.</p>
<p>Note that where there are a <em>greater number of items</em> reported than the <em>total number of pages</em> for that structured data type, this is because one or more pages contain <em>multiple instances</em> of that type.</p>
<p><strong>Information about Specific Structured Data Types</strong></p>
<p>Clicking on a type from the overview report will display a report showing the pages that contain that structured data type.  As noted above, pages containing more than one item contain multiple instances of that item.  The report displayed below is the result of clicking on the linked "VideoObject" type in the overview report displayed above.</p>
<div id="attachment_1229" class="wp-caption alignright" style="width: 516px">
	<img class="size-full wp-image-1229" title="Google Webmaster Tools Structured Data Report for a Specific Type" src="http://www.seoskeptic.com/wp-content/uploads/2012/08/type-structured-data-report.png" alt="Google Webmaster Tools Structured Data Report for a Specific Type" width="516" height="424"></a></p>
<p class="wp-caption-text">Google Webmaster Tools Structured Data Report for a Specific Type</p>
</div>
<p>Like the overview report, the type report shows the instances of that type recorded over time.  It also displays the specific pages on which that structured data appears and the number of occurrences over time.</p>
<p>Here the data download is potentially more useful, as a webmaster can compare the pages on which he or she thinks that typeshould be present against pages Google actually acknowledges containing that type.</p>
<p>It also alerts webmasters to the presence of structured data they may not have been aware that they were publishing.  Many WordPress site owners, for example, will be surprised to see that the platform and certain plugins are producing structured data of their own accord.</p>
<p>Clicking on a specific page will then provide details of the properties present for that structured data type, as well as a link to the Rich Snippets Testing tool (that provides that same information in a different format, as well as warnings about malformed syntax or invalid or missing properties).</p>
<div id="attachment_1231" class="wp-caption alignright" style="width: 516px">
	<a target="_blank" href="http://seocooking.com/how-to-chop-an-onion/" title="How to Chop an Onion - Page with VideoObject Markup"><img class="size-full wp-image-1231" title="Structured Data Report for a Specific Type on a Specific Page" src="http://www.seoskeptic.com/wp-content/uploads/2012/08/structured-data-page-report.png" alt="Structured Data Report for a Specific Type on a Specific Page" width="516" height="513"></a></p>
<p class="wp-caption-text">Structured Data Report for a Specific Type on a Specific Page</p>
</div>
<p>Interestingly, Open Graph properties for this page are appearing under the VideoObject itemtype for this page, even if those these properties are not a part of this schema, and are fully separated in the markup of the page (the do not appeared so grouped in the results of the Rich Snippets Testing Tool.</p>
<div id="attachment_1233" class="wp-caption alignright" style="width: 516px">
	<img class="size-full wp-image-1233" title="Open Graph and schema.org/VideoObject Properties on the Referenced Page" src="http://www.seoskeptic.com/wp-content/uploads/2012/08/open-graph-and-schema-tags.png" alt="Open Graph and schema.org/VideoObject Properties on the Referenced Page" width="516" height="362"></p>
<p class="wp-caption-text">Open Graph and schema.org/VideoObject Properties on the Referenced Page</p>
</div>
<p>This doesn't make a lot of sense to me, but I'll dig deeper to see whether this is a syntax problem on the publishing side or a bug on Google's reporting side.</p>
<p>In any case, I just wanted to provide a quick, first-blush overview of a new Webmaster Tool report that's obviously useful to webmasters and marketers, like me, that have been actively adding structured data markup to websites.  I welcome any thoughts or observations from others active in the semantic web as it pertains to search engines.</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/the-new-structured-data-report-in-google-webmaster-tools/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/the-new-structured-data-report-in-google-webmaster-tools/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>The Three Cardinal Sins of Tweet Button Implementations</title>
		<link>http://www.seoskeptic.com/the-three-cardinal-sins-of-tweet-button-implementations/</link>
		<comments>http://www.seoskeptic.com/the-three-cardinal-sins-of-tweet-button-implementations/#comments</comments>
		<pubDate>Tue, 29 May 2012 23:01:05 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[WordPress]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1191</guid>
		<description><![CDATA[The tweet button allows users to tweet a link to a page with a single click, and is one of the most ubiquitous social sharing buttons on the web.  This site uses it, your favorite sites almost certainly carry it, and if your mom has a blog you'll probably find a button there sporting a [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/the-three-cardinal-sins-of-tweet-button-implementations/" title="Permanent link to The Three Cardinal Sins of Tweet Button Implementations"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/tweet-button-sins.jpg" width="125" height="125" alt="Three Cardinal Sins of Tweet Button Implementations" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/the-three-cardinal-sins-of-tweet-button-implementations/"></g:plusone></div><p>The tweet button allows users to tweet a link to a page with a single click, and is one of the most ubiquitous social sharing buttons on the web.  This site uses it, your favorite sites almost certainly carry it, and if your mom has a blog you'll probably find a button there sporting a picture of a little bird with the word "Tweet" next to it.</p>
<p>As easy as it is to add the tweet button to a website, most users do not avail themselves of the full functionality that the tweet button offers.  By making three relatively simple implementation tweaks you can create more engaging tweets, improve tracking, provide better exposure for your Twitter username and increase the number of your followers.</p>
<p><span id="more-1191"></span></p>
<p>In the examples below I mostly reference the <a title="WP Tweet Button plugin for WordPress" href="http://wordpress.org/extend/plugins/wp-tweet-button/" target="_blank">WP Tweet Button plugin</a> for WordPress, as it is the most commonly-used tweet button utility for the most commonly-used blog platform.  However, there are other plugins available for other platforms, and Twitter itself offers both <a title="tweet button configurator on Twitter" href="https://twitter.com/about/resources/buttons#tweet" target="_blank">a button configurator</a> and detailed <a title="tweet button developer page on Twitter" href="https://dev.twitter.com/docs/tweet-button" target="_blank">tweet button documentation</a> for developers.</p>
<p>Proper configuration of the tweet button is straightforward regardless of the method you use, and the best practices described below should not be difficult to implement.  By following the guidelines below you'll get more mileage out of your button-based tweets, and find yourself unsullied by any of the three cardinal sins of tweet button implementations!</p>
<h3>Sin #1:  Failing to Shorten the URL</h3>
<p>The primary advantage of tweet button URL shortening is that much less of the 140 characters Twitter allows is eaten up by the web page address, freeing it up for use by the tweeter to add comments, and for you to add a Twitter @name to the end of the tweet (discussed under sin #2).  It also renders the resulting tweet easier to read, which may increase both the number of clicks on the link and the number of retweets.</p>
<p>For a (fairly standard) WordPress site like mine, which uses dash-separated words in the title to generate a post file name, a long title generates an equally long URL.  Longer URLs are also a consequence, of course, of sites with long domain names.</p>
<p>An out-of-the-box, unmodified tweet button implementation on WordPress using the plugin produces share windows like this when the tweet button is clicked.</p>
<p style="text-align: center;"><img class="size-full wp-image-1197 aligncenter" title="Unmodified Tweet Button Output for a Long Post Title" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/share-long.jpg" alt="Unmodified Tweet Button Output for a Long Post Title" width="543" height="187" /></p>
<p> An even longer title will result in truncation of that post's title.</p>
<p><img class="aligncenter size-full wp-image-1198" title="Unmodified Tweet Button Output for an Extremely Long Post Title" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/share-very-long.jpg" alt="Unmodified Tweet Button Output for an Extremely Long Post Title" width="543" height="187" /></p>
<p>The tweet button does do some URL truncation of its own accord &#8211; you may have observed that for both those titles the output exceeds Twitter's 140-character limit (the first tweet is actually 243 characters long!).  However, this still leaves little space for the addition of any commentary from the person posting the tweet, and the fact that the visible area of the share window is occupied with the post title and URL in any case discourages such additions.  And one way or another, when the tweet is posted a large amount of real estate is necessarily given over simply to the URL.</p>
<p><img class="aligncenter size-full wp-image-1200" title="Posted Tweet for a Long Post Title and Non-Shortened URL" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/share-long-output.jpg" alt="Posted Tweet for a Long Post Title and Non-Shortened URL" width="449" height="66" /></p>
<p>Sin not!  Shorten your URLs!  Using the plugin, this is readily accomplished by selecting one of the available shortener options.</p>
<p><img class="aligncenter size-full wp-image-1201" title="WP Tweet Button Shortener Options" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/plugin-shortener-options.jpg" alt="WP Tweet Button Shortener Options" width="516" height="385" /></p>
<p>Some of these shorteners may be used as is, whereas some shortening services require an API, for which credentials may be entered in the appropriate fields in the plugin's settings (for the sake of conciseness &#8211; here, as elsewhere in this post &#8211; additional plugin options are not displayed or discussed in detail).</p>
<p>Using a shortening service linked to your account is recommended, as the service (such as <a title="bit.ly Shortening Service" href="https://bitly.com/">bit.ly</a>, my quick and easy shortener of choice) may provide you with aggregated metrics about links you've shortened.  But using any shortener is better than not using one at all, as you can see from the share window output once a shortener has been enabled.</p>
<p><img class="aligncenter size-full wp-image-1203" title="Tweet Button Output for a Long Post Title with Shortened URL" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/share-shortened.jpg" alt="Tweet Button Output for a Long Post Title with Shortened URL" width="543" height="187" /></p>
<p>The would-be tweeter has more space to add their contribution (if only conceptually), and the tweet is now more fully focused around the more important of its two elements:  the post title.</p>
<p>Of course, few SEOs regularly create post titles 103 characters in length.  A properly length-optimized title (i.e. of 70 characters or less) makes for a very tidy share snippet indeed.</p>
<p><img class="aligncenter size-full wp-image-1204" title="Tweet Button Output for an Optimized Post Title with Shortened URL" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/share-shortened-optimized.jpg" alt="Tweet Button Output for an Optimized Post Title with Shortened URL" width="543" height="187" /></p>
<p>Hey, who wrote that post, anyway?</p>
<h3>Sin #2:  Failing to Attribute the Tweet to a User</h3>
<p>When I tweet a link to an article, I go out of my way find out if the author has a twitter account so I can attribute the piece to its creator ("I Head a Fly Buzz &#8211; poet.com/fly/ @emilydickinson").  I am the exception:  users of tweet buttons tend to accept the defaults, and if your @account doesn't appear in the share window its unlikely to make its way into the final tweet.</p>
<p>Attribution has some obvious benefits.  Whether by clicking on Twitter's "Connect" link or being alerted of @mentions by utilities like TweetDeck, you can more readily see activity on something you've created.  And both corporate and individual creators of content see branding and name-recognition benefits of having an @account more frequently exposed.</p>
<p>Like shortening, adding a user attribution to a tweet is dead simple using the WP Tweet Button plugin (and also dead simple when using Twitter's tweet button configurator).  Just add your user name in the appropriate spot in the plugin's settings panel.</p>
<p><img class="aligncenter size-full wp-image-1205" title="Adding a Default Twitter Account in WP Tweet Button" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/default-twitter-username.jpg" alt="Adding a Default Twitter Account in WP Tweet Button" width="507" height="125" /></p>
<p>By specifying my username in this box, now all tweets are appended "via @aaranged".</p>
<p><img class="aligncenter size-full wp-image-1206" title="Tweet Button Output with a Default Twitter User Specified" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/share-shortened-attribution.jpg" alt="Tweet Button Output with a Default Twitter User Specified" width="543" height="187" /></p>
<p>Again, there are more plugin options available for how usernames can be configured, and developers can modify the tweet button code to accommodate @user accounts in complex ways (" &#8230; by @neptune from @romangods").  I will note, however, that for WordPress sites it's possible to leverage author accounts to automatically populate posts with the right @user, and to override the default attribution on a post-by-post basis.</p>
<p>Sweet, eh?  Now if you only had more followers&#8230;.</p>
<h3>Sin #3:  Failing to Recommend Users to Follow</h3>
<p>You've shortened your URL.  You've attributed the link to a particular Twitter user.  Your loyal reader, enamored of your prose, readily clicks the tweet button on your post, accepts the default content of the share box and clicks the final "tweet" button there.</p>
<p><img class="aligncenter size-full wp-image-1209" title="Tweet Notification without Recommended User" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/tweeted-without-suggestions.jpg" alt="Tweet Notification without Recommended User" width="543" height="204" /></p>
<p>But unless that user already follows you that's not the end of the story, but rather a missed opportunity.</p>
<p>The tweet button allows you to specify up to two users that will be recommended to the tweeter after their tweet has been sent.  The benefit should be obvious:  it's a contextually relevant call-to-action that can help you acquire new followers.</p>
<p>Yet again, configuring a recommended user is straightforward.  For the WP Tweet Button plugin simply enter the account name of the user you wish to recommend (without going into details, I find the description field a little problematic &#8211; if you have a decent Twitter bio this should more than suffice as an enticement).</p>
<p><img class="aligncenter size-full wp-image-1210" title="Specifying a Default Recommended Twitter User Using WP Tweet Button" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/default-recommended-user.jpg" alt="Specifying a Default Recommended Twitter User Using WP Tweet Button" width="469" height="231" /></p>
<p>When a default recommended Twitter user is specified, sharers will be invited to follow this user after they send their tweet (unless they follow the recommended user already).</p>
<p><img class="aligncenter size-full wp-image-1211" title="Tweet Notification with a User Recommendation" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/tweeted-with-suggestion.jpg" alt="Tweet Notification with a User Recommendation" width="543" height="242" /></p>
<p>As with @attribution, there are other plugin options for user recommendations, including deriving the suggestion from the Twitter account associated with an author.  At the post level, an <em>additional</em> user recommendation can be entered.</p>
<p><img class="aligncenter size-full wp-image-1212" title="Specifying a Recommended Twitter User at the Post Level" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/post-added-recommended-user.jpg" alt="Specifying a Recommended Twitter User at the Post Level" width="286" height="213" /></p>
<p>If the tweet button user follows neither the default user nor the one specified at the post level, both will be recommended after the tweet is sent.</p>
<p><img class="aligncenter size-full wp-image-1213" title="Tweet Notification with a Two User Recommendations" src="http://www.seoskeptic.com/wp-content/uploads/2012/05/tweeted-with-suggestions.jpg" alt="Tweet Notification with a Two User Recommendations" width="543" height="329" /></p>
<p>That's all there is to it.  For the WordPress plugin configuring URLs to be shortened, assigning a default Twitter user to generate @user notices and assigning a default recommended Twitter to follow takes only a few minutes, yet the majority of publishers sin by neglecting one or more of these simple steps.  As I hope you can see, if you're one of these sinners repentance is readily at hand.</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/the-three-cardinal-sins-of-tweet-button-implementations/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/the-three-cardinal-sins-of-tweet-button-implementations/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Getting Started with Structured Markup for SEO</title>
		<link>http://www.seoskeptic.com/getting-started-with-structured-markup-for-seo/</link>
		<comments>http://www.seoskeptic.com/getting-started-with-structured-markup-for-seo/#comments</comments>
		<pubDate>Fri, 27 Apr 2012 00:20:43 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Microdata]]></category>
		<category><![CDATA[Microformats]]></category>
		<category><![CDATA[RDFa]]></category>
		<category><![CDATA[Rich Snippets]]></category>
		<category><![CDATA[Structured Markup]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1177</guid>
		<description><![CDATA[My SMX Toronto presentation on using structured markup for search engine optimization. Includes a brief overview of semantic web principles, a comparison of the different flavors of structured markup (microformats, microdata and RDFa) and examples of rich snippets generated from structured markup. Because of its obvious relevance to SEO, schema.org is the primary vocabulary discussed, [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/getting-started-with-structured-markup-for-seo/" title="Permanent link to Getting Started with Structured Markup for SEO"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/getting-started-with-structured-markup-for-seo.jpg" width="125" height="125" alt="Getting Started with Structured Markup for SEO" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/getting-started-with-structured-markup-for-seo/"></g:plusone></div><p>My SMX Toronto presentation on using structured markup for search engine optimization.</p>
<p>Includes a brief overview of semantic web principles, a comparison of the different flavors of structured markup (microformats, microdata and RDFa) and examples of rich snippets generated from structured markup.<br />
<span id="more-1177"></span><br />
Because of its obvious relevance to SEO, schema.org is the primary<br />
vocabulary discussed, although the importance of GoodRelations for ecommerce sites is also acknowledged.</p>
<p>See the section below the embedded presentation for a brief addendum.</p>
<div style="width:425px" id="__ss_12707472"><iframe src="http://www.slideshare.net/slideshow/embed_code/12707472" width="425" height="355" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe></div>
<p>&nbsp;<br />
I wasn't able to squeeze these into the presentation list of relevant people to follow on Twitter, but here are some great search marketers that sometimes tweet about schema.org and related topics:</p>
<ul>
<li><a href="http://twitter.com/#!/ajkohn" target="_blank">AJ Kohn</a></li>
<li><a href="http://twitter.com/#!/DocSheldon" target="_blank">Doc Sheldon</a></li>
<li><a href="http://twitter.com/#!/1918" target="_blank">Phil Buckley</a></li>
<li><a href="http://twitter.com/#!/anilopez" target="_blank">Ani Lopez</a></li>
<li><a href="http://twitter.com/#!/thegypsy" target="_blank">David Harry</a></li>
</ul>
<p>And here's the main list from the presentation, reiterated here because somehow SlideShare has pointed all these links simply to twitter.com (Twitter's damnable implementation of rel="canonical" perhaps?):</p>
<ul>
<li><a href="http://twitter.com/#!/danbri" target="_blank">Dan Brickley</a><br /><em>Currently under contract from Google to oversee schema.org matters</em></li>
<li><a href="http://twitter.com/#!/mfhepp" target="_blank">Martin Hepp</a><br /><em>The irrepressibly enthusiastic GoodRelations lead</em></li>
<li><a href="http://twitter.com/#!/jaymyers" target="_blank">Jay Myers</a><br /><em>Web architect for Best Buy, responsible for the first major implementation of RDFa in an ecommerce enviroment</em></li>
<li><a href="http://twitter.com/#!/MatthewJBrown" target="_blank">Matthew Brown</a><br /><em>SEO and news optimization expert at AudienceWise, previously Director of Search Strategy at the New York Times</em></li>
<li><a href="http://twitter.com/#!/manusporny" target="_blank">Manu Sporny</a><br /><em>RDFa/RDF WebApps chair at W3C and one of the world's key semantic web technologists</em></li>
<li><a href="http://twitter.com/#!/gkellogg" target="_blank">Gregg Kellogg</a><br /><em>Semantic media consultant and a part of the W3C's RDF Web Applications Working Group</em></li>
</ul>
<p>Finally, in determining what sort of sites and topics might experience search visibility benefits by employing structured markup, I'd like to stress these two broad categories:</p>
<ol>
<li>Any and all entities (people, places and organizations)</li>
<li>Anything relevant to the vending of goods and services, from online stores to brick-and-mortar local businesses</li>
</ol>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/getting-started-with-structured-markup-for-seo/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/getting-started-with-structured-markup-for-seo/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Agile SEO for Google</title>
		<link>http://www.seoskeptic.com/agile-seo-for-google/</link>
		<comments>http://www.seoskeptic.com/agile-seo-for-google/#comments</comments>
		<pubDate>Thu, 26 Apr 2012 22:08:56 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Search Engine Algorithms]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1168</guid>
		<description><![CDATA[This SMX Toronto presentation explores how an agile approach to SEO can be used to address algorithmic and other changes made by Google on an ongoing basis. In particular, "agile SEO" facilitates: The identification of opportunities that Google changes might represent. Responding in an effective way to big decreases or increases that come about as [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/agile-seo-for-google/" title="Permanent link to Agile SEO for Google"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/agile-seo-for-google.jpg" width="125" height="125" alt="Agile SEO for Google" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/agile-seo-for-google/"></g:plusone></div><p>This SMX Toronto presentation explores how an agile approach to SEO can be used to address algorithmic and other changes made by Google on an ongoing basis.</p>
<p>In particular, "agile SEO" facilitates:</p>
<ul>
<li>The identification of opportunities that Google changes might represent.</li>
<li>Responding in an effective way to big decreases or increases that come about as a result of Google algorithm changes.</li>
<li>Anticipating what changes Google might make in the future.</li>
</ul>
<p><span id="more-1168"></span><br />
I've used "agile" here in the everyday sense of "nimble."  I know this may result in some confusion with the term "agile development," which has quite a different meaning, but it really was the right word for what I wanted to convey.</p>
<div style="width:425px" id="__ss_12707399"><iframe src="http://www.slideshare.net/slideshow/embed_code/12707399" width="425" height="355" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe></div>
<p>&nbsp;</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/agile-seo-for-google/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/agile-seo-for-google/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Deciphering Google&#039;s &quot;Semantic Search&quot; Intentions</title>
		<link>http://www.seoskeptic.com/deciphering-googles-semantic-search-intentions/</link>
		<comments>http://www.seoskeptic.com/deciphering-googles-semantic-search-intentions/#comments</comments>
		<pubDate>Mon, 19 Mar 2012 14:16:39 +0000</pubDate>
		<dc:creator>Aaron Bradley</dc:creator>
				<category><![CDATA[Search Engines]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Entities]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Provenance]]></category>

		<guid isPermaLink="false">http://www.seoskeptic.com/?p=1136</guid>
		<description><![CDATA[In a Wall St. Journal article published recently, Google's Amit Singhal suggested that changes were afoot at the world's dominant search engine. While the Journal piece might overstate the magnitude of possible changes ahead (it is article author Amir Efrati who calls this "a makeover" to Google's formula, not Singhal himself) and while much of [...]]]></description>
			<content:encoded><![CDATA[<p><a class="post_image_link" href="http://www.seoskeptic.com/deciphering-googles-semantic-search-intentions/" title="Permanent link to Deciphering Google's "Semantic Search" Intentions"><img class="post_image alignleft" src="http://www.seoskeptic.com/wp-content/uploads/legacy/post-images/google-semantic-search.jpg" width="125" height="125" alt="Deciphering Google's Semantic Search Intentions" /></a>
</p><div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/deciphering-googles-semantic-search-intentions/"></g:plusone></div><p>In a <a href="http://online.wsj.com/article_email/SB10001424052702304459804577281842851136290-lMyQjAxMTAyMDEwNDExNDQyWj.html" target="_blank"><em>Wall St. Journal</em> article</a> published recently, Google's Amit Singhal suggested that changes were afoot at the world's dominant search engine.</p>
<p>While the <em>Journal</em> piece might overstate the magnitude of possible changes ahead (it is article author Amir Efrati who calls this "a makeover" to Google's formula, not Singhal himself) and while much of the substance of what Singhal is reported as having said is not new, there are indications that there is at least a substantial retooling of Google's search technology underway, and that the nature of these changes are related to Google's embrace of semantic web technologies.  As Efrati puts it, "the company is aiming to provide more relevant results by incorporating technology called 'semantic search,' which refers to the process of understanding the actual meaning of words."</p>
<p>What shape can we expect the nature of that retooling to take?  What changes might Google make to both better utilize semantic technologies and encourage their use by webmasters?  And what are the implications for search marketers and the SEO industry?</p>
<p><span id="more-1136"></span></p>
<h3>Provenance, Please!</h3>
<p>In determining the relevance of thousands of resources, one of the most important things Google does is weeding out maliciously irrelevant resources &#8211; spam detection and filtering.  By "spam" I'm talking the whole gambit, from pages that deliberately misrepresent themselves (e.g. a page built to match the query "golf clubs" that redirects the user to an online poker site) to pages that try to exaggerate their actual relevance (e.g. an on-topic but keyword-stuffed page on golf clubs).  Google needs pick the gems out of the goo, and this procedure is still a requirement when it comes to assessing the value of structured data.</p>
<p>You won't find much discussion about spam in the semantic web world.  It's not that the semantic web framework does not account for the necessity to validate the veracity of documents, but that it takes a different tack.  At the top of the classic semantic web "layer cake" lays the "trust and proof" layer.  The chief mechanisms being worked on in this layer surround issues of provenance:  the origin and chronology of a document.</p>
<p>The W3C <a href="http://www.w3.org/2011/prov/wiki/Main_Page" target="_blank">Provenance Working Group</a> recently published the third working draft of its <a title="http://www.w3.org/TR/prov-dm/" href="http://" target="_blank">provenance data model</a>; the introductory paragraph of this document provides an excellent overview of provenance issues:</p>
<blockquote><p>For the purpose of this specification, provenance is defined as a record that describes the people, institutions, entities, and activities, involved in producing, influencing, or delivering a piece of data or a thing in the world. In particular, the provenance of information is crucial in deciding whether information is to be trusted, how it should be integrated with other diverse information sources, and how to give credit to its originators when reusing it. In an open and inclusive environment such as the Web, users find information that is often contradictory or questionable: provenance can help those users to make trust judgments.</p></blockquote>
<p>(See an earlier post on "<a href="http://www.seoskeptic.com/open-linked-data-discovery-proof-and-trust/" target="_blank">trust and proof</a>" if you're dying to know more about approaches to this topic in the semantic web world.)</p>
<p>So how is this relevant to Google and its "semantic search" intentions?</p>
<p>As stated above, Google is very much in the trust and proof business, and if provenance proves to be a useful method to help Google determine to what extent resources can be trusted, and to verify the source of those resources, they'll add provenance mechanisms to their toolkit.</p>
<p>Google has previously stuck its toe in the provenance water in the form of the Google News meta tags <em>original-source</em> and <em>syndication-source</em>.  Interestingly, Google News seems to have (without fanfare) dropped support for <em>original-source</em> (it is detailed in the <a href="http://googlenewsblog.blogspot.ca/2010/11/credit-where-credit-is-due.html" target="_blank">original announcement</a>, but no long appears on the <a href="http://support.google.com/news/publisher/bin/answer.py?hl=en&amp;answer=191283" target="_blank">publisher metadata</a> tags help page, having been "replaced" by the more limited <em>standout</em> tag).  That <em>original-source</em> was a bust makes sense, as it was a unilateral publisher declaration where publishers could readily unilaterally lie (and continued support for <em>syndication-source</em> makes sense in this context, as there's limited benefit to lying about <em>not</em> being the original source of a news story).</p>
<p>Not so with Google's big entrance into the world of provenance:  <em>rel="author"</em> and <em>rel="publisher</em> tags, combined with Google+ profile pages.</p>
<p>What does Google have to say about your author profile (emphasis mine)?</p>
<blockquote><p>A rich profile is not only a great way to share information with users, but it also <em>gives Google information we need to better identify you as the author of web content</em>.</p></blockquote>
<p>Your publisher profile (emphasis mine)?</p>
<blockquote><p>Linking your Google+ page and your site like this not only helps you build relationships with friends and followers, but also <em>gives Google information we can use to determine the relevancy of your site to a user query in Google Web Search</em>.</p></blockquote>
<p>The combination of a meta tag and a profile page provides Google with something that unilateral publisher meta tags cannot:  an identity verification mechanism.  Both <a href="http://support.google.com/webmasters/bin/answer.py?hl=en&amp;answer=1229920" target="_blank">author</a> and <a href="http://support.google.com/webmasters/bin/answer.py?hl=en&amp;answer=1708844" target="_blank">publisher</a> markup use Google+ profiles to validate the identity of a publisher, enabling them to establish provenance for resources linked to verified sources.  (In all the hoopla about Google's failure or success so far to establish Google+ as a social network, it's easy to overlook how the introduction of Google+ profile pages is an important structural improvement to previous Google profile pages.  There's now value for authors and publishers to have a correctly-linked Google+ page, even if Google+ profile owners never create or consume a Google+ post).</p>
<p>If Google's "semantic search" plans include facilitating the propagation of more structured data, whether by encouraging structured markup or by other mechanisms, it makes sense that they're going to continue to lean on provenance measures to help make sense of it.  Being able to determine the provenance of resources makes it easier to produce something akin to a "author graph" where queries return pages based not just on the authority of the page or site, but the authority of the author as well.</p>
<p>Not just what you say, but who you are, will almost certainly play an increasingly important role in the search results.</p>
<h3>Vocabularies Big and Small</h3>
<p>In the biggest move to date by the search engines to facilitate a more semantic web, Google, Bing and Yahoo did two things when they unveiled schema.org:  they stipulated a preferred markup specification for structured data (microdata) and provided a vocabulary to use with this markup (<a href="http://schema.org/" target="_blank">schema.org</a>).</p>
<p>The bulk of analysis surrounding the introduction of schema.org has focused on the markup standard endorsed by Google, and in particular on the relative merits of microdata compared to microformats and RDFa.  From a practical perspective I think the schema.org vocabulary itself is of more importance to publishers, and probably to Google too.  Publishers will obviously produce more uniform &#8211; and so more readily-digestible &#8211; structured markup if they're provided with specific properties to apply to specific types of things.  And the more extensive that vocabulary, the more it allows for a greater volume and greater topical breadth of structured data.</p>
<p>Right now, the schema.org vocabulary is extremely useful marking up three types of data:</p>
<ol>
<li>Named entities (people, places, organizations, etc.)</li>
<li>Media (information about types of web pages, images, videos, etc.)</li>
<li>Things that are bought and sold (especially on the Internet)</li>
</ol>
<p>For this last type, schema.org is good about providing information about products and offers <em>in the abstract</em>, without defining domain-specific properties of the things being bought and sold.  For example, it allows publishers to say very precise things about the price of a specific television, and what consumers think of that television, but doesn't provide a mechanism to classify that television by size, display resolution, or any other property specifically relevant to televisions.</p>
<p>Which is at least in part to say that schema.org is <em>not </em>useful for marking up information specifically relevant to:</p>
<ol>
<li>Any vegetable, mineral or animal that is not a named entity</li>
<li>Concepts</li>
</ol>
<p>To a certain degree this is undoubtedly by design, and is certainly in keeping with the thrust of semantic web technologies, which is to provide machine-digestible data about <em>real-life objects</em> in the world (objects that can be represented by URIs, if you want to get technical about it).  Certainly the bulk of what Singhal apparently conveyed to the <em>Journal</em> surrounded improved recognition and inclusion of entity-based information (and, further to my previous parenthetical comment, of "identifying information about specific entities referenced" on web pages &#8211; which works hand-in-hand with linked URIs).</p>
<p>However, the consequences of a limited vocabulary are, well, limited information.  The best that a site about televisions that doesn't actually sell them can offer to Google now using schema.org is information about the web pages that house television information, but &#8211; again &#8211; no domain-specific information about the properties of any of those televisions.  As I've often contended, structured markup will not be truly useful to content product producers until it allows them to accurately describe a cat video.  Using schema.org the properties of the video itself may indeed by accurately described, but &#8211; aside from being able to declare the subject of the video as the entity "Mittens" &#8211; cannot provide any structured information about the adorable feline.</p>
<p>Among those actively building and finessing schema.org you'll find lively discussions about which new types should be added to the vocabulary, and the limits of extending the vocabulary (see the <a href="http://lists.w3.org/Archives/Public/public-vocabs/" target="_blank">W3C vocabularies mailing list</a>).  Semantic web types mostly caution against efforts at building all-inclusive vocabularies, and the foolhardiness of pursuing an "ontology of everything" &#8211; pointing out, sensibly, that domain-specific vocabularies should be reused by publishers, linking vocabularies when necessary.</p>
<p>This has a great deal of merit technologically and is sensible conceptually, but leaves the everyday webmaster (even a very technically adept webmaster) at a loss if he or she wants to express, in a structured way, some specific property of a topic not covered by schema.org.  However inelegant and monstrous it might be, building an "ontology of everything" &#8211; or at least a vastly expanded (and more readily extensible) schema.org vocabulary &#8211; might be in Google's best interest to facilitate and promote the production of structured data to improve its search capability and results.</p>
<p>The point is, from a taxonomic perspective, if Google hopes to exploit the benefits of classified data it will be in its best interest to support that classification by building or extending vocabularies.  They've been doing this on an ad hoc basis since the introduction of schema.org (June 2011), extending it to include <a href="http://insidesearch.blogspot.ca/2011/08/microdata-sports-stats-happy-fans.html" target="_blank">sports</a> (Aug. 2011) and then <a href="http://googlewebmastercentral.blogspot.ca/2011/09/introducing-application-rich-snippets.html">software applications</a> (Sept. 2011).</p>
<h3>Entities Rule!</h3>
<p>What I've suggested above about Google extending vocabulary support to cover a broader range of non-entity types is highly speculative.  That entities will play a pivotal role in Google "semantic search" is much less so.</p>
<p>In an <a href="http://mashable.com/2012/02/13/google-knowledge-graph-change-search/" target="_blank">interview with Mashable</a>, Singhal stressed &#8211; just as he subsequently did in the <em>Journal</em> interview &#8211; that entities will play a critical role in the road ahead for Google:</p>
<blockquote><p>Google is "building a huge, in-house understanding of what an entity is and a repository of what entities are in the world and what should you know about those entities," said Singhal.</p>
<p>In 2010, Google purchased Freebase, a community-built knowledge base packed with some 12 million canonical entities. Twelve million is a good start, but Google has, according to Singhal, invested dramatically to "build a huge knowledge graph of interconnected entities and their attributes."</p></blockquote>
<p>Apparently the work on entities has been continuing at a fevered pace.  According to the <em>Journal</em>:</p>
<blockquote><p>Mr. Singhal said Google and the Metaweb team, which then numbered around 50 software engineers, have since expanded the size of the index to more than 200 million entities, partly by developing "extraction algorithms," or mathematical formulas that can organize data scattered across the Web.</p></blockquote>
<p>Bill Slawski <a href="http://www.seobythesea.com/2010/08/not-brands-but-entities-the-influence-of-named-entities-on-google-and-yahoo-search-results/">has argued</a> &#8211; I think convincingly &#8211; that what <a href="http://www.seobook.com/brands-hardwired">some</a> <a href="http://searchenginewatch.com/article/2113724/Latest-Google-Panda-Update-Favors-Video-Big-Brands-Google-Properties" target="_blank">observers</a> have classified as "brand bias" is in fact Google endeavoring to make sense of queries as they might be related to named entities, and returning information relevant to an identified brand whenever it can (Slawski recently named Google's <em>Entity Detection</em> patent as one of "the 10 most important SEO patents &#8211; see the end of the <a href="http://www.seobythesea.com/2012/01/named-entity-detection-in-queries/" target="_blank">article on the patent</a> for more resources about Google and entities).</p>
<p>Between Google's filings of entity-related patents, it's 2010 purchase of Metaweb, the introduction of shema.org (supporting the structured markup of named entities) and Singhal's own unambiguous statements, it is clear that entities will play an important role in Google's "semantic search" initiative.</p>
<h3>Does Retooling Mean More Tools?</h3>
<p>Whether Google continues to build out from schema.org or introduces an entirely different knowledge organization scheme, the best vocabulary in the world isn't going to help if nobody uses it.  While it's not rocket science for developers, even relatively code-savvy webmasters and SEOs (these days, often the ones actually marking up HTML with microdata) find it difficult to get microdata right.  How might Google make it easier for webmasters to use structured markup?</p>
<p>One of the reasons often cited for the slow adoption of semantic web technologies is a paucity of useful, relatively easy-to-use tools.  Certainly there's very little in the way of microdata authoring tools, and the most widely-used content management systems (<a href="http://drupal.org/project/schemaorg">Drupal notwithstanding</a>) lack native microdata support.  Were Google to introduce authoring tools that made structured markup easy that would probably go a long way toward improving adoption, with the added benefit that the markup would be uniform and syntactically sound.</p>
<h3>Are "Direct Answers" (Or Much Else Here) New?</h3>
<p>Many commentators in the search marketing industry have remarked that a "semantic" Google might answer more user queries directly in the search results, obviating the need for searchers to visit a site for that information, and so depriving publishers of (search-derived) traffic.  The Journal recounts:</p>
<blockquote><p>Over the next few months, Google's search engine will begin spitting out more than a list of blue Web links. It will also present more facts and direct answers to queries at the top of the search-results page.</p></blockquote>
<p>A tweet from <a href="https://twitter.com/#!/CraigFifield" target="_blank">Craig Fifield</a> summarizes the reaction of many search marketers:</p>
<p><a href="https://twitter.com/#!/CraigFifield/status/180756049203175424"><img class="aligncenter size-full wp-image-1139" src="http://www.seoskeptic.com/wp-content/uploads/2012/03/craig-fifield-tweet.jpg" alt="semantic search from Google to fight SEO? come on, it's just another way to use our content on their site and keeping the traffic - Craig Fifield" width="520" height="72" /></a></p>
<p>Should Google force a user to click through to a site to consume a piece of information that it thinks it can accurately return in the search results?  As long as the source or sources of that information is cited and linked (a SERP-level provenance measure) it seems to me a user's best interests are served without requiring them to jump through that particular hoop.</p>
<p>While I recognize that there's lots of nuance involved in debates about information ownership and Google's use of information published by others, I think that SEOs worry far too much about "direct answers" poaching traffic to their sites.  The nature of such information is, first, limited to the sort of facts that can be encapsulated in relatively brief form (e.g. "when was William Shakespeare born") and, second, as likely or not be found on a non-commercial authority site that's going to outrank you anyway (e.g. "when was William Shakespeare born").  Sites with rich content are still likely to fare well, and do any of us bemoan the loss of content farm pages in the search results that were written specifically to answer queries like "when is Memorial Day 2012?".</p>
<p>In any case, as ReadWriteWeb's Jon Mitchell <a href="http://www.readwriteweb.com/archives/google_semantic_search_bad_for_seo_good_for_you.php">correctly notes</a>, such "direct answers" are not new.  Perhaps we'll see more of these, or perhaps even the development of user interfaces that permit searches to delve deeper into the response to their question without leaving a Google page.  Were this ever to present an egregious challenge to a publisher's traffic, that publisher could always prevent Google from indexing their content &#8211; though of course, that would also prevent users from discovering that content from search, and almost certainly end up being a case of cutting off one's nose to spite one's face.</p>
<p>As I never tire of telling people (but as they almost certainly tire of hearing), Google has been employing semantic web technologies for a long time (this earlier post on <a href="http://www.seoskeptic.com/seo-semantic-web/">SEO and the semantic web</a> provides examples).  It's possible that what's on the Google horizon when it comes to "direct answers," the use of named entities and other semantic web technologies may end up be being big changes under the hood, but rather less evident in the way actual search results are displayed.</p>
<h3>The Big Takeaways for Search Marketers</h3>
<p>Sites with structured markup will appear in the results of more queries, rank better than non-structured web pages for those queries, and have a greater visibility both in linked and "direct answer" verticals.</p>
<p>Sites that can successfully identify and interlink entities in a fashion that Google can readily understand, whether by the use of structured data or otherwise, will find themselves particularly favored, both in linked search results and as sources of information extracted by Google and presented directly in responses to queries.</p>
<p>Up until now Google has been insistent upon the fact that structured markup does not impact the ranking of a site in its search results.  To channel my inner Jan Brady, it's all <em>rich snippets, rich snippets, rich snippets</em>!  But here comes a point where it becomes disingenuous to leverage structured data solely to manipulate the way in which it is represented.  More to the point as it pertains to Google's business model, there comes a point where willfully ignoring the information structured data provides results an inferior user experience.</p>
<p>If a site &#8211; by virtue of declaring and linking named entities, employing structured markup and providing verifiable provenance information &#8211; helps Google understand that a given resource is a relevant match for a given query, Google will inevitably favor relevant results over a fear that, in doing so, it may be exhibiting bias against more loosely structured sites.  Google will absolutely continue to try to understand sites with unstructured data, but at the end of the day the better Google understands a resource, the more use it can &#8211; and will &#8211; make of it.</p>
<div name="googleone_share_1" style="position:relative;z-index:5;float: right; margin-right: 10px;"><g:plusone size="medium" count="" href="http://www.seoskeptic.com/deciphering-googles-semantic-search-intentions/"></g:plusone></div>]]></content:encoded>
			<wfw:commentRss>http://www.seoskeptic.com/deciphering-googles-semantic-search-intentions/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
	</channel>
</rss>
