<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tim Nash &#34;stuff&#34; Blog &#187; Search Marketing</title>
	<atom:link href="http://www.timnash.co.uk/category/search-marketing/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.timnash.co.uk</link>
	<description>The Stuff Consultant</description>
	<lastBuildDate>Tue, 08 Nov 2011 11:02:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Should Opinion Be a Ranking Factor?</title>
		<link>http://www.timnash.co.uk/05/2010/should-opinion-be-a-ranking-factor/</link>
		<comments>http://www.timnash.co.uk/05/2010/should-opinion-be-a-ranking-factor/#comments</comments>
		<pubDate>Sat, 15 May 2010 13:32:51 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Advanced SEO]]></category>
		<category><![CDATA[Search Marketing]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=597</guid>
		<description><![CDATA[Tim shows how he added the ability to determine if an article is fact or opinion and integrated into a clients internal search system. he then ponders a larger question should opinions be weighted above factual information. Should opinion be a ranking factor?]]></description>
			<content:encoded><![CDATA[<p>I have just finished an interesting project for a client. They run a network of news and analysis sites and have a custom search engine. One of the complaints about the search was that news and factual information was being missed because of a wealth of opinion and editorials. I was brought in to try and come up with a way of identifying if an article was factual, opinion driven or editorial (factual that has an opinion), and to provide a ranking factor for the three. The idea was that a full-on opinionated rant should appear lower in the results than a factual news story.</p>
<p>Now, there is no such thing as un-opinionated article. Everyone will have a view, even if they try to not be biased, so the first and obvious question is: When should opinion count?</p>
<h3>What Is an Opinion?</h3>
<p>Well, a quick visit to dictionary.com brought 2 out 7 definitions for &#8216;opinion&#8217; worth considering:</p>
<ol>
<li>a belief or judgment that rests on grounds insufficient to produce complete certainty.</li>
<li>a personal view, attitude, or appraisal.</li>
</ol>
<p>The words I think are important are judgment, view and appraisal. These lead us to right, wrong, negative and positive, so we could perhaps say an opinion, in context of a written prose, is something that is skewed towards negative or positive wordings.</p>
<blockquote><p>Tim is an utterly amazing and cool guy. I think everyone should follow him on <a href="http://twitter.com/tnash">twitter</a> because he is awesome.</p></blockquote>
<p>Can be safely assumed to be an opinion, as could</p>
<blockquote><p>Tim is a miserable, boring, depressing person no one should follow!</p></blockquote>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="480" height="385" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/ujUQn0HhGEk&amp;hl=en_US&amp;fs=1&amp;rel=0" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="480" height="385" src="http://www.youtube.com/v/ujUQn0HhGEk&amp;hl=en_US&amp;fs=1&amp;rel=0" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<blockquote><p>But it’s not long before Storm gets started:<br />
“You can’t know anything,<br />
Knowledge is merely opinion”<br />
She opines, over her Cabernet Sauvignon<br />
Vis a vis<br />
Some unhippily<br />
Empirical comment by me</p>
<p>“Not a good start” I think&#8230;.<br />
I resist the urge to ask Storm<br />
Whether knowledge is so loose-weave<br />
Of a morning<br />
When deciding whether to leave<br />
Her apartment by the front door<br />
Or a window on the second floor.</p></blockquote>
<p><em>Tim Minchin &#8211; Storm </em></p>
<p>But what about:</p>
<blockquote><p>Placing the L45 into Warp drive mode will cause it to fail resulting in a negative feedback in the pinky.</p></blockquote>
<p>Here is a something that, while negative words are used, could not be described as an opinion, assuming the warp drive does indeed fail in those circumstances. Still, if we assume an opinionated piece will have a higher concentration of negative or positive words per word count, we can create a simple rule for determining if text is opinionated or not.</p>
<h3>How To Determine Positive or Negativity</h3>
<p>Let&#8217;s face it, certain words are positive and others are negative. Actually, as humans, we can quickly work out which are which easily enough. Computers, however, need a little more help, so we have to provide them with a set of words and identify if they are positive or negative. Then, we simply process through them and work out if the line is positive, negative, or neutral on a line by line basis. Simples.</p>
<p>Ok, so to do this, we could simply bucket count, but that&#8217;s not very efficient. Instead, we&#8217;ll use the Bayes Theorem :<br />
<em></em></p>
<p><em>P(A|B) = P(B|A)*P(A) / P(B)</em></p>
<p>Where P is Probability. Where A and B are events and A|B is where event A occurs if B is true.<br />
So, Bayes theorem is calculating the probability that &#8220;A&#8221; is true or will be true, given a certain set of circumstances &#8220;B&#8221;. For more information, see <a href="http://www.statgun.com/research/bayes-law.html">Bayes Law in Plain English</a>.</p>
<p>With this math under our belt, we can construct a set of Baysian classifiers. These will sound familiar because they are what a lot of anti spam filters use. In our case, they allow us to process through and look for negative and positive words and phrases. With the classifier in place and a suitable set of positive and negative word lists, we can begin processing through the documents.</p>
<p>While the client was only interested in opinions, we actually stored the total number of sentences with negative or positive opinions. The code for our initial version was heavily influenced by <a href="http://phpir.com/bayesian-opinion-mining">Baysian opinion mining</a> code on PHPIR. However, we did end up rewiting the code in C++ as a php module to speed up the results for our large data sets. In addition, we took a similar method to <a href="http://darkoromanov.wordpress.com/2010/03/08/improved-bayesian-opinion-mining/">Darko Romanov</a> and filtered the stop words from our sentences.</p>
<p>Once we processed through document, sentence by sentence, we determined an overall score. This score was then used, along with the total number of sentences, to determine how opinionated a piece was. We also showed if it had a positive or negative bias and compared it to 100 examples of what was deemed to be opinionated and 100 that were not.</p>
<p>The system correctly identified the one hundred opinionated pieces, but also incorrectly identified 12 of the non opinionated pieces. Tweaking of the Lexicon reduced this down to 4, which was deemed a reasonable error margin. However, it also let 1 opinionated piece through. Again, this was deemed acceptable. The goal now is to provide multiple lexicons dependent on the site and author who is writing the piece.</p>
<h3>Does It Cope With Sarcasm?</h3>
<p>Surprisingly well! Most sarcasm is used when a positive indicator is used. When in fact a negative is inferred, most sarcasm is surrounded by other negative sentences when the system breaks down content sentence by sentence. Thus, while the sarcastic sentence itself will indeed be misclassified, the surrounding sentences will not (hopefully), and so there would be more negative then positive sentences within the piece.</p>
<h3>What Other Uses In Search</h3>
<p>Well, obviously, in the original client request, they were looking at removing or reducing relevancy of opinionated pieces, but imagine if you did the opposite. Let&#8217;s say I run a review website that people can use to search for product reviews. Obviously, I link to products with affiliate links.</p>
<p>Now, imagine if my internal search was designed to show more positive results for higher commission or converting items? Negative reviews would be found further down the list of product reviews. Sneaky, but if you have lots of people using your internal search it could be one way to increase revenue.</p>
<h3>Should it be a ranking factor?</h3>
<p>Well, I suspect many site owners of products may wish it was a factor. Take, for example, my post on <a href="http://www.timnash.co.uk/05/2010/gopark-co-uk/">GoParks</a> recently. The owner of the site would probably be quite keen if opinion pieces had reduced rankings. I can also see for Google News or similar this style of ranking could be useful, but on the main stream web I&#8217;m not convinced that it would work.</p>
<p>What do you think, should opinion be a ranking factor?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/05/2010/should-opinion-be-a-ranking-factor/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Want to have an automated content network G has made it easy</title>
		<link>http://www.timnash.co.uk/10/2009/want-to-have-an-automated-content-network-g-has-made-it-easy/</link>
		<comments>http://www.timnash.co.uk/10/2009/want-to-have-an-automated-content-network-g-has-made-it-easy/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 19:45:38 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Search Marketing]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=386</guid>
		<description><![CDATA[Google has provided an API for their Google Pages but has this just opened the flood gates for automated spam in Googles own backyard?]]></description>
			<content:encoded><![CDATA[<p>Ok so it might not have been their intention but with <a href="http://googlecode.blogspot.com/2009/09/google-sites-now-with-api.html">Googles announcement</a> that they are now providing an API to google pages, their answer to Geocities they have provided the perfect platform for an automated content network or um spam sites your choice.</p>
<p>The new API allows:</p>
<ul>
<li>Retrieve, create, modify, and delete pages and content.</li>
<li>Upload/download attachments.</li>
<li>Review the revision history across a site.</li>
<li>Display recent user activity.</li>
</ul>
<p>While at first glance this doesn&#8217;t sound breathtaking and to gain multiple Google Page sites you need multiple sites and multiple API keys this is the first time a &#8220;free&#8221; provider has made it quite so easy to automate parasite sites not hosted by the SEO company. And while Google may well de-list individual Google Pages or ban that user they are not going to ban their entire system meaning that its one of the few sites where you are sure to be able to from the off create nice clean sites in the eyes of Google with no worries about past history.</p>
<h3>Ideas for content Network Developers</h3>
<p>My first thoughts was to grab all those nice article directories and republish them, or indeed a bit of <a href="http://www.bluehatseo.com/black-hole-seo-desert-scraping/">desert scraping</a>, automate the search, Xpath the content and automate the uploading  after the initial setup you could simply insert keywords and let it go, for multiple Google accounts <a href="https://www.mturk.com">Mechanical Turk</a> provides a simple and cheap way round the CAPTCHAs.</p>
<p>Just food for thought, of course these low quality sites would be terrible for users and near worthless for search engines so expect poker, viagra and other competitive niches to be abusing them soon.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/10/2009/want-to-have-an-automated-content-network-g-has-made-it-easy/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Behaviour Modelling Seminar</title>
		<link>http://www.timnash.co.uk/09/2009/behaviour-modelling-seminar/</link>
		<comments>http://www.timnash.co.uk/09/2009/behaviour-modelling-seminar/#comments</comments>
		<pubDate>Tue, 01 Sep 2009 09:25:32 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Advanced SEO]]></category>
		<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Search Marketing]]></category>
		<category><![CDATA[SEO Introduction]]></category>
		<category><![CDATA[Site Information]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=348</guid>
		<description><![CDATA[Tim is running a seminar pre Think Visibility the first public one he has done in a couple of years do you want to be at it?]]></description>
			<content:encoded><![CDATA[<p><strong>September 11th, 1pm onwards, Leeds UK <a href="http://seminar.timnash.co.uk/">signup here</a></strong></p>
<p>What you want more? Seriously I mean what do I need to tell you?<br />
I will be running a seminar on <a href="http://seminar.timnash.co.uk/">Profiling website users</a>, looking at life cycles and a huge pile of tricks and ideas to get the most out of your users. Want to know if colour effects genders differently? Want to know how to estimate gender in website visitors?</p>
<p>Then you should be attending this seminar! What&#8217;s more its free well sort of, its actually free to <a href="http://www.thinkvisibility.com">Think Visibility</a> attendees everyone else its £50 but of course what this really means is if you buy a <a href="http://thinkvisibility.buildevents.com/">Think Visibility</a> ticket at £99 and use my discount Coupon TIMNASH the price will drop down to such a level that you got to ask yourself why not come to both <img src='http://www.timnash.co.uk/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<p>I&#8217;m afraid while this seminar will be filmed it will be only for internal use and not available for purchase, if it&#8217;s a success I might run a future seminar and film it.</p>
<p>If you are interested in improving your conversions, or even just curious as to what it is that I find more interesting then SEO (or if your into SEO why what we do is so much more then the normal) then come along for the afternoon it&#8217;s free, simply register to confirm your place at the <a href="http://seminar.timnash.co.uk/">behaviour modelling seminar</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/09/2009/behaviour-modelling-seminar/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Think Visibility 2 &#8211; oh yes its back!</title>
		<link>http://www.timnash.co.uk/08/2009/think-visibility-2-oh-yes-its-back/</link>
		<comments>http://www.timnash.co.uk/08/2009/think-visibility-2-oh-yes-its-back/#comments</comments>
		<pubDate>Fri, 21 Aug 2009 09:33:37 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Search Marketing]]></category>
		<category><![CDATA[SEO Introduction]]></category>
		<category><![CDATA[Site Information]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=346</guid>
		<description><![CDATA[Think Visibility is coming are you ready for it? Also would you be interested in joining me the day before for an indepth seminar in some of my latest research areas?]]></description>
			<content:encoded><![CDATA[<p>You have to have been living under a rock to not know <a href="http://www.thinkvisibility.com/">Think Visibility</a> is making its return on September the 12th! If you haven&#8217;t bought your ticket do so now using the coupon TIMNASH http://www.thinkvisibility.com/ ok so who&#8217;s going to be speaking?</p>
<ul>
<li>Several of the best SEOs in the country</li>
<li>WordPress and CMS experts</li>
<li>Social Media experts</li>
<li>A true accessibility expert with hands on experience</li>
<li>A guy who has been sued by well everyone!</li>
<li>oh and me</li>
</ul>
<p>Check out the full list of <a href="http://www.thinkvisibility.com/speakers/">speakers</a> and the <a href="http://www.thinkvisibility.com/schedule/">schedule</a> and if you haven&#8217;t already <a href="http://thinkvisibility.buildevents.com/">buy your ticket</a>, using the discount coupon TIMNASH </p>
<h3> Important &#8211; would you like a more indepth seminar the day before?</h3>
<p>I&#8217;m seriously thinking about running a free or very very cheap seminar for attendees on the friday if their was a charge it would be to cover costs for hiring somewhere. If you would be intersted in attending this seminar which would be on the Friday afternoon then please let me know if there is enough interest then I will organise one.</p>
<p>What I would be covering, metrics, human behaviour modelling and identifying patterns (including some nifty ideas for SEOs out there) as I say nothing concrete but either use the comments here or email/contact/tweet me (yes I&#8217;m now using twitter <a href="http://twitter.com/tnash">@tnash</a>) </p>
<p>This seminar would only be for Think Visibility Attendees basically I&#8217;m offering a £500 half day course for free <img src='http://www.timnash.co.uk/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  if people want it. </p>
<p>Oh don&#8217;t forget to buy a ticket! </p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/08/2009/think-visibility-2-oh-yes-its-back/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Temporal Click/Heat Mapping</title>
		<link>http://www.timnash.co.uk/02/2009/temporal-click-heat-mapping/</link>
		<comments>http://www.timnash.co.uk/02/2009/temporal-click-heat-mapping/#comments</comments>
		<pubDate>Sun, 01 Feb 2009 23:20:44 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Search Marketing]]></category>
		<category><![CDATA[Social Media Optimisation]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=254</guid>
		<description><![CDATA[A look at another recent project this time studying click/heat density maps but with the additional element of time, allowing us to see not only where users clicked but when and in what sort of numbers.]]></description>
			<content:encoded><![CDATA[<p>Very quick post mainly as I haven&#8217;t posted anything for a while and I promised myself I would, so thought I would introduce a project we are currently working on for a client or rather in this case the theory and point you in the right direction code wise.</p>
<h3>Heat/Click maps</h3>
<p>Most readers will be familiar with these but if your not basically a click heat map measures the number of clicks occurring on specific parts of a page (normally links) and then creates a pretty density match the more clicks the brighter the colour (normally they look like heat maps hence the name) they are used to see areas of interest for users and where users are leaving. These maps are really quite useful if you know how to use them and you can easily get started using open source software like <a href="http://www.labsmedia.com/clickheat/index.html">Clickheat</a> and there are also paid for services out there.</p>
<h3>Temporal Click Density maps</h3>
<p>So I&#8217;m sure the title (time based click maps) has given away what our current project is all about, you see clickmaps give a very two dimensional view of click density by introducing time, we get a much better picture of what areas are being clicked on and some motivation behind them. Unfortunately introducing time into web analytics is notoriously difficult perhaps more so then it would appear on first glance.</p>
<p><strong>Time periods</strong><br />
The first problem is accurately recording time, either you rely on the server or the clients browser to record the time, both come with problems and neither can be considered accurate the solution in this case is simply to probe in time periods in our case we have chosen a near logarithmic scale for determining time periods. It would be unhelpful if 99% of all our clicks were in our first time period because we chose to long a period and likewise we wish to keep processing and data storage overheads down so polling data every second is unrealistic.</p>
<p><strong>In focus or idle?</strong><br />
The next hurdle is determining if they are even looking at the page all modern browsers are capable of tabs and multiple windows, and so users could leave a page open and be surfing for hours on another window never to return to your site. Likewise a user referencing your site can be coming backwards and forwards between tabs or windows. Most browsers now support some javascript focus option which will allow you limited knowledge when combined with blur. However this does require that we first access the <a href="http://www.quirksmode.org/js/support.html">browsers capabilities </a>. In reality we actually use compiled action script assuming the user has flash and fall back to javascript only really for iphone browser and similar.</p>
<h3>Some Temporal Click Density Map goodies</h3>
<p>ok so now for some totally unhelpful yet oddly cool tidbits we have discoverd while running temporal clickmaps on a couple of our sites. Remember now this is just initial findings on a couple of sites.</p>
<p><strong>Greater the density the earlier the click</strong> – Afraid so from our initial finding the click area with the greatest number of clicks also had the clicks in a short period of time relative to their arrival. The positive side is a good call to action is likely to be clicked on quickly.</p>
<p><strong>Adverts are clicked late</strong> – This was a little surprising I had always assumed advert clicks occurred quite early on but adverts tended to be clicked later, is this showing a subconscious desire to complete the primary call to action? Well possibly and (warning blatant plug ahead) its certainly the theory we have put forward for why newmedias.co.uk <a href="http://www.newmedias.co.uk/wordpress-captcha-adverts/">Your CAPTCHA advert wordpress plugin</a> works (end of plug)</p>
<p><strong>Further down the page the later</strong> – yep your hocked but the lower down the page the longer it takes to get clicked,  there is an exception to this and that is pre known call to actions a good example being long sales pages where we saw a split between quick clicks vs long clicks. So some people realising the style of page shot to the bottom while others waited.<br />
<em>Interesting and this is when this sort of mapping comes into its own, those who waded through the long sales page and clicked to purchase were statistically less likely to complete the transaction</em></p>
<p><strong>Stumblers really do hate you </strong>– a couple of years ago I posted about the fact that the most clicked area on a page during a stumble was the top left hand side and theorised that this increase was due to stumblers missing the stumble button. With the new system we can see the different density levels for the top left for stumblers vs non and with a noticable difference between the two the theory still stands to make things worse&#8230; nearly all those clicks are in the first time period meaning you really don&#8217;t have much chance to impress.</p>
<p>So what do you need to get temporal data going on your site, well clickheat map system is a good start I recommend labsmedia <a href="http://www.labsmedia.com/clickheat/heatmap.html">clickheat php class</a>  as a starting point. Then you need to measure time and focus we use an actionscript system but I would be interested in seeing or hearing other ideas. Indeed if people want to play with our code I maywell clean it up and throw it up on newmedias.co.uk</p>
<div id="vs-message">
<strong>Stuff Consulting</strong><br />
Are you interested in finding out what your users are up to on your site and adapting your site to their needs then why not think about hiring a Stuff Consultant! See my <a href="http://www.timnash.co.uk/consulting/">consulting services</a> for more information or why not <a href="http://www.timnash.co.uk/contact/">get in touch</a>!</div>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/02/2009/temporal-click-heat-mapping/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Online Profiling – The good, the bad and the plain evil</title>
		<link>http://www.timnash.co.uk/01/2009/online-data-collection/</link>
		<comments>http://www.timnash.co.uk/01/2009/online-data-collection/#comments</comments>
		<pubDate>Tue, 06 Jan 2009 11:28:29 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Search Marketing]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[profiling]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=243</guid>
		<description><![CDATA[Have you ever wondered how much information you produce when browsing the web, forget your browser window size, what about your name, telephone number and date of birth. Chance are you are giving me all those details right now.]]></description>
			<content:encoded><![CDATA[<p>How much do you reveal about yourselves to the sites you visit?<br />
Without filling in a form there is a strong possibility that a site can get hold off:</p>
<ul>
<li>Your Name</li>
<li>Age </li>
<li>Shipping Address</li>
<li>Some basic Credit details.</li>
</ul>
<p>They can almost certain make educated and pretty accurate guesses to:</p>
<ul>
<li>If you are at work</li>
<li>your income bracket</li>
<li>Gender</li>
</ul>
<p>They might even in a few users cases be able to tell you what you had for breakfast all in a few seconds. Scared? </p>
<p>We recent worked with a client who was looking to improve conversions on their alternative payment methods in this case primarily Google Checkout. They were disappointed by the take up on their Google Checkout option (with such low commission for the first year I suspect they were expecting significant savings)  we were asked to develop a method to target existing Google Checkout users and present them with using the one click checkout option in favour of the traditional credit card approach. </p>
<h3>Solution</h3>
<p>We ran a very short questionnaire, of sales made to ask a simple question:</p>
<p><em>Do you use any of the following Google Services (tick as many as you use):</p>
<ul>
<li>Gmail</li>
<li>Google Calendar</li>
<li>Google Checkout</li>
<li>Google reader</li>
<li>Google Personalised Home page</li>
</ul>
<p></em></p>
<p>The results were pretty staggering 55% used one of the above services with Google personalised homepage making up the majority followed by Google checkout! </p>
<p><strong>Making it worthwhile</strong><br />
ok so we now know there is a feasible amount of users accessing Google services and a reasonable (12.4%) using Google Checkout to make targeting while 12.4 of participants to the survey we can normally add an error margin of + or – 2% which means theoretically 14.5% (approx) could be using checkout thats not a small majority. If we could up the number of completed checkout by even a few percent it would make a difference.</p>
<p>The idea was fairly simple instead of presenting a Pay with Credit Card button with a google checkout button as one of the alternate smaller payments, we would show the Google checkout button as the primary option and pay with credit card as the alternate. </p>
<p>This idea could backfire, users may have made a single one off payment with Google Checkout and absolutely hated it! We have no way of knowing we maybe suggesting something they loathe but lets for now assume that paying via Google Checkout is the preferred option for all those who have previously used checkout but if its a new user how do we know?</p>
<h2>Iframes, Javascript and the DOM</h2>
<blockquote><p>Using just Javascript to do this sort of XSS is no longer really viable, however there are plenty of alternates using flash and PHP proxies.</p></blockquote>
<p>If I was to get you to click this link https://www.google.com/accounts/ManageAccount?service=profiles&#038;hl=en chances are that it has just opened your Google profile. Why is that? Well mainly because the people reading this are likely to use one or more of Google services with a fair few using Gmail. Now whats the first thing you do when you get online? </p>
<p>Ok so this page is going to be accessible via javascript for a lot of users, to access it we simply open it in an iframe. We then can use javascript to pull data out, by calling the document in the iframe using something similar to:</p>
<p><code>
<pre>window.frames['myframe'].document</pre>
<p></code></p>
<p>where myframe is the id of the frame, once we have access to the document we can use a range of techniques for extracting data from getelementbyid through to implementations of xpath (since Google can&#8217;t manage nicely designed web pages the later is needed) </p>
<p><img src="http://www.timnash.co.uk/wp-content/uploads/2009/01/timsaccount.jpg" alt="Tims Google Account" /></p>
<blockquote><p><strong>note</strong> – Its worth remembering this sort of implementation is subject to the whims of the site your grabbing the data from! If they change the page layout you will need to match it. Also you are reliant on the user a) being logged in, b) their browser playing fair, each browser handles cross domain requests differently for example IE is a lot more strict then say Google Chrome the above code will not work on most browsers without some modification</p></blockquote>
<p>If you take a look at the above you can see the section marked default payment you will forgive me if I have removed some personal details but it shows the default Card type and last 4 numbers if any card is associated with the Google account. So all we need to do is check to see if there is a 4 digit code there if there is then we can assume the user is a Checkout user if there is not or if we couldn&#8217;t log in then we assume they are not. </p>
<p>Once they have made their first purchase we can ask them to create an account and allow them a preference check as to how they wish to pay. </p>
<p>We have only been running the new system a few days but the initial data is positive sadly we can&#8217;t do a similar thing for Paypal and would need to rely on CSS history trick (see below)</p>
<h3>All your data is mine</h3>
<p>Lets go look at that Google account page again, I&#8217;m pretty frugal with my details yet on that page there is my Full name, email address, a shipping address (probably my home even) if you go down to the services you would have quickly realised that I use this account for work as some of my services include Google Web Optimiser, Adwords, Analytics.</p>
<p>So what about other sites and other information for example&#8230;</p>
<p><strong>MSN / Yahoo</strong><br />
Similar to Google both have a central account location lots of information to grab should you choose to.</p>
<p><strong>Facebook</strong></p>
<p>http://www.facebook.com/home.php?ref=home</p>
<p>Gives basic login info including full name and their Facebook ID via the profile link which allows us to access:<br />
http://www.facebook.com/profile.php?id=FBID&#038;ref=profile#/profile.php?id=FBID&#038;v=info&#038;viewas= FBID </p>
<p>Which gets you the users complete profile, dates of birth, mobile phone numbers etc etc, oh and all their friends information to (well what they shared with your original victim sorry customer)</p>
<p><strong>Twitter</strong><br />
I did say find out what they had for lunch, while Twitter information can be obtained through the Iframe method its much simpler but my friend <a href="http://www.thehodge.co.uk/">TheHodge</a> pointed out an <a href="http://www.wait-till-i.com/2009/01/05/detecting-and-displaying-the-information-of-a-logged-in-twitter-user/">alternate method</a> utilising a bit of javascript and Twitter API the downside to this is that like the rest of Twitter you are reliant on the thing being up! Still I did promise we could find out what some people had for lunch and what better place, once you got the username, just pull in their feed and mine.</p>
<h3>Profiling and Data guessing</h3>
<p>Once we have all this data we can start making some guesses and profiling users, we could go the whole hog and do credit checks after all we have name and current address but in the UK that would probably abusing your credit license plus time consuming etc. We could also pull details from the land registry on their property and then estimate an income bracket, or we could look up their job description. I can&#8217;t find the study but the BBC ran a report a while back claiming people were less likely to lie about their Job title and description on facebook then on other sites such as LinkedIn etc always worth knowing.</p>
<p><strong>Identifying if some one is at work</strong><br />
This is pretty easy, verify they have a job, check current time in their location is it between 9 and 5? reverse DNS on their hostname check against a list of common terms such as residential etc if their ISP shows no sign of being primarily for non commercial use, its within normal hours of work and they themselves do work its a reasonable assumption to assume they are at work! Of course they might not be but chances are they are.</p>
<p><strong>What if they use a secure session?</strong><br />
Some sites like Paypal won&#8217;t let you stay logged in so you can&#8217;t grab information from the browser window unless they happen to be using Paypal within a few minutes of visiting your site. If we want to know if they have visited Paypal we will have to go down an alternate and less reliable route. </p>
<h3>CSS Browser History</h3>
<p>Your browser can identify visited links, so when a page renders it knows when you have been to certain pages before. Web designers can even style these visited pages with a different colour link, which means its equally easy for us to identify where a user has been. The first step is to generate a list of possible locations, then create a very specific visited colour. When the user visits the page it will show the other pages they visited in your specific colour, we then run a piece of script to identify elements with that specific colour on the page and we now have a history of where you have been. </p>
<p>Note: This is flaky and I mean really flaky I have used the term URL but I should really have said URI it is the exact location you have been for example http://www.example.com and http://www.example.com/#hello are not the same and in some browsers (such as chrome they would not both show in css browser history unless you visited both) </p>
<p>For more information and for the original script that see the <a href="http://ha.ckers.org/weird/CSS-history.cgi">Ha.ckers.org CSS History</a> Hack</p>
<p><strong>Final thoughts</strong><br />
If this has worried you a little then remember the solution is simple, log in and out of your sites, if you must stay logged in use a different browser (or in Chrome and new Firefox user their privacy mode) </p>
<p>However for marketers the ability to know as much information as possible about a user is gold but only if you can properly utilise it also keeping in mind data protection and retention regulations means a lot of the data you may find you can&#8217;t keep so can only be used for one off judgments (such as assigning a profile group, or changing checkout info around) </p>
<p>So now I will leave you with a simple question, how much are you worth? Your data I mean how much do you think all the data your exposing is worth?</p>
<div id="vs-message">
<strong>Stuff Consulting</strong><br />
Are you interested in Profiling and Grouping your users then why not think about hiring a Stuff Consultant! See my <a href="http://www.timnash.co.uk/consulting/">consulting services</a> for more information or why not <a href="http://www.timnash.co.uk/contact/">get in touch</a>!</div>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/01/2009/online-data-collection/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Case Study &#8211; Profiling Landing Page Users</title>
		<link>http://www.timnash.co.uk/12/2008/profiling-multivariate-landing-page-users/</link>
		<comments>http://www.timnash.co.uk/12/2008/profiling-multivariate-landing-page-users/#comments</comments>
		<pubDate>Sun, 14 Dec 2008 13:20:39 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Advanced SEO]]></category>
		<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Search Marketing]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=221</guid>
		<description><![CDATA[User Profiling is a vital but often missed tool when developing landing pages, in this case study I show the steps we went through to increase a clients landing page conversions not by changing the variables in the landing pages but segmenting the traffic source and profiling it. Warning this post contains some logic concepts.

]]></description>
			<content:encoded><![CDATA[<p>After posting about doing some testing on my Facebook status (I don&#8217;t use twitter) I got a couple of queries about what I was up to, my current project is a bit hush hush so I thought I would introduce this case study. I have also provided what I hope are some handy hints dotted through out.</p>
<hr />
<p>Affiliates are great no seriously they are the online sellers best asset without them many of the big stores would barely make a penny. So its strange how so many of them take their affiliates for granted treating them as one homogeneous block. We were asked to do some work for a client and I thought I would share some of the theory behind it (warning this post contains logic and a sprinkling of basic math) even though its not strictly an SEO post many SEOs are asked to perform this sort of statistical analysis every day.</p>
<p><strong>The problem</strong><br />
Company A sells a diverse range of products it has a large affiliate base and are keen on their affiliates doing the leg work. Their philosophy is to help the affiliates along as much as possible. While they don&#8217;t have a dedicated affiliate landing page (they prefer to encourage deep linking to specific products) their “special offer” pages are used within PPC campaigns and as a focal point for affiliates activities. These pages are regularly tested however traditional split testing has caused a problem. Several of their high earning affiliates have low conversion rates on these pages while the PPC campaign is doing well and according to traditional split testers this page is the best performing of all recipes that had been presented over a large dataset.</p>
<p><strong>The traditional solution</strong><br />
We have numerous traffic sources all arriving at one area the traffic is bound to be looking for different things and at different points along the buying process it makes sense therefore to simply split this traffic into groups and optimise each group. So we now have 3 separate experiments running each with the same recipes.  The results come in the PPC is up as is the misc organic traffic but conversions while up slightly are still disappointing for the affiliates, time to investigate whats going on.</p>
<h3>Interrogating demographics</h3>
<p>Eccomerce sites have a wealth of information about their users, their buying practices and with a little bit of data mining we can create a profile of shoppers (we have 12 in this study) on the site. When we match that with where they have come from we can begin to organise what sort of user is coming from our various traffic sources. Obviously there is crossover and sources such as PPC and Organic traffic should produce a diverse set of profiles as will affiliates on the first glance it is only when you break down your affiliate traffic by affiliate that the number of profiles reduces. </p>
<p>Since I really wasn&#8217;t interested in Affiliates making a sale a year I quickly discarded a few thousand records and associated bits to a core group of 500 or so,  each ones sales patterns were looked at and users profiled. Interestingly the top 5 affiliates had far less cross over in profiles then those earning a little less. This points to the fact that they are using distinct niches and they were tending to not step on each others toes. The next step was to compare these profiles to our conversion information for our landing pages were they way outside of the normal profiles?</p>
<hr />
<strong>Quick tip</strong> – Profiling and data mining is one of the key elements to eccomerce that even some of the big boys can&#8217;t manage to get right! Take amazon as an example of company with a flawed profile and data mining set. While its clear they are profiling users they are not cross referencing with previous purchases causing them to send emails suggesting you purchase books you already bought from them! If your interested in profiling and customer data mining I strongly recommend <a href="http://www.amazon.co.uk/gp/product/0749453389?ie=UTF8&#038;tag=timnasblo-21&#038;linkCode=as2&#038;camp=1634&#038;creative=6738&#038;creativeASIN=0749453389">Scoring Points: How Tesco Continues to Win Customer Loyalty</a> its a good read over the Christmas break.</p>
<hr />
<p>Our best performing profiles on landing pages were 3 distinct profiles to keep things simple we shall call them the bargain hunter, the infrequent and the impulse. In reality each of these had 2 or more subset profiles but we can quickly see the special offers landing page are converting people who are looking for bargains, who don&#8217;t spend often with the store and those whose shopping patterns are erratic but often put more then one thing in the basket before checkout. Now this is not surprising but its always nice to have stats back up common sense particularly as its not always the case! Our top affiliates users profiles were on the whole not part of our profiling grouping of high conversions though there was some crossover with impulse profile amongst them. Interesting affiliates were producing the highest returning customer group even from this pages (indicating even if they didn&#8217;t buy the deal they were remembering the site) . Through out all of this one affiliate was standing out his conversion rate was close to 25% while his traffic stats were similar to the rest of the top affiliates these figures were so high they were hard to believe and our client was suspicious something was afoot. His traffic was almost entirely a single profile and were purchasing specific products and only those products for our client his users were infuriating not responding to future mailings, rarely returning accept through his link to specific products something was clearly going on, could he be part of some secret shopping sect? Is he brainwashing? Or was something darker going on?</p>
<h3>Investigating an Affiliate</h3>
<p>The vast majority of affiliates get on with their affiliate managers its a symbiotic relationship one can&#8217;t exist without the other. Therefore most affiliates don&#8217;t mind when their manager calls them for a chat often its going to be over some offer or a rise in commission when someone refuses to communicate it signals if nothing else a deterioration in their relationship. When we were unable to contact this particular affiliate it seemed that we might have a bad egg it was time to unravel his traffic sources. Like most affiliates he used a couple of central redirects in an attempt to both hide the real source but also for stats gathering, a whois and DNS lookup allowed us to find the fact both of these were on a single server and after a few minutes we had a 122 domains mainly .infos to look through passing those through our link mapping software produced nearly 400k backlinks across the network, nice thats a weekend of fun. Thankfully manually scanning through such a large dataset is not really feasible. Taking a small sample of the 122 indicated most could be considered as “made for adsense” style sites scraped or just crud content with links pointing to other parts of the network and a pile of ads. Thankfully the link mapping also provided 6 domains (all hosted on separate servers) where the network was pointing to. So far we still have a suspicion and some dodgey link building methods certainly nothing to damn the affiliate we now also have the main sites he has been promoting.</p>
<hr />
<strong>Quick tip</strong> – People tend to think micro instead of macro just because you made your site look less spammy on a human glance doesn&#8217;t help if your linking into a large network. Companies and search engines look for large scale patterns and work backwards which is why paid links from brokers are so easy to identify if you have a large enough data set.</p>
<hr />
<p>6 sites some interesting cloaking to provide overly keyword stuff context to Google but on the whole these sites have good landing pages, they are auto generated and producing coupons for every product to get discount. They are also incentivising the offers through a ponzi scheme (the cheeky bugger!) basically incentives were only being issued if a user not only bought the product with the coupon and then recruited 3 other users who did the same. The whole thing was financed through the fact that the affiliate was earning a standard commission regardless of the coupon and so was using the difference to payout in the unlikely case he needed to, I have no doubt when you joined “the club” you would be put on a mailing list and treated to a bombarding of mailshots. The system was ingenious but totally against our clients TOS and depending exactly how he was funding the payouts possibly illegal in the UK. So having ruled out our super affiliate as being the answer we handed over our findings to client for them to deal with and went back to the original question.</p>
<p>With our anomaly ruled out as a way to improve our affiliate conversions it was time to start gathering information on the presale our affiliates were using, this occurred more or less the same way we investigated our first affiliate, in addition each affiliate was sent a very quick questionnaire and a couple of follow up conversations with several of the affiliates and we probably knew more about our clients affiliates then they did! </p>
<p>Our clients affiliates ran into four main groups, the lander, the blogger, the comparer and the  mailing list guru. The landers are affiliates using lots of mini sites to push a certain product often very much into PPC and SEO they push very targeted products. The bloggers run blogs in our clients case they were mainly gadget and sport blogs they would be targeting individual products  but would often reuse our clients site over and over again. They were the group with the most residual traffic after a launch. The comparison websites are big business for affiliates and with some studies showing 1 in 4 people now using a comparison site for finding online purchases its no surprise that 2 out of our clients top 5 affiliates were primarily using comparison sites to drive traffic. Finally the mailing list such a powerful tool in Internet Marketing was actually the least used but as our client mentioned when we started they always knew when 1 of their affiliates released his newsletter by a small spike, the downside this affiliate brought almost no long term traffic.</p>
<hr />
<strong>Quick Tip</strong> – Its interesting to note most big affiliate marketers while they might have fingers in many pies tend to have one major traffic source type even if they have multiple sites and lists. However the top affiliate in this study (ignoring the club owner) used a blog, mailing list combination which in turn had comparison charts.</p>
<hr />
<p>With our affiliates traffic sources and profiles in mind we went back to the split testing to provide a unique and adapting page for every affiliate was not practical so instead we took our 4 traffic source groups and then cross referenced their user profiles the result was 7 diverse profile groups each was then run in an experiment of their own with the original recipes. Results were pleasing with all 7 experiments seeing a large leap in conversions it was interesting that out of the 7 experiments none were exact matches with each other yet our comparison website profile experiment matched the PPC. </p>
<blockquote><p><strong>Explanation</strong> – Ok someone is bound to ask, if we have experiments which rely on profile information and we don&#8217;t have any user data how are we assigning a user profile prior to gathering the information? The answer is we or rather our testing software makes an educated guess, we then refine the results later and push it back into the correct group as a corrupt test. The actual guess is generated using a genetic algorithm with demographic info we do have available introduced we then use that demographic data as a means to cause mutation within our default population, to provide an initial population in both experiments we include two or more mutators (profile indicators) and we can feedback information from successful sales the result should introduce two  or more variants once we have enough feedback information we can remove or scale back our initial mutators.</p></blockquote>
<h3>Alternate Approach – Evolution</h3>
<p>While we do rely on the use of genetic algorithm math to make educated guesses about profiling it can actually be used to generate the entire landing page profiling as an alternative to our experimentation. Rather then considering our source as the population we would consider the page to be the population with our testing variables individuals since genetic algorithms are ideally suited to provide a positive or negative result of an individual within a set we can use the same math to “breed”  our individual (testing variables) to identify strongest matching pairs which can in turn be tested against one or more pairs or individuals. Their have been several interesting experiments on this type of split testing and its something we have found works well on large data sets. However the additional processing power may not justify its use unless the results show a marked improvement in conversions.</p>
<hr />
<strong>Quick Tip</strong> &#8211; A lot of Internet marketers will refer to a statistic technique called  Taguchi Method indeed many see it as a holy grail as it allows you to run experiments with what would be traditionally considered small data sets and return results similar to large scale tests. Don&#8217;t be fooled down this route if your dataset is not large enough to be running large multivariate tests then concentrate on smaller A/B split testing and looking to increase your dataset size any gains in using statistical analysis methods will be minimal at this size range anyway.</p>
<hr />
<p><strong>Quick Tip</strong> – Statistical data collected is temporal it is effected by time an obvious example is a searcher looking for Funny Christmas Hats today (mid December) is likely to be a different profile to one looking in January the search maybe the same as perhaps the intention but chances are the result is not. </p>
<hr />
<h3>Affiliates are not a single group</h3>
<p>The client site now when running a special offer page has 9 experiments running on it reflecting 12 profile groups and 6 distinct traffic sources. The alternate without analysing both the profiles and traffic sources would have been to assume that each traffic source was capable of producing all 12 profiles (which they are but in most cases statistical negligible) resulting in 72 separate experiments. Without profiling we would never have been able to identify personality groups and would have relied on traffic and guess work the key is to remember that testing is only useful if you know what variables are actually controlling the test. </p>
<hr />
<p>I hope this little study has helped people to see the potential of mining your data and that of your affiliates but its really be very much theory based so here are some handy and a bit more practical tips.</p>
<h3>Generating Profiles</h3>
<p>You cannot know your customers individually and eventually you have to accept that you are going to have to start grouping them. Using their spending habits along with other demographic information you should be able to split your users into between 6 to 18 distinct groups. Once you have your profile groups you can target and market to these groups.</p>
<h3>Pool your data externally</h3>
<p>Most sites do not have all their customer data in one place, apart from order and purchase logs, stats will be held elsewhere and questionnaire data in yet another location. Keeping in mind the concept of temporal data it is far easier to take snap shots of your data sources and pool them for analysis while most work is automated initial analysis and some interim work still needs to be compiled so having a stats package handy is always good. While I know most can&#8217;t justify a copy of Crystal, Matlab or SPSS there are free alternatives that will work well for you such as the <a href="http://www.gnu.org/software/pspp/">open source PSPP</a></p>
<hr />
<strong>Quick Tip</strong> – Make sure your temporal data is in the same timezones, particularly for non US users who servers are in the US.</p>
<hr />
<h3>Is it a gift?</h3>
<p>During the purchasing process its a great idea to understand the reason for purchase this simple easy to answer question allows us to discard the user within our normal product profiling, whats more it allows a simple upsell for adding wrapping and alternate delivery at a small cost.</p>
<h3>Interrogate your data not your users</h3>
<p>Questionnaires are a great source of information though most opinion is subjective the demographic data is not but  people who answer such questions will lean into one or more profile groups. When asking questions to users keep it short and sweet you are looking for ways to “enhance” their future experience not put them off ever shopping with you again.</p>
<h3>Think Tescos – Unique Coupons</h3>
<p>If I have a set of profiles which are unique from purchase history and I have a users purchase history it should be easy enough to provide a unique coupon designed to encourage them back to the store. Tesco are experts at this through their clubcard but very few online retailers use targeted coupons mainly because of delivery methods. But by providing a monthly newsletter with generic material chosen based on purchase history with unique coupons based on purchase history and profile a user can be tempted back.</p>
<p>A simple example 2 users both interested in computers both purchased several items in the past. Customer A is an impulse buyer – his coupon is a discount % of a single product the discount is substantial but the site makes money on the chance that he will add more items to the trolley at the full price. Customer B is a bargain hunter he is presented with a % of all items which is significantly lower then customers A percentage off. However customer B is likely to purchase only a couple of items and so the store wishes to retain some markup on the items.</p>
<p>The downside to this method is the sheer amount of data that is required plus maintaining a high delivery method (assuming email this means running your own mailing list system not using a 3rd party like Aweber) </p>
<p><em>What methods do you use for profiling users? Do you think we can pigeon hole customers into 6-18 little groups?<br />
</em></p>
<div id="vs-message">
<strong>Stuff Consulting</strong><br />
Are you interested in Profiling and Grouping your users then why not think about hiring a Stuff Consultant! See my <a href="http://www.timnash.co.uk/consulting/">consulting services</a> for more information or why not <a href="http://www.timnash.co.uk/contact/">get in touch</a>!</div>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/12/2008/profiling-multivariate-landing-page-users/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Flash SEO Indexing Revisited Part 1</title>
		<link>http://www.timnash.co.uk/07/2008/flash-seo/</link>
		<comments>http://www.timnash.co.uk/07/2008/flash-seo/#comments</comments>
		<pubDate>Tue, 15 Jul 2008 15:49:13 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Search Marketing]]></category>
		<category><![CDATA[SEO Introduction]]></category>
		<category><![CDATA[Flash]]></category>
		<category><![CDATA[Flash SEO]]></category>
		<category><![CDATA[FlashSEO]]></category>
		<category><![CDATA[SWF SEO]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=179</guid>
		<description><![CDATA[Flash is indexable we all know this, but is it actually a reliable medium for use on the web, can it rank for competitive keywords. How would such rankings work? Are the Links Followable indeed is Flash the hidden gem of SEO? No I didn't think so either but you never know!]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.timnash.co.uk/wp-content/uploads/2008/07/flashdevil.jpg" alt="Flash SEO" /><br />
Yes given that Google/Yahoo/Adobe made<a href="http://www.timnash.co.uk/07/2008/swf-indexing/"> sweeping announcements</a> I thought it was worth revisiting my Flash SEO tests to see how they are doing and introduce some new tests.</p>
<p>For those that are not sure what this is about, Adobe and Google claim Flash is now &#8220;<a href="http://www.timnash.co.uk/07/2008/swf-indexing/">indexable</a>&#8221; of course since most SEOs are sheep they spent a lot of time cheering even those who were sceptical still clapped liked penguins. For those of us who have an interest in Flash Google announcement was nothing more then a minor update to farcical heap of dung. But lets ask some very basic questions:</p>
<ul>
<li>Is Flash indexable?</li>
<li>Does Flash Rank for terms?</li>
<li>Is a Flash Link Followed?</li>
<li>Does a Flash Link pass juice?</li>
</ul>
<h3>So back to the original tests&#8230;.</h3>
<p>Back in February I updated a post (<a href="http://www.timnash.co.uk/02/2008/swf-seo/">Flash the SEO Devil?</a>) from previous April discussing a set 30 tests over 4 domains to see what information the search engines saw from a Flash file.<br />
It looked like&#8230;</p>
<table style="text-align: center;" ;="" padding:5px;="">
<thead>
<tr>
<th>&nbsp;</th>
<th>Google</th>
<th>Yahoo</th>
<th>MSN/Live</th>
<th>Ask</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: left;" ;="">File Name</td>
<td>Yes</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
</tr>
<tr>
<td style="text-align: left;" ;="">Text</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td style="text-align: left;" ;="">Text As Graphic</td>
<td>Yes</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td style="text-align: left;" ;="">Text As Symbol</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td style="text-align: left;" ;="">Text With Links</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td style="text-align: left;" ;="">Pages via SWF</td>
<td>No</td>
<td>No</td>
<td>No</td>
<td>No</td>
</tr>
<tr>
<td style="text-align: left;" ;="">Deep links (swfaddress)</td>
<td>Yes</td>
<td>Yes</td>
<td>No*</td>
<td>No</td>
</tr>
</tbody>
</table>
<p><em>Source:<a href="http://www.timnash.co.uk/02/2008/swf-seo/">http://www.timnash.co.uk/02/2008/swf-seo/</a></em></p>
<h2>Flash SEO Now</h2>
<p>Not pretty basically ignore Ask and they all could manage to do something, but none of them could really deal with anything complex such as converting text into a symbol. Also multiple screens and points on timeline caused lots of problems, and then we got to passing link juice it just didn&#8217;t happen.</p>
<h3>What&#8217;s it like now</h3>
<p>So 6 months on and a major announcement later&#8230;</p>
<p><strong>Drum roll please</strong><br />
yes you have guessed it nothing has changed!</p>
<p>Actually that is not true Google now doesn&#8217;t index 2 of our test files but it does find 3 extra lines of content in one of our files. So lets taker a closer look at Googles implementation of the Adobe Search Indexing Solution:</p>
<h2>Flash Indexing Today</h2>
<h3>Is Flash indexable?</h3>
<p>Yes. Generally text elements held in a single screen or scene within a Timeline that uses limited actionscript can be full extracted. Complex actionscript can cause some problems but on the whole textual content can be got at but when it is extracted no positional (within the timeline)  information appears to be passed. </p>
<h3>Flex Indexable?</h3>
<p>Yes, because of a change in the Search SDK to call SWF within a player and allowing it to make its calls first most Flex applications can be indexed this is actually a pretty big thing if the rest of it had been sorted. But <strong>Server Side Flash</strong> is the one area where improvement has been made.</p>
<h3>FLV Subtitles Indexable?</h3>
<p>Ok this was never going to happen but we can only hope.</p>
<h3>Flash Paper</h3>
<p>Well this died when Macromedia were bought by Adobe and was a bit to close to PDF for comfort however if I was given the choice of PDF or Flash Paper, <a href="http://www.timnash.co.uk/02/2008/pdf-seo/">PDFs are indexed</a> and links followed so are probably preferable from an SEO point of view. </p>
<h2>Does Flash Rank for Competitive terms?</h2>
<p>Not without a serious amount of external links and it never will until Adobe stops pandering to Flash Designers desire not to have to change their habits, for more information why see Adobe You Just Get Search. But Lets look at the factors that help rank pages, excluding external links&#8230;</p>
<h3>Is a Flash Link Followed?</h3>
<p>The original SDK would find all the links in a Flash file and put them in a text file minus anchor text for the search engine to do what ever they wanted to. The new version appears to do the same thing, which means that links do not appear to be followed.</p>
<h3>Does a Flash Link pass juice?</h3>
<p>Do I have to even comment if the links are not being followed then they can&#8217;t pass any link juice, but if the new Search mechanisms work in the same way as the old SDK then the link would be stripped from its context anyway so even if it was followable it wouldn&#8217;t pass any serious juice.</p>
<h3>Other Flash Ranking factors</h3>
<p>Well given Adobe has not provided a mechanism to mimic heading tags etc this is pretty much none. However this is the place where Adobe and Google could work hand in hand and truly make Flash and more importantly Flex searchable but it does require changing designer habits.</p>
<h2>Flash SEO Tests next generation</h2>
<p>So now its time to think of some of the problems flash will face even beyond those we initially posed (and which it failed) So here are some of the current tests I am running and will return with the results in part 2.</p>
<ul>
<li>If an SWF is embedded using the embed tag but not linked to will it be indexed?</li>
<li>What about the current crop of Javascript Players?</li>
<li>If in robots.txt file the Flash file is blocked but the embedded page is not is it indexed?</li>
</ul>
<h3>Yahoo and Flash</h3>
<p>Always the bridesmaid never the bride they seem to have been missed out of all the major announcements yet they also have access to the new technology (unlike Microsoft) but it would appear have yet to implement the new features as of writing.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/07/2008/flash-seo/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>Where on the page are your links from?</title>
		<link>http://www.timnash.co.uk/06/2008/on-page-link-location/</link>
		<comments>http://www.timnash.co.uk/06/2008/on-page-link-location/#comments</comments>
		<pubDate>Mon, 02 Jun 2008 11:28:20 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Advanced SEO]]></category>
		<category><![CDATA[Search Marketing]]></category>
		<category><![CDATA[link mapping]]></category>
		<category><![CDATA[link patterns]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=153</guid>
		<description><![CDATA[In a previous post I discussed Link Worth and prior to that a technique called Block Segmentation Analysis, in which I briefly discussed some research done by Microsoft into Visual Page Segmentation where rather then using the DOM they use actual images rendered from a browser to determine what are blocks. While this is overkill [...]]]></description>
			<content:encoded><![CDATA[<p>In a previous post I discussed <a href="http://www.timnash.co.uk/05/2008/link-worth/">Link Worth</a> and prior to that a technique called <a href="http://www.timnash.co.uk/05/2008/block-segmentation-analysis/">Block Segmentation Analysis</a>, in which I briefly discussed some research done by Microsoft into Visual Page Segmentation where rather then using the DOM they use actual images rendered from a browser to determine what are blocks. While this is overkill for most I was intrigued enough to create our own version with Opera, Imagemagick and some Python but the purpose for my little experiment was not to look for blocks (I had already identified them through the DOM) but the location of individual links on a page.</p>
<p>I have a whole other post dedicated to the subject of how, but quickly each page was grabbed, all block elements (div,span etc) were turned one colour and stripped of all other info to make life easier the block we wish to identify was turned red. Then the page had a grid overlayed (24&#215;24) and each square was checked for colour of it&#8217;s content. </p>
<p>With a working mechanism we then went and pulled a 100 SEO sites (Sites which advertised services rather then just blogs though many did have a blog as well) backlinks (we removed links that were internal and capped it at 200 per site) and mapped to our grid. The result (17680 backlinks later) is the ability to see popular locations for inbound links to SEO websites, which may or may not interest you.</p>
<p>The following is a pseudo heat map, blue is the limited number of links, orange a reasonable amount red is a lot.<br />
<img src="http://www.timnash.co.uk/wp-content/uploads/2008/06/heatmap.jpg" alt="pseudo heat map of links on inbound link pages" /><br />
As you can see, the first thing is there is a lot of blue, the red areas are primarily in side bars, footer and inching into what would be comment area on blogs. The orange is a bit more evenly spread out, of interest though is the lack of serious link cluster in page content themselves. The left hand side is interesting there has been a trend away from left hand side bars and so comments in a blog may be counted in that left hand side.</p>
<p>Useless trivia, out of the 100 SEO sites we tested:</p>
<ul>
<li>86  had 1 or more inbound link coming from footer area</li>
<li>96 had 1 or more inbound link coming from the right hand side sidebar</li>
<li>45 had 1 or more inbound link coming from left hand side</li>
<li>15 had no inbound links in content area on any sites</li>
<li>36 sites had 1 or more inbound links with only image as an anchor</li>
<li>98% of inbound links found in footer to our SEO sites site wide</li>
<li>99% of right side bar links found to our SEO sites are site wide</li>
<li>40% of left had side links found to our SEO sites are site wide</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/06/2008/on-page-link-location/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Clickthrough Experiment Update</title>
		<link>http://www.timnash.co.uk/05/2008/clickthrough-experiment-update/</link>
		<comments>http://www.timnash.co.uk/05/2008/clickthrough-experiment-update/#comments</comments>
		<pubDate>Sat, 17 May 2008 09:35:55 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Search Marketing]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=144</guid>
		<description><![CDATA[A couple of weeks ago I came across an experiment which in brief suggested you could improve your click through rate on SERPs by making them stand out. This would seem to make sense and his figures suggested a staggering increase. So I set out to replicate the feat in a slightly more controlled manner. [...]]]></description>
			<content:encoded><![CDATA[<p>A couple of weeks ago I came across an experiment which in brief suggested you could improve your <a href="http://www.connectedinternet.co.uk/2008/04/22/how-to-whip-the-competition-in-organic-search-results/" rel="nofollow">click through rate on SERPs</a> by making them stand out. This would seem to make sense and his figures suggested a staggering increase. So I set out to replicate the feat in a slightly more controlled manner.</p>
<p>You can read how the experiment was set up on the previous <a href="http://www.timnash.co.uk/04/2008/improve-organic-clickthrough/">click through article</a>. One thing I should add both sites are in the UK and we only monitored the click through from Google.co.uk and not .com or other regional, this makes the data set much smaller but also more reliable.</p>
<p><strong>Remember this is nothing to do with higher rankings, merely attempting to improve click through rate from results pages once your already ranked.</strong></p>
<h3>UPPERCASE EXPERIMENT</h3>
<p>So to recap we changed the case of the content in the <strong>title tag</strong> to be entirely upper case. In both experiments we had clients with 2 sites in the same SERP so that we could provide a norm, and experimented on the lower ranking of the two.<br />
<img src="http://www.timnash.co.uk/wp-content/uploads/2008/05/uppercase.jpg" alt="Graph of uppercase experiment" /><br />
So the top blue line is our higher ranking site for the two weeks the uppercase was in effect (remember this is our norm and hasn&#8217;t been changed in any way but ranks higher so will have a higher ctr), the cream brown line is the rate of the two weeks for the site we have uppercase, while the cyan blue/green is the previous two weeks.</p>
<p>The fluctuations may seem large but keep in mind the number of clicks from the highest peek and trough on the blue is just 8. </p>
<p>Clearly our UPPERCASE title has not done as well in the two weeks compared with the previous two and it&#8217;s interesting to note its peeks and troughs are in a slightly different pattern to our higher ranking site. All in all for the two weeks it consistently under performed.</p>
<h3>UPPERCASE and Click Me </h3>
<p>In this experiment we added a Click here for wording before the title with the entire title in uppercase, the idea give some one a direct call to action and they will follow. To make life more interesting the product related to this SERP was featured on a TV program in Canada causing a small spike in CTR even on the .co.uk with a much larger one in google.com.<br />
<img src="http://www.timnash.co.uk/wp-content/uploads/2008/05/clickhere.jpg" alt="Click here results" /><br />
Like before the Blue is our higher ranking norm (notice the spike), the mocha brown line is the same week as the blue with our call to action and uppercase, while the cyan is the previous two weeks</p>
<p>Our call to action does not on first glance to be performing badly but keep in mind the 25% increase in the period for searches related to our product and the click through is looking less impressive.</p>
<h3>What did visitors think</h3>
<p>To assist with the experiment we asked a few visitors arriving via google.co.uk if the experiments had made a difference to them, we started by showing them a picture of the Google result for the site from the previous two weeks and the current look asking them which they had clicked on.</p>
<ul>
<li>Uppercase results &#8211; 90% remembered clicking through the uppercase version</li>
<li>Click here results &#8211; 100% all remembered they clicked through the click here version</li>
</ul>
<p>Next we showed a generic result, and asked which element first drew them to our site<br />
<img src="http://www.timnash.co.uk/wp-content/uploads/2008/05/generic.jpg" alt="generic" /></p>
<ul>
<li>Uppercase results &#8211; 70% Said it was the title that had inspired them</li>
<li>Click here results &#8211; 60% Said it was the title</li>
</ul>
<p>Finally we asked if the site was the first site they had clicked on the search page?</p>
<ul>
<li>Uppercase results &#8211; 50% Clicked the test site first</li>
<li>Click here results &#8211; 75% Clicked the test site first</li>
</ul>
<h3>What about sales?</h3>
<p>Sales figures are only half available at the moment but the conversion rate has seen a slight performance increase in the click here experiment, but remember this is more likely to be related to the Canadian coverage.</p>
<h3>So at the halfway mark what have we learnt?</h3>
<ul>
<li>Neither experiment is showing improved results, indeed the click through rate is lower</li>
<li>Both experiments were distinctive enough for people to remember the &#8220;odd titles&#8221;</li>
<li>The survey indicates the titles do make the result stand out</li>
<li>Far to many Brits watch Canadian TV!</li>
</ul>
<p>Before I go, no I will not be revealing the keywords, site information or client info. A) the experiment is still on going, B) Both clients have been kind to let me run the experiments and deserve praise but both have asked for privacy. So come back in a couple of weeks for the full results.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/05/2008/clickthrough-experiment-update/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
	</channel>
</rss>

