<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tim Nash "stuff" Blog &#187; Behaviour modelling</title>
	<atom:link href="http://www.timnash.co.uk/category/behaviour-modelling/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.timnash.co.uk</link>
	<description>The Non SEO Consultant</description>
	<lastBuildDate>Tue, 20 Jul 2010 12:53:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Data Heaven &#8211; World Bank Opens it&#8217;s data</title>
		<link>http://www.timnash.co.uk/04/2010/world-bank-open-dat/</link>
		<comments>http://www.timnash.co.uk/04/2010/world-bank-open-dat/#comments</comments>
		<pubDate>Wed, 21 Apr 2010 15:07:11 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Behaviour modelling]]></category>
		<category><![CDATA[Programming]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=563</guid>
		<description><![CDATA[Tim introduces some of the amazing finds within the World Banks newly released Open Data platform, and explains why he thinks such openness benefits everyone.]]></description>
			<content:encoded><![CDATA[<p>Do you like demographic data? Like to make infographics or just love stats? Then I have a fantastic resource for you!</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="400" height="265" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://vimeo.com/moogaloop.swf?clip_id=11059994&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=ffffff&amp;fullscreen=1" /><embed type="application/x-shockwave-flash" width="400" height="265" src="http://vimeo.com/moogaloop.swf?clip_id=11059994&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=ffffff&amp;fullscreen=1" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p><a href="http://data.worldbank.org" >The World Bank</a> has opened up an awful lot of its data. Here are some of the interesting stats from their “World Development Indicators”. There is a caveat, however. The data is out of date since the 2009/2010 data has yet to be released.</p>
<p>Here are some examples from the 2008 version:</p>
<h3>Top 10 Countries by Number of Patent Application (by residents)</h3>
<ol>
<li>Japan - 330,110</li>
<li>USA - 231,588</li>
<li>China - 194,579</li>
<li>South Korea - 127,114</li>
<li>Germany - 49,240</li>
<li>Russia - 27,712</li>
<li>United Kingdom - 16,52</li>
<li>France - 14,743</li>
<li>Canada – 5,061</li>
<li>Spain - 3,632</li>
</ol>
<h3>Top 10 Countries by Internet Users (Per 100 people)</h3>
<ol>
<li>Iceland – 90</li>
<li>Sweden – 87</li>
<li>Netherlands – 86</li>
<li>Denmark – 83</li>
<li>Norway – 82</li>
<li>Finland – 79</li>
<li>Bermuda – 79</li>
<li>Luxembourg – 76</li>
<li>United Kingdom – 76</li>
<li>Switzerland – 76</li>
<li>United States – 76</li>
</ol>
<p><em>Note: The numbers are rounded. See data.worldbank.org for exact figures. Also bare in mind that was in 2008</em></p>
<h3>Top 10 Countries by cost of Mobile phone subscription plan</h3>
<p>This could be used as an alternate to <a href="http://www.timnash.co.uk/04/2010/working-out-multi-currency-prices-with-burgers/" >Big Mac index</a> and  potentially for subscription services, which require bandwidth usage. I want to note that countries at the top of the list are there principally because the infrastructure cost is being passed on to customer.</p>
<blockquote><p>The plan is defined as “which includes the cost of monthly mobile use for 25 outgoing calls per month spread over the same mobile network, other mobile networks, and mobile to fixed-line calls and during peak, off-peak, and weekend times as well as 30 text messages per month.”</p></blockquote>
<ol>
<li>Brazil - $37</li>
<li>France - $36</li>
<li>Switzerland - $36</li>
<li>Spain - $33</li>
<li>Japan - $32</li>
<li>Australia - $26</li>
<li>Portugal - $26</li>
<li>Greece - $25</li>
<li>Venezuela, RB - $25</li>
<li>Austria - $24</li>
<li>New Zealand - $23</li>
<li>Cuba - $23</li>
</ol>
<p>For those interested the UK came in at $20 while the USA $15</p>
<h3>Why is this data interesting?</h3>
<p>Well, it's the same data one of the largest data mining groups use to determine how solvent, economically viable, and wealthy a country is. However, it's also a great snapshot of life in different countries. While it may have no direct use in eccomerce, it can be used to weigh several factors. When entering new markets, determining cell phone costs and comparing them to how much money people have to spend will ultimately tell you how big a market it is.</p>
<p>One of the more interesting sections for me is '<a href="http://www.doingbusiness.org/economyrankings/" >Economic Rankings</a>'. It's a fascinating place for anyone looking to start, run and close companies abroad. Each country is ranked by various factors ranging from how easy it is to set up a company to how hard it is to sue people. This is actually a whole microsite in its own right.</p>
<h3>Getting hold of the data</h3>
<p>All this data and hundreds of other indicators and economic data is available from data.worldbank.org. They have an API as well as XML, Excel and various other data types including RFD. I can’t wait to see this data being used! Together with the national data sources for the UK and the USA, its never been easier for data lovers to get their fix!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/04/2010/world-bank-open-dat/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Simple Buyer Mining</title>
		<link>http://www.timnash.co.uk/04/2010/simple-buyer-mining/</link>
		<comments>http://www.timnash.co.uk/04/2010/simple-buyer-mining/#comments</comments>
		<pubDate>Wed, 21 Apr 2010 07:52:49 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Behaviour modelling]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=553</guid>
		<description><![CDATA[Tim discusses what Mike Hill calls Buyer Mining, Tim calls common sense and shows a simple example for how any eccomerce owner paying for leeds on Facebook or similar can benefit from looking at their demographics]]></description>
			<content:encoded><![CDATA[<p>Getting visitors to your site is easy, if you have cash. Getting the right visitors to your site...well, that takes a little more time and effort.</p>
<p>Over the weekend, <a href="http://andybeard.eu"  title="Andy Beard">Andy Beard</a> pointed me to a <a href="http://mikehillsblog.com/" >Mike Hill</a> video on 'Buyer Mining', which was a response to an interview Techcrunch did with two ex Googlers who started the ecommerce company '<a href="http://tellapart.com/"  rel="nofollow">Tell Apart</a>'. It provides sites with leads based on buyer demographic data. They are so confident in their service and the ability to provide such leads, they only charge a percentage of the revenue increase. Think 'affiliate style CPA' here.</p>
<p>Still with me? Cool, because after the video, I’m going to demonstrate how you, as an ecommerce owner and affiliate, can do something similar, for a fraction of the budget.</p>
<p><object id="flashObj" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="480" height="297" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="bgcolor" value="#FFFFFF" /><param name="flashVars" value="videoId=77912217001&amp;linkBaseURL=http%3A%2F%2Ftechcrunch.com%2F2010%2F04%2F13%2Fformer-googlers-launch-tellapart-raise-4-75-million-from-greylock%2F&amp;playerID=63890987001&amp;domain=embed&amp;dynamicStreaming=true" /><param name="base" value="http://admin.brightcove.com" /><param name="seamlesstabbing" value="false" /><param name="allowFullScreen" value="true" /><param name="swLiveConnect" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://c.brightcove.com/services/viewer/federated_f9/63890987001?isVid=1&amp;isUI=1" /><param name="name" value="flashObj" /><param name="flashvars" value="videoId=77912217001&amp;linkBaseURL=http%3A%2F%2Ftechcrunch.com%2F2010%2F04%2F13%2Fformer-googlers-launch-tellapart-raise-4-75-million-from-greylock%2F&amp;playerID=63890987001&amp;&amp;domain=embed&amp;dynamicStreaming=true" /><param name="allowfullscreen" value="true" /><embed id="flashObj" type="application/x-shockwave-flash" width="480" height="297" src="http://c.brightcove.com/services/viewer/federated_f9/63890987001?isVid=1&amp;isUI=1" name="flashObj" allowscriptaccess="always" swliveconnect="true" allowfullscreen="true" seamlesstabbing="false" base="http://admin.brightcove.com" flashvars="videoId=77912217001&amp;linkBaseURL=http%3A%2F%2Ftechcrunch.com%2F2010%2F04%2F13%2Fformer-googlers-launch-tellapart-raise-4-75-million-from-greylock%2F&amp;playerID=63890987001&amp;&amp;domain=embed&amp;dynamicStreaming=true" bgcolor="#FFFFFF"></embed></object></p>
<h3>Buyer Mining</h3>
<p>So, what is the 'buyer mining' Mike Hill talks about? Well, put simply, it's looking at key buyer demographics to identify patterns. Some maybe fairly obvious such as the average Tampon buyer being either in the military/Female or both. Others may not be so easy.</p>
<p>Some of the key demographics to grab would include geographic location, age, and gender.  In addition, we want to look at the average basket price and whether they are returning customers. Here is Mikes explanation:</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="400" height="224" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.facebook.com/v/399838503816" /><embed type="application/x-shockwave-flash" width="400" height="224" src="http://www.facebook.com/v/399838503816" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>Now, the problem is to be statistically reliable, you need a lot of data. In fact, you'll need more than your average ecommerce store can produce.  Still, even with a limited data set, you can make some choices when it comes to lead acquisition.</p>
<h3>Simple Buyer Mining</h3>
<p><strong>Scenario</strong><br />
Let’s assume you are a small/medium ecommerce store selling red widgets, and one of your lead acquisition methods is Facebook ads. These can be a great for targeting users, but it's often hard to tap into and they are fairly expensive, unless you are making a decent return.</p>
<p>Broad Matching Demographic, while possible, is often ineffective. So, to make best use of Facebook ads, we need: a location, a gender, and ideally, an age range. Two out of the 3 are very easy for any company to provide retrospectively, and well, 2 out 3 ain't bad now, is it?</p>
<h3>Determine Geographical Location</h3>
<p>Virtually all ecommerce systems will  store the IP address of the purchase, which can be used to determine location within a few metres in some cases. More likely, it will resolve to a city or a country, at minimum. Be wary, however, as using an IP is not always accurate. For example, my home IP address for many years claimed I lived in London, when I was 200 miles away.  At best, it will offer country accuracy with potential city/town. The exception is AOL customers, which some IP systems identify as being in the US regardless of where they are in the world. In addition, you can always take location from the address given at the time of purchase. This will be more accurate, but might only be available for non digital products.</p>
<h3>Gender</h3>
<p>There are lots of ways to determine gender, and this would be a great place to plug my new tools, except they are not ready yet. Here is a simple way to determine gender in historical data:</p>
<p>From your purchase logs, pull all the first names, and pass them through a gender dictionary to identify female/male names. With the exception of a few Kims, etc. most western names will be one gender or another, and therefore, those running Western based sites should be able to process and get  gender demographic quickly and easily. You can do the same for Arabic, but Chinese names are a bit more problematic. Names from some Asian countries and the Indian Subcontinent, however are generally a nightmare to work out this way.</p>
<h3>Age</h3>
<p>Remember I said 2 out of 3? While you can analyze names to get age,  it is really too unreliable for use. Likewise, browser history is not an option, which leaves only one choice: asking the user for his or her age.  Sadly, when looking at historical stats this is not much help. It is also one of the most useful demographics. So much so, I try to encourage all my clients to gather it. For subscriptions and certain products, this is easier to warrant, and obviously you do not want to much cluttering your sales flow, but I strongly suggest you include it even as an optional field.</p>
<h3>Facebook Ad fun</h3>
<p><img src="http://www.timnash.co.uk/wp-content/uploads/2010/04/facebook.png" alt="facebook ad" /></p>
<p>Once we have the demographic information, it’s time to do some mining and try to identify trending patterns. Specifically, we want to generate targeted ads.</p>
<p>From experience, the best place to start is with the demographic you should definitely have: location. Depending on your data size, the more you have, the tighter your geographical targeting should be.  With this in mind, take your data and split it into geographic regions. For each region, split the data by gender and age, if you have it.  The biggest of these groups will be your first target for Facebook ads. From there, it’s simply a case of setting up one or more ads within Facebook for these groups.</p>
<h3>Split Testing</h3>
<p>It goes without saying that it’s worth doing straight a/b testing, but it’s worth remembering different groups will react differently, so do split tests for each separate profile group. The results maybe different or  you may have a clear winner across all the profiles.  Don’t forget to cookie your test groups if they purchase at a later date so you have the split test data. (It’s worth recording regardless to see if a group has a tendency to purchase later.)</p>
<h3>Affiliates</h3>
<p>I mentioned the above could be used with affiliate marketers, as well as ecommerce owners. This is sort of true, but the ecommerce owner will need identify which of your leads resulted in sales, otherwise you will be grouping blind.</p>
<p>Of course,  you can do a blind variation by determining the gender of each lead through CSS history attack. To do this, use the IP for location, split your users up, and test each group to see which is more likely to purchase. However, this could prove to be an expensive option.</p>
<div id="vs-message">
<strong>Consulting</strong></p>
<p>Looking to work out demographic data from your product logs? Need help setting up campaigns similar to what is described above well why not chat to me. For more details please <a href="http://www.timnash.co.uk/contact/"  >contact me</a> or look on my <a href="http://www.timnash.co.uk/consulting/"  >consulting services</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/04/2010/simple-buyer-mining/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Working out Multi Currency prices with burgers</title>
		<link>http://www.timnash.co.uk/04/2010/working-out-multi-currency-prices-with-burgers/</link>
		<comments>http://www.timnash.co.uk/04/2010/working-out-multi-currency-prices-with-burgers/#comments</comments>
		<pubDate>Tue, 06 Apr 2010 10:17:20 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Behaviour modelling]]></category>
		<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Programming]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=532</guid>
		<description><![CDATA[Tim takes a look at how to automate your subscription values across multiple currencies using burgers and IPods. Who would have thought counting burgers would be useful!]]></description>
			<content:encoded><![CDATA[<p>This is an expanded version of a post I published in the <a href="http://huomah.com/dojo/" >SEO Dojo</a> and mentioned in <a href="http://www.timnash.co.uk/03/2010/hiding-in-the-dojo/" >this post</a>, which was a rather obvious hint for you to join the Dojo. <img src='http://www.timnash.co.uk/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  Since then, I have literally had one email asking about it, so I decided to expand the forum post and publish it here.</p>
<p>I wish to pose a simple question:</p>
<h3>How would you determine the price of a product or service for multiple countries?</h3>
<ul>
<li>You could go through each country and work out a value.</li>
<li>You choose a base currency and do real time conversions.</li>
<li>You could use comparable item counting.</li>
</ul>
<p>Each method has merits, but they are reliant on a lot of manual and repetitive work. They may also cost you when the method fails to take local conditions such as taxes into account.</p>
<p><strong>Work out individual valu</strong>e - This is a long process, but probably the best when working with 1 or 2 currencies in familiar markets. For example, I'm likely to set Pounds, Dollars and Euros because I am familiar with costs in the US and in Europe, but I'm not so sure about prices in South Korea, for example.</p>
<p><strong>Real time counting</strong> - (Determining a price in a default currency and converting it into alternative currencies on the fly.) This method has several issues: the price will change constantly, people are nervous of products with constant price changes, and it doesn't take local taxes and other costs into account. Customers in the EU, for instance, wouldn't be charged VAT.</p>
<p><strong>Product Counting</strong> - (Finding a product similar to your own and using its price in each country to set your own.) The problem is that the product has to be the same in each country, and it doesn't account for import costs.</p>
<p>So how do burgers help? Well, let's start with an example:</p>
<p>I'm running a subscription site in multiple countries, and I want to display subscriptions to people in their own language and currency.</p>
<h2>Identify the costs per country</h2>
<p>For each currency, identify any additional costs such as sales taxes, shipping , and any additional costs associated with selling products or services in a country. If it costs you more to supply a country, be sure to include it in your price.</p>
<h3>Choose a stable currency as your base</h3>
<p>This does not have to be your country's currency, since it's just something to work from. For instance, if you use the Big Mac index, the default currency is dollars, so you may wish to use dollars. Or, you might choose the currency you do your accounts in.</p>
<h3>Define the default currency levels</h3>
<p>Next, work out the subscription levels in your default currency. Lets say we use $10, $20 and $30 as our levels. Obviously, how you determine the initial price is up to you, but I suggest testing multiple prices before settling on one. Also, remember when choosing a currency conversion system that you can override it when you need to.</p>
<h3>Start counting Big Macs</h3>
<p>No, seriously! A Big Mac is a perfect comparison tool. It's made of the same ingredients in every country, and is competitively priced. This means that, while McDonalds makes a profit on each burger, it's carefully calculated to be roughly the same % in ever country. For a list of the costs of a Big Mac in every country, you'll need a subscription to the Observer, or you can go to <a href="http://www.oanda.com/currency/big-mac-index" >www.oanda.com/currency/big-mac-index</a></p>
<p><img src="http://www.timnash.co.uk/wp-content/uploads/2010/04/2010NAC145.jpg" alt="Big Mac Index" /><br />
<em>source: <a href="http://www.economist.com/daily/chartgallery/displaystory.cfm?story_id=15715184" >Economist</a></em></p>
<p>Our next step is to count how many Big Macs are in our subscription. So, a Big Mac in the US costs $3.57 so 10/3.75, and we would need to multiply it by approximately 2.8 to equal our subscription. So, to determine the relative value of a $10 subscription in other countries, we find the value of a Big Mac and multiply it by 2.8.</p>
<p>Lets do a simple example: A Big Mac in the UK is 2.29, and when we multiply that by 2.8, it would be 6.412.</p>
<p><code><br />
function bigmac($value,$cost,$rate){<br />
	return round((floor($value/$cost) * $rate),0,PHP_ROUND_HALF_UP);<br />
}<br />
</code></p>
<p><em>For PHP users a quick function where $value is default subscription, $cost of the burger in the default country and $rate is the current exchange rate. <strong>Works only currencies with similar exchange rates</strong></em></p>
<h3>The real economics of the Big Mac index</h3>
<p>The example above is an extremely simple way to use the Big Mac Index. Economists will use the index to determine the <a href="http://currencyforthelongterm.wordpress.com/ppp/" >purchasing-power parity</a>, which was first proposed by Gustav Cassel. (Purchasing-power parity is when a countries exchange rate is determined by the price of a shopping basket from a super market)</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="640" height="385" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/6F9xIj1YDxo&amp;hl=en_US&amp;fs=1&amp;" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="640" height="385" src="http://www.youtube.com/v/6F9xIj1YDxo&amp;hl=en_US&amp;fs=1&amp;" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>The video from the (http://www.metalproject.co.uk/) explains how economists can compare Big Macs by country to determine the relative value of almost anything. However, this rate often varies wildly from the actual rate, which is affected by many other factors.</p>
<h3>Always round up</h3>
<p>Now, of course you'll want to convert 6.412 to something a little less programmed, so round to 7,13,19. You'll notice in this example that the bigger the value is, the more of a discount it's getting, and the further it will drift from market demand.</p>
<p>Low inflation currencies probably should be rounded to the next 5 or 0 to maintain a nice pattern.</p>
<p>Note: This only works for low inflation currencies, so don't try this with Zimbabwean dollars. In the example, our Pounds are now 10, 15, 20. Suddenly, the site is making a profit on the lower price and losing on the higher price. Ultimately, it works out even, so long as there is an even distribution.</p>
<h3>What about larger subscriptions</h3>
<p>Sometimes, Big Macs are just to small to be useful. As you saw in the previous examples, the more Big Macs you work with, the greater the rounding error  will be. So, when dealing with larger numbers, we could use an alternate index.</p>
<p>One possible example is the <a href="http://www.comsec.com.au/public/news.aspx?id=809" >iPod Index.</a> Initially proposed back in 2007 by Comsec, the iPod has become as ubiquitous as the MP3. Unfortunately, they do not maintain the index, but it's possible to recreate it using the Apple store and some googling.</p>
<h3>Other methods</h3>
<p>One technique I've used is what I call 'over pegged middle'. First, you need to know the number of subscribers for each level and the total income they bring in. Then, whichever total brings in the most income is the one you do the Big Mac check against.</p>
<p>Next, round that figure up to a 5 or zero and match the other subscriptions by using the same ratio as the original subscription. This will cause an imbalance because the majority of users will pay more, but it will also make higher earning subscriptions more tempting.</p>
<p>Any suggestions for how you would work out subscription prices?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/04/2010/working-out-multi-currency-prices-with-burgers/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Centralising your Analytics in a decentralised way</title>
		<link>http://www.timnash.co.uk/03/2010/centralizing-analytics/</link>
		<comments>http://www.timnash.co.uk/03/2010/centralizing-analytics/#comments</comments>
		<pubDate>Mon, 29 Mar 2010 11:14:37 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Advanced SEO]]></category>
		<category><![CDATA[Behaviour modelling]]></category>
		<category><![CDATA[Case Studies]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=517</guid>
		<description><![CDATA[While getting ready for timnash.co.uk 3.0 Tim tries to tackle the problem of following a user through out his various stats packages and various methods for coping with all this decentralised data.]]></description>
			<content:encoded><![CDATA[<p>I am working on timnash.co.uk v3.0 oh yeah!! As part of the new site I will be using it as a place to run more experimental Behaviour Modelling and analytical bits, more importantly I want to make it easy for people to see what I'm gathering and what I'm doing with it. I haven't worked out entirely how I'm going to do that yet so stay tuned. However one of the things I have been pondering is how I am going to combine my disparate stats gathering system.</p>
<p>Currently I run:</p>
<ul>
<li>Google Analytics</li>
<li>Google Weboptimiser and other A/B testing</li>
<li>GetClicky</li>
<li>Heatmap software</li>
<li>Occasional CSS History profiler</li>
<li>Surveys</li>
</ul>
<p>If I want to track a user across all these currently I can't for example if I want to see the clickmap of a user I can't compare it with Google Analytical data for example. This is fine but the more work I do the more I want to be able to follow a user through the entire experience, now at this stage many people may start to think ok, reduce the number of third party software and this thought has occured to me. The reason I use getclicky and Google Analytics is I can't do better it's that simple.</p>
<h3>Privacy Concerns</h3>
<p>The biggest issue when linking multiple systems together is the inevitable extra privacy issues, while these systems are separate they are psuedo anonymous combining them makes it much easier to identify a user especially when linked with a login/commenting system where they have to give their email and other information like name. However in many ways I think centralising  your data makes dealing with concerns easier to deal with for example you can set up a single "remove me from your tracking" service (also you can track how many people have opt'd out! oh wait is that wrong?) so centralising my data not only will make things easier for me it will make it easier for visitors who have privacy concerns.</p>
<h3>Central storage area</h3>
<p>The obvious way to centralise all the data is to create a central storage repository and put data in it, of course this immediately prevents several obvious problems.</p>
<ul>
<li>Replication</li>
<li>Single Point of Failure</li>
<li>Reliability</li>
</ul>
<p><em>Replication</em> - There is rarely a good reason in life to have two <em>working copies</em> of something, your analytics data included, apart from the fact you have to maintain both copies you also have to check data integrity and it's taking up space and therefore costing more to store.</p>
<p><em>Single Point of Failure</em> - while not normally a problem, when something is being continually used both for read write it''s life expectancy is limited made worse by the fact that several parts of the site will be reliant on the system to make choices, if the system falls over or worse is just slow it will cause issues throughout the site. </p>
<p><em>Reliability</em> - One of the reasons to use third party services is so I don't have to handle such things as uptime and reliability any benefit in getting someone else to do the work is lost if I then redo it.</p>
<p>the advantage is speed and as long as it's up we should be able to access everything instantly.</p>
<h3>Decentralised with key link</h3>
<p>The second approach to look at it is linking all the various services with a common key. Most third party services worth anything will allow you to store a custom value against a visitor. If the same custom value is used per visitor for all the services then they can be tracked through various calls to each services API. This is easier said then done...</p>
<p>A couple of problems that immediately come to mind</p>
<ul>
<li>Identifying the unique visitor</li>
<li>Linking a visitor after the fact</li>
<li>What controls the initial identification </li>
</ul>
<p>It also has the potential for a single point of failure of the totally centralised solution, the service that tags visitors is down the data is lost. This however seems a much smaller risk, at worse some visitors are not tagged correctly and it probably means the site has far  worse problems!</p>
<p><em>Identifying the unique visitor</em> - This at first glance seems easy but to be accurate is actually more difficult and is a post in it's own right. Once identified the next problem is choosing a naming strategy for a visitor Id if we had a centralised relational database this would be easy it would be the id of the row but we don't. Some ideas I played with was timestamp, IP and profile type or some combination of these. </p>
<p>Once the ID of the unique user is set and stored on their machine either through a session, cookie or some more hardy persist storage they can be simply picked up in the future.</p>
<p><em>Linking to a user after the fact</em> - There can be times where a user maybe identified after a service has stored data about the individual some systems will automatically tie in the old data with the new, others won't unfortunately there is not much you can do barring a recursive check and additions. For example let's assume a user visits a site on a laptop from home, then visits at work. We treat his work log in as a different instance, when he logs in, we can identify this new visitor under the same user. However we have already sent a pile of custom keys to all our analytical packages.</p>
<p><em>What controls the initial identification</em> - here is a more tricky issue in the scenario to my blog, a simple wordpress plugin that checks to see if a persistent storage or cookie is on the users machine, determines ID and adds a cookie as needed.</p>
<p>so two competing systems both with problems the solution seems to be a blend between the two.</p>
<h3>Decentralised in a Centralised way</h3>
<p>I'm going to run through two examples of the way I'm going to centralise my data, one for here timnash.co.uk and the other for a membership site.</p>
<p>On timnash.co.uk I'm going for a totally decentralised approach, a wordpress plugin, will identify users based on if they have been tagged before as I have no easy way to identify if they are previous user on a different browser machine, except if they comment there is no major advantage of maintaining any form of database control. Users will be tagged with a combination of timestamp+profileid+random number<br />
This is then included as custom data to all the stats gathering packages and stored on the users machine using browsers persistent storage. If a user wishes me not to collate individual data they can opt out via the privacy page, this will place a persistent storage cookie, telling the system to not attach the key to their pages, to opt out entirely they will still need to individual drop out of each service.</p>
<p>For a Membership I run I plan a similar system however as it has a login system, individual browser profiles (unique keys) will be stored against a logged in user. This will allow these profiles to be linked via the username and has the advantage of spotting password sharers if their are a large quantity of browser combinations (it should be able to detect even if users use proxies or are on a corporate network)</p>
<p>so that's the plan, anyone see any major issues with it? let me know, ideally before I fully build it! How are you managing your various data services?</p>
<div id="vs-message">
<strong>Consulting</strong><br /> <br />
Looking to develop a similar system or interested in doing detailed tracking and profiling of users? Why not come and have a chat and see what I can do for you! For more details please <a href="http://www.timnash.co.uk/contact/"  >contact me</a> or look on my <a href="http://www.timnash.co.uk/consulting/"  >consulting services</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/03/2010/centralizing-analytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hiding in the Dojo</title>
		<link>http://www.timnash.co.uk/03/2010/hiding-in-the-dojo/</link>
		<comments>http://www.timnash.co.uk/03/2010/hiding-in-the-dojo/#comments</comments>
		<pubDate>Wed, 17 Mar 2010 12:47:03 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Behaviour modelling]]></category>
		<category><![CDATA[SEO Introduction]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=514</guid>
		<description><![CDATA[Tim is in the Dojo and wants you to join him so is highlighting some of the most interesting posts he has done in the new Conversions and Metrics section.]]></description>
			<content:encoded><![CDATA[<p>My friend <a href="http://www.huomah.com/" >David Harry</a> has given into my continual badgering to add a <a href="http://www.huomah.com/dojo/component/option,com_kunena/Itemid,65/catid,54/func,listcat/" >conversion and metrics section</a> (Dojo <a href="http://www.huomah.com/dojo/about-the-dojo-seo-training.html" >Membership Required</a> to see) to the SEO Dojo forum and has finally added them. Now I feel compelled to at the very least add some content to the forum. </p>
<p>Already in the forum are:</p>
<ul>
<li>How I'm profiling all of you guys in the hope when I relaunched timnash.co.uk I can better serve your needs</li>
<li>Sample JQuery code for the CSS History Hack which reports back to a logging system</li>
<li>How I'm using Big Macs to determine prices of subscriptions</li>
</ul>
<p>If your a Dojo member check the new forums out, if your not well here is a couple of samples of what you are missing...</p>
<h3>Profiling the SEO Community</h3>
<blockquote><p>
My site tends to be geeky bordering on nerdy at times therefore my target audience are geeky search geeks, programmers and IR peeps, yet the people who come to my site are mainly SEOs/Internet Marketers who tend to run away quite quickly. Likewise a third group exist clients or potential clients they visit the site not in the hope of knowledge but to contact me. However this third group expect to not understand the posts. There are more groups such as the Bob Massa spam team who regularly visit but let's concentrate on identifying the first three.</p>
<p>The problem is interesting, normally when you have two groups it's easy to pick instant differences for example with the CSS history hack we could and indeed will look to see if they have visited certain sites. for example:</p>
<p>www.huomah.com/dojo/component/option,com_kunena/Itemid,65/ = clearly a search geek with some intelligence<br />
www.seomoz.org/members/ = an idiot with no brains at all</p>
<p>The problem is when they are both it's hard to have a search geek with some intelligence but with no brains. </p>
<p>the solution that I pose is to build onto sub profile the Search geek becomes a part of the internet marketer profile, this is fine in most cases we will want to treat them the same but not in all cases so we still need to pick them out.<br />
<em><a href="http://www.huomah.com/dojo/component/option,com_kunena/Itemid,65/catid,55/func,view/id,2915/" >http://www.huomah.com/dojo/component/option,com_kunena/Itemid,65/catid,55/func,view/id,2915/</a></em>
</p></blockquote>
<h3>Identifying Subscriptions Prices using McDonalds Big Macs</h3>
<blockquote><p>
Define you default currencies levels[/b]<br />
Next is to work out our subscription levels in your default currency lets say we are choosing $10, $20, $30 as our levels nice easy figures, obviously these are not our final figures we are going to be price break testing right, all nod and say yes Tim.</p>
<p>Start counting Big Macs<br />
No seriously a Big Mac is a perfect comparison tool, it is made of the same ingredients in every country and is competitively priced meaning while McDonalds makes a profit on each burger it's carefully calculated to be roughly the same % in ever country. For a list of the cost of a Big Mac in every country you either can get a subscription to the Observer or go to....<br />
<em><a href="http://www.huomah.com/dojo/component/option,com_kunena/Itemid,65/catid,56/func,view/id,2929/" >http://www.huomah.com/dojo/component/option,com_kunena/Itemid,65/catid,56/func,view/id,2929/</a></em>
</p></blockquote>
<p>So what are you waiting for if your not a dojo member it's just $30 a month which is a steal. Go on you want to <a href="http://www.huomah.com/dojo/about-the-dojo-seo-training.html" >join me</a> and 200 other dojoers.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/03/2010/hiding-in-the-dojo/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Who do I pitch to?</title>
		<link>http://www.timnash.co.uk/09/2009/who-do-i-pitch-to/</link>
		<comments>http://www.timnash.co.uk/09/2009/who-do-i-pitch-to/#comments</comments>
		<pubDate>Mon, 14 Sep 2009 14:29:23 +0000</pubDate>
		<dc:creator>Tim Nash</dc:creator>
				<category><![CDATA[Behaviour modelling]]></category>

		<guid isPermaLink="false">http://www.timnash.co.uk/?p=350</guid>
		<description><![CDATA[Tim is looking for feedback on who he should pitch his behaviour modelling ideas to in form of seminars. Could it be you? Give him your feedback and shape the future of timnash.co.uk and his seminars.]]></description>
			<content:encoded><![CDATA[<p>I have a problem, one of many I know, I ran a seminar on Friday most people liked it woot! I want to do another yey! So what is the problem well I don't know who should come!</p>
<p>The seminar started with hellos and introductions before going into a couple of fun games the idea was to get people to start  looking at each other and to understand we can pull data from multiple sources. We ended up using word association to generate rudimentary and fun profiles of the attendees before going into some of the methodology behind some of the work I do. The second half was much more focused and included overviews and specific ideas and techniques.</p>
<p>I think everyone left with value for money (after all it was free!) and loads of information to digest but even amongst the small group the amount of use from the seminar will be varied.</p>
<p>Our attendees fell roughly into the following categories:</p>
<ul>
<li>Bloggers - and professional writers probably got more out of the first half of the seminar</li>
<li>Web developers - More out of the second half then the first</li>
<li>Internet Marketers - Not sure I suspect more out of the second half which would be a shame because that implies quick fix</li>
<li>SEOs - looked confused because Google was mentioned but once</li>
</ul>
<p>The problem comes I'm not providing a solution, a quick fix or indeed a way forward but rather an ideas and concepts I strongly believe the one example I showed in the second half for guaranteed under cutting of your competitors price is worth virtually any ticket price I wish to set. I also intend to provide more detailed examples and access to video content for any attendees. This is not an unusual business model its proven to work, the issue is not with the concept, the content but the pitch and who to pitch to itself.</p>
<p>I want to run at least one more seminar and to be honest it would be nice if it paid for itself even if I wish to provide online access via a paywall (for at least some content). I don't really want to offer a product or service and really prefer to simply communicate the ideas. Consequently I have now got a difficult choice of who to market to and what to charge (if anything) and so am thinking the people who read my blog are my current audience the best people to ask so tell me what do you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.timnash.co.uk/09/2009/who-do-i-pitch-to/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
