Frequent thinker, occasional writer, constant smart-arse

Tag: flickr (Page 1 of 3)

Ouch – widgets bypassing Google’s wall

Feedjit
On the right of my blog as I write this, I have a widget – it’s a simple piece of javacript, from the company Feedjit, that allows me to embed a short piece of code to indicate to my readers how other people find my blog. Since the launch of the widget, it seems like it has become very popular with 60 million widgets claimed by the company’s website.

I made a discovery today almost by accident: I accessed my blog on another computer. Or rather, I accessed my blog via Google’s cache – who have replicated my content for their search results, widgets and all. Now when you look at the Feedjit widget (image below left), the data is very different: it no longer shows visitors to my blog, but visitors to Google servers.

If you follow through to the detailed statistics you will even see what the most popular sites are that day, as well as the locations of the visitors. As this is data from the Google cache server, you are effectively getting an analysis of visitors – who they are, what keywords they are searching for, and what they found. So because my blog is part of Google cache, I can effectively hack and sneak in the backdoor of Google’s data.

(Having a quick look, it seems this URL is the main Google cache address; however data will only get logged when someone looks at the cache.)

Feedjit google cacheDoes it matter?
While this is a fun thing to look at and then move on, I think it raises some serious issues – multiple ones at that.

On widgets: With the prolifiration of widgets on the web, has this become potentially the next biggest security risk on the web?

On privacy: It’s not that hard to identify the people making those searches. Search engines handing over data to the government has been a hot issue, with Google resisiting a much hyped story as the company tried to prove it protected its users. With the growing cross-pollination of the web, exemplified with widgets, are we prepared for what it means to have open data (which is becoming inevitable)?

On metrics: Google has a complete download of my blog in its cache, but what I didn’t realise, is that it is a copy of the full blog (with scripts like my web stats). When I look at my statistics, I see an awful lot of activity from computer bots for example. Is this because every time Google, Yahoo or MSN analyse content that has been ripped off my site, I can actually see what they are doing behind their closed walls?

Those are questions with simple but also complicated answers. Either way, if its that easy to hack even Google, then God help us.

Pageview’s are a misleading metric

Recently MySpace, the social networking site that once dominated but is now being overtaken by Facebook, sent me an e-mail informing me that a friend of mine had a birthday. What is unusual, is that although I have received notifications of this type when I had logged into the site, I had never been e-mailed.

Below is a copy of the e-mail, and lets see if you notice what I did:
birthdayreminder

It doesn’t tell me whose birthday it is. In fact, it is even ambiguous as to whether it was just the one person or not. Big deal? Not really. But it very clearly tells me something: MySpace is trying to increase its pageviews.

Social networking sites are very useful services to an individual; they enable a person to manage and monitor their personal networks. Not only am I in touch with so many people I lost contact with, but I am in the loop with their lives. I may not message them, but by passive observation, I know what everyone is up to. Things like what they’re studying, where they work, what countries they will be holidaying in, and useful things like when they have their birthday.

Social networking sites are not just a website, but an information service, to help you manage your life. However as useful as I find these services, the revenue model is largely dependent on advertising, with premium features a rare thing now. So when you rely on advertising, you are going to be looking at ways of boosting the key figures that determine that revenue stream.

Friendster’s surprising growth in May was due to some clever techniques of using e-mail, to drive pageviews. And it worked. E-mail notifications, when done tactfully, can drive a huge amount of activity. Of the what seems like hundreds of web services I have joined, e-mail at times is the only way for me to remember I even subscribed to it once upon a time. Combine e-mail with information I want to be updated with, and you’ve got a great recipe for using e-mail as a tool to drive page views.

…And that is the problem. MySpace has very cleverly sent this e-mail to get me to log into my account. A marketing campagn like that will at the very least, see a good day in pageview growth. But the reason I am logging in, is just so I can see whose birthday it is. Myspace now to me is irrelevant: those pageviews attributed to me are actually, not one of an engaged user.

Pageviews as a metric for measuring audience engagement is prone to manipulation. Increases in pageviews on the face of it, make a website appear more popular. But in reality, dig a little deeper and the correlation for what really matters (audience engagement) is not quite on par.

So everyone, repeat after me: Pageviews – we need to drop them as a concept if we are ever going to make progress.

How Google reader can finally start making money

Today, you would have heard that Newsgator, Bloglines, Me.dium, Peepel, Talis and Ma.gnolia have joined the APML workgroup and are in discussions with workgroup members on how they can implement APML into their product lines. Bloglines created some news the other week on their intention to adopt it, and the announcement today about Newsgator means APML is now fast becoming an industry standard.

Google however, is still sitting on the side lines. I really like using Google reader, but if they don?¢‚Ǩ‚Ñ¢t announce support for APML soon, I will have to switch back to my old favourite Bloglines which is doing some serious innovating. Seeing as Google reader came out of beta recently, I thought I?¢‚Ǩ‚Ñ¢d help them out to finally add a new feature (APML) that will see it generate some real revenue.

What a Google reader APML file would look like
Read my previous post on what exactly APML is. If the Google reader team was to support APML, what they could add to my APML file is a ranking of blogs, authors, and key-words. First an explanation, and then I will explain the consequences.

In terms of blogs I read, the percentage frequency of posting I read from a particular blog will determine the relevancy score in my APML file. So if I was to read 89% of Techcrunch posts ?¢‚Ǩ‚Äú which is information already provided to users ?¢‚Ǩ‚Äú it would convert this into a relevancy score for Techcrunch of 89% or 0.89.

ranking

APML: pulling rank

In terms of authors I read, it can extract who posted the entry from the individual blog postings I read, and like the blog ranking above, perform a similar procedure. I don?¢‚Ǩ‚Ñ¢t imagine it would too hard to do this, however given it?¢‚Ǩ‚Ñ¢s a small team running the product, I would put this on a lower priority to support.

In terms of key-words, Google could employ its contextual analysis technology from each of the postings I read and extract key words. By performing this on each post I read, the frequency of extracted key words determines the relevance score for those concepts.

So that would be the how. The APML file generated from Google Reader would simply rank these blogs, authors, and key-words – and the relevance scores would update over time. Over time, the data is indexed and re-calculated from scratch so as concepts stop being viewed, they start to diminish in value until they drop off.

What Google reader can do with that APML file
1. Ranking of content
One of the biggest issues facing consumers of RSS is the amount of information overload. I am quite confident to think that people would pay a premium, for any attempt to help rank the what can be the hundreds of items per day, that need to be read by a user. By having an APML file, over time Google Reader can match postings to what a users ranked interests are. So rather than presenting the content by reverse chronology (most recent to oldest); it can instead organise content by relevancy (items of most interest to least).

This won?¢‚Ǩ‚Ñ¢t reduce the amount of RSS consumption by a user, but it will enable them to know how to allocate their attention to content. There are a lot of innovative ways you can rank the content, down to the way you extract key works and rank concepts, so there is scope for competing vendors to have their own methods. However the point is, a feature to ?¢‚ǨÀúSort by Personal Relevance?¢‚Ǩ‚Ñ¢ would be highly sort after, and I am sure quite a few people will be willing to pay the price for this God send.

I know Google seems to think contextual ads are everything, but maybe the Google Reader team can break from the mould and generate a different revenue stream through a value add feature like that. Google should apply its contextual advertising technology to determine key words for filtering, not advertising. It can use this pre-existing technology to generate a different revenue stream.

2. Enhancing its AdSense programme

blatant ads

Targeted advertising is still bloody annoying

One of the great benefits of APML is that it creates an open database about a user. Contextual advertising, in my opinion is actually a pretty sucky technology and its success to date is only because all the other types of targeted advertising models are flawed. As I explain above, the technology instead should be done to better analyse what content a user consumes, through keyword analysis. Over time, a ranking of these concepts can occur ?¢‚Ǩ‚Äú as well as being shared from other web services that are doing the same thing.

An APML file that ranks concepts is exactly what Google needs to enhance its adwords technology. Don?¢‚Ǩ‚Ñ¢t use it to analyse a post to show ads; use it to analyse a post to rank concepts. Then, in aggregate, the contextual advertising will work because it can be based off this APML file with great precision. And even better, a user can tweak it ?¢‚Ǩ‚Äú which will be the equivalent to tweaking what advertising a user wants to get. The transparency of a user being able to see what ‘concept ranking’ you generate for them, is powerful, because a user is likely to monitor it to be accurate.

APML is contextual advertising biggest friend, because it profiles a user in a sensible way, that can be shared across applications and monitored by the user. Allowing a user to tweak their APML file for the motivation of more targeted content, aligns their self-interest to ensure the targeted ads thrown at them based on those ranked concepts, are in fact, relevant.

3. Privacy credibility
Privacy is the inflation of the attention economy. You can?¢‚Ǩ‚Ñ¢t proceed to innovate with targeted advertising technology, whilst ignoring privacy. Google has clearly realised this the hard way by being labeled one of the worst privacy offenders in the world. By adopting APML, Google will go a long way to gain credibility in privacy rights. It will be creating open transparency with the information it collects to profile users, and it will allow a user to control that profiling of themselves.

APML is a very clever approach to dealing with privacy. It?¢‚Ǩ‚Ñ¢s not the only approach, but it a one of the most promising. Even if Google never uses an APML file as I describe above, the pure brand-enhancing value of giving some control to its users over their rightful attention data, is something alone that would benefit the Google Reader product (and Google?¢‚Ǩ‚Ñ¢s reputation itself) if they were to adopt it.

privacy

Privacy. Stop looking.

Conclusion
Hey Google – can you hear me? Let’s hope so, because you might be the market leader now, but so was Bloglines once upon a time.

Explaining APML: what it is & why you want it

Lately there has been a lot of chatter about APML. As a member of the workgroup advocating this standard, I thought I might help answer some of the questions on people’s minds. Primarily – “what is an APML file”, and “why do I want one”. I suggest you read the excellent article by Marjolein Hoekstra on attention profiling that she recently wrote, if you haven’t already done so, as an introduction to attention profiling. This article will focus on explaining what the technical side of an APML file is and what can be done with it. Hopefully by understanding what APML actually is, you’ll understand how it can benefit you as a user.

APML – the specification
APML stands for Attention Profile Markup Language. It’s an attention economy concept, based on the XML technical standard. I am going to assume you don’t know what attention means, nor what XML is, so here is a quick explanation to get you on board.

Attention
There is this concept floating around on the web about the attention economy. It means as a consumer, you consume web services – e-mail, rss readers, social networking sites – and you generate value through your attention. For example, if I am on a Myspace band page for Sneaky Sound System, I am giving attention to that band. Newscorp (the company that owns MySpace) is capturing that implicit data about me (ie, it knows I like Electro/Pop/House music). By giving my attention, Newscorp has collected information about me. Implicit data are things you give away about yourself without saying it, like how people can determine what type of person you are purely off the clothes you wear. It’s like explicit data – information you give up about yourself (like your gender when you signed up to MySpace).

Attention camera

I know what you did last Summer

XML
XML is one of the core standards on the web. The web pages you access, are probably using a form of XML to provide the content to you (xHTML). If you use an RSS reader, it pulls a version of XML to deliver that content to you. I am not going to get into a discussion about XML because there are plenty of other places that can do that. However I just want to make sure you understand, that XML is a very flexible way of structuring data. Think of it like a street directory. It’s useless if you have a map with no street names if you are trying to find a house. But by having a map with the street names, it suddenly becomes a lot more useful because you can make sense of the houses (the content). It’s a way of describing a piece of content.

APML – the specification
So all APML is, is a way of converting your attention into a structured format. The way APML does this, is that it stores your implicit and explicit data – and scores it. Lost? Keep reading.

Continuing with my example about Sneaky Sound System. If MySpace supported APML, they would identify that I like pop music. But just because someone gives attention to something, that doesn’t mean they really like it; the thing about implicit data is that companies are guessing because you haven’t actually said it. So MySpace might say I like pop music but with a score of 0.2 or 20% positive – meaning they’re not too confident. Now lets say directly after that, I go onto the Britney Spears music space. Okay, there’s no doubting now: I definitely do like pop music. So my score against “pop” is now 0.5 (50%). And if I visited the Christina Aguilera page: forget about it – my APML rank just blew to 1.0! (Note that the scoring system is a percentage, with a range from -1.0 to +1.0 or -100% to +100%).

APML ranks things, but the concepts are not just things: it will also rank authors. In the case of Marjolein Hoekstra, who wrote that post I mention in my intro, because I read other things from her it means I have a high regard for her writing. Therefore, my APML file gives her a high score. On the other hand, I have an allergic reaction whenever I read something from Valleywag because they have cooties. So Marjolein’s rank would be 1.0 but Valleywag’s -1.0.

Aside from the ranking of concepts (which is the core of what APML is), there are other things in an APML file that might confuse you when reviewing the spec. “From” means ‘from the place you gave your attention’. So with the Sneaky Sound System concept, it would be ‘from: MySpace’. It’s simply describing the name of the application that added the implicit node. Another thing you may notice in an APML file is that you can create “profiles”. For example, the concepts about me in my “work” profile is not something I want to mix with my “personal” profile. This allows you to segment the ranked concepts in your APML into different groups, allowing applications access to only a particilar profile.

Another thing to take note of is ‘implicit’ and ‘explicit’ which I touched on above – implicit being things you give attention to (ie, the clothes you wear – people guess because of what you wear, you are a certain personality type); explicit being things you gave away (the words you said – when you say “I’m a moron” it’s quite obvious, you are). APML categorises concepts based on whether you explicitly said it, or it was implicitly determined by an application.

Okay, big whoop – why can an APML do for me?
In my eyes, there are five main benefits of APML: filtering, accountability, privacy, shared data, and you being boss.

1) Filtering
If a company supports APML, they are using a smart standard that other companies use to profile you. By ranking concepts and authors for example, they can use your APML file in the future to filter things that might interest you. As I have such a high ranking for Marjolein, when Bloglines implements APML, they will be able to use this information to start prioritising content in my RSS reader. Meaning, of the 1000 items in my bloglines reader, all the blog postings from her will have more emphasis for me to read whilst all the ones about Valleywag will sit at the bottom (with last nights trash).

2) Accountability
If a company is collecting implicit data about me and trying to profile me, I would like to see that infomation thank you very much. It’s a bit like me wearing a pink shirt at a party. You meet me at a party, and think “Pink – the dude must be gay”. Now I am actually as straight as a doornail, and wearing that pink shirt is me trying to be trendy. However what you have done is that by observation, you have profiled me. Now imagine if that was a web application, where this happens all the time. By letting them access your data – your APML file – you can change that. I’ve actually done this with Particls before, which supports APML. It had ranked a concept as high based on things I had read, which was wrong. So what I did, was changed the score to -1.0 for one of them, because that way, Particls would never show me content on things it thought I would like.

3) Privacy
I joined the APML workgroup for this reason: it was to me a smart away to deal with the growing privacy issue on the web. It fits my requirements about being privacy compliant:

  • who can see information about you
  • when can people see information about you:
  • what information they can see about you

The way APML does that is by allowing me to create ‘profiles’ within my APML file; allowing me to export my APML file from a company; and by allowing me to access my APML file so I can see what profile I have.

drivers

Here is my APML, now let me in. Biatch.

4) Shared data
An APML file can, with your permission, share information between your web-services. My concepts ranking books on Amazon.com, can sit alongside my RSS feed rankings. What’s powerful about that, is the unintended consequences of sharing that data. For example, if Amazon ranked what my favourite genres were about books – this could be useful information to help me filter my RSS feeds about blog topics. The data generated in Amazon’s ecosystem, can benefit me and enjoy a product in another ecosystem, in a mutually beneficial way.

5) You’re the boss!
By being able to generate APML for the things you give attention to, you are recognising the value your attention has – something companies already place a lot of value on. Your browsing habits can reveal useful information about your personality, and the ability to control your profile is a very powerful concept. It’s like controlling the image people have of you: you don’t want the wrong things being said about you. 🙂

Want to know more?
Check the APML FAQ. Othersise, post a comment if you still have no idea what APML is. Myself or one of the other APML workgroup members would be more than happy to answer your queries.

5 observations of how social networking (online) has changed social networking (offline)

Just then, I had an image get shattered. A well respected blogger, whose online persona had me think they were a very cool person offline, is infact, a fat geek with an annoying voice. I can pretty much cross off the list that he can relate to experiences of how Facebook is mentioned in trendy nightclubs on the dancefloor.

Another thing I have noticed: all the major commentators & players of the Internet economy, are usually married, in their 30s or 40s, and almost all come from an IT background.

Don’t get me wrong – the industry has a lot of people that are a goldmine with what they say. They challenge my thinking, and they are genuinely intelligent. But although they are users of web services like Facebook or MySpace – just like the rest of society – they are people experiencing these technologies in the bubble of the technology community. Their view of the world, is not aligned with what’s actually happening in the mainstream. No surprises there – they are the early adopters, the innovators and the pioneers. It’s funny however, that comparable to other services (like Twitter) the adoption amongst the tech community for Facebook has been slow: it was only when the developer network launched that it started getting the attention.

What I want to highlight is that most commentators have no way in the world of understanding the social impact of these technologies in the demograghic where the growth occurs. We all know for example, Facebook is exploding with users – but do we know why it’s exploding? A married man in his 40s with a degree in computer science, isn’t going to be able to answer that, because most of the growth comes from single 20 year olds with an history major.

So what I am about to recount is my personal experience. I am not dressing it up as a thought-piece; I am just purely sharing how I have seen the world take to social networking sites and how it has transformed the lives of my own and the people around me. I’m 23 years old, the people in my life generally fall into the computer clueless category, and I have about 500 Facebook friends that I know through school, university, work, or just life (about ten are in the tech industry).

1) Social networking sites as a pre-screening tool
Observation: I randomly was approached by a chick one night and during the course of our conversation she insisted I knew a certain person. Ten minutes, and 20 more “I swear…you know xxx” – I finally realised she was right and that I did know that person. For her to be so persistent in her claim, she had to be sure of herself. But how can someone be sure of themselves with that piece of information, when I had only met her 30 seconds earlier?

I then realised this chick had already seen me before – via facebook. I know this is the case, because I myself have wandered on a persons profile and realised we have a lot of mutual friends. In those times I would note it is bound to happen that I would meet them.

Implication: People are meeting people and know who they are before they even talk. They say most couples meet through friends. Well now you can explore your friends’s friends – and then start hanging around that friend when you know they know someone you like!

2) Social networking sites getting you more dates
Observation: I met a chick and had a lengthy chat with her, and although she was nice, I left that party thinking I would probably never see her again as I didn’t give out any contact details. That next day, she added me as a friend on Facebook. In another scenario, there was a girl I met from a long time ago and I hadn’t seen her since. We randomly found each other on Facebook, and I’ve actually got to know the girl – picking up from where we left off.

Implication: Social networking sites help you further pursue someone, even though you didn’t get their number. In fact, it’s a lot less akward. Facebook has become a aprt of the courtship process – flirtation is a big aspect of the sites activity.

3) Social networking sites helping me decide
Observation: There was a big party, but I wasn’t sure if I would go because I didn’t know who would go with me. I looked at the event RSVP, and I to my surprise found out a whole stack of people I knew were going.

Implication: Facebook added valuable information that helped me decide. Not knowing what people were going, I probably wouldn’t have gone. Think about this on another level: imagine you were were interested in buying a camera, and you had access to the camera makes of your friends (because the digital photos they upload contain the camera model – as seen with Flickr). Knowing what your friends buy is a great piece of advice on what you want to buy.

4) Social networking sites increasing my understanding of people I know
Observation: I found out when a friend added me on myspace, that she was bisexual – something I never would have realised. Being bi is no big deal – but it’s information that people don’t usually give up about themselves. Likewise, I have since found out about people I went to school with are now gay. Again – no big deal – but discreet information like that increases your depth of understanding about someone (ie, not making gay jokes around them). I know what courses my contacts have studied since I last saw them, and what they are doing with their lives. I also know of someone that will be at one of my travel destinations when I go on holiday.

Implication: You are in the loop about the lives of everyone you’ve met. It’s nothing bad, because these people control what you can see, but it’s great because there are things you know, things you know you don’t know, but now you can find out things you didn’t know that you didn’t know.

5) Social networking sites as a shared calendar
Observation: My little sister is currently going through 21st season – back to back parties of her friends. One of the gripes of 21sts when organising them, is overlap with other peoples. Not only that – but also the physical process of contacting people and getting them to actually RSVP – it’s a pain. However unlike my 21st season experience from a few years ago, my sister has none of these issues. This is because Facebook is like one big shared calender. Another example is how I send my congratulations to birthday friends a lot more than I have in the past because I actually know its their birthday- due to fact our calendars are effectively pooled as a shared calendar.

Implication: Facebook has become an indispensable tool to peoples social lives.

6) Bonus observation – explaining the viral adoption of Facebook
I have a few friends that don’t have Facebook. You can almost count them on the one hand. And when you bring it up, they explode with a “I’m sick of Facebook!” and usually get defensive because so many people hassle them. In most cases, they make an admission that one day, they will join. The lesson here is that Facebook is growing because of peer pressure. The more people in someone’s network, the more valuable facebook becomes to them. When they say 40 million users, it’s actually 40 million sales people.

God bless the network effect.

Facebook is doing what Google did: enabling

The hype surrounding the Facebook platform has created a frenzy of hype – on it being a closed wall, on privacy and the right to users having control of their data, and of course the monetisation opportunities of the applications themselves (which on the whole, appear futile but that will change).

We’ve heard of applications becoming targeted, with one (rumoured) for $3 million – and it has proved applications are an excellent way to acquire users and generate leads to your off-Facebook website & products. We’ve also seen applications desperately trying to monetise their products, by putting Google Ads on the homepage of the application, which are probably just as effective as giving a steak to a vegetarian. The other day however was the first instance where I have seen a monetisation strategy by an application that genuinely looked possible.

It’s this application called Compare Friends, where you essentially compare two friends on a question (who’s nicer, who has better hair, who would you rather sleep with…). The aggregate of responses from your friends who have compared you, can indicate how a person sits in a social network. For example, I am most dateable in my network, and one of the people with prettiest eyes (oh shucks guys!).

The other day, I was given an option to access the premium service – which essentially analyses your friends’ responses.

compare sub

It occurred to me that monetisation strategies for the Facebook platform are possible beyond whacking Google Adsense on the application homepage. Valuable data can be collected by an application, such as what your friends think of you, and that can be turned into a useful service. Like above, they offer to tell you who is most likely to give you a good reference – that could be a useful thing. In the applications current iteration, I have no plans to pay 10 bucks for that data – but it does make you wonder that with time, more sophisticated services can be offered.

Facebook as the bastion of consumer insight

On a similar theme, I did an experiment a few months ago whereby I purchased a facebook poll, asking a certain demographic a serious question. The poll itself revealed some valuable data, as it gave me some more insight into the type of users of Facebook (following up from my original posting). However what it also revealed was the power of tapping into the crowd for a response so quickly.
clustered yes
Seeing the data come in by the minute as up to 200 people took the poll, as a marketer you could quickly gauge how people think about something in a statistically valid sample, in literally hours. You should read this posting discussing what I learned from the poll if you are interested.

It’s difficult to predict the trends I am seeing, and what will become of Facebook because a lot could happen. However one thing is certain, is that right now, it is a highly effective vehicle for individuals to gain insight about themselves – and generating this information is something I think people will pay for if it proves useful. Furthermore, it is an excellent way for organisations to organise quick and effective market research to test a hypothesis.

The power of Facebook, for external entities, is that it gives access to controlled populations whereby valuable data can be gained. As the WSJ notes, the platform has now started to see some clever applications that realise this. Expect a lot more to come.

Facebook is doing what Google did for the industry

When Google listed, a commentator said this could launch a new golden age that would bring optimism not seen since the bubble days to this badly shaken industry. I reflected on that point he made to see if his prophesy would come true one day. In case you hadn’t noticed, he was spot on!

When Google came, it did two big things for the industry

1) AdSense. Companies now had a revenue model – put some Google ads on your website in minutes. It was a cheap, effective advertising network that created an ecosystem. As of 30 June 2007, Google makes about 36% of their revenue from members in the Google network – meaning, non-Google websites. That’s about $2.7 billion. Although we can’t quantify how much their partners received – which could be anything from 20% to 70% (the $2.7 billion of course is Google’s share) – it would be safe to say Google helped the web ecosystem generate an extra $1 billion. That’s a lot of money!

2) Acquisitions. Google’s cash meant that buyouts where an option, rather than IPO, as is what most start-ups aimed for in the bubble days. In fact, I would argue the whole web2.0 strategy for startups is to get acquired by Google. This has encouraged innovation, as all parties from entrepreneurs to VC’s can make money from simply building features rather than actual businesses that have a positive cashflow. This innovation has a cumulative effect, as somewhere along the line, someone discovers an easy way to make money in ways others hadn’t thought possible.

Google’s starting to get stale now – but here comes Facebook to further add to the ecosystem. Their acquisition of a ‘web-operating system‘ built by a guy considered to be the next Bill Gates shows that Facebook’s growth is beyond a one hit wonder. The potential for the company to shake the industry is huge – for example, in advertising alone, they could roll out an advertising network that takes it a step further than contextual advertising as they actually have a full profile of 40 million people. This would make it the most efficient advertising system in the world. They could become the default login and identity system for people – no longer will you need to create an account for that pesky new site asking you to create an account. And as we are seeing currently, they enable a platform the helps other businesses generate business.

I’ve often heard people say that history will repeat itself – usually pointing to how 12 months ago Myspace was all the rage: Facebook is a fad, they will be replaced one day. I don’t think so – Facebook is evolving, and more importantly is that it is improving the entire web ecosystem. Facebook, like Google, is a company that strengthens the web economy. I am probably going to hate them one day, just like how my once loved Google is starting to annoy me now. But thank God it exists – because it’s enabling another generation of commerce that sees the sophistication of the web.

Understanding the Facebook poll feature

A little while ago, I was lucky to catch a Facebook poll, as a way of advertising its new poll feature. As a follow up from that experience, I thought I might purchase my own poll to validate its effectiveness. Here are a few of my observations:

1) Answers appear to be clustered

One of the interesting things about the poll feature, is that it is real time. You are getting answers as people vote. You select what type of people you want to target, and Facebook will then quiz users of that criteria by putting the poll on their homescreen. Something I noticed however, was that answers seemed to come in together followed by a gap. I also noticed that these answers that come in groups, usually have similar responses.
clustered yes

I appears that users are highly responsive to a poll. If it appears on their survey, a lot of people appear to answer it. I know this because I specifically targeted my poll to Australians, in the middle of the day when I wouldn’t expect people to be using facebook.
The placing of the options seems to affect the results. I suppose anyone that has studied polling before, would probably know the order of a ballot heavily influences the poll. This appears evident here. Usefully however, Facebook allows you to randomise the poll so that different users see a different order. However as is demonstrated above, with this clustering, its groups of users that see a different order, not individuals

2) Facebook users appear to be more male, and younger
Something I noticed in my previous blog posting on the poll feature, was that there appeared to be more males answering. This seems to have happened occur with this poll as well, and indicates to me that Facebook’s population of users have a higher male base – which is unusual given that women generally outnumber men in society.

fcfb2

It should also be noted that there is no age groups option for people above 50 years old.

3) Takers of the poll appear to be a genuinely random population
The reason I picked 200 people, was that that is the minimum amount a poll needs to be before it can statistically be considered accurate to represent a population. However as I was able to obtain data as the poll was running, it gave me insight into how random (and representative) the population that took the test was.

Below is a screenshot half way through, as well as the final result

results half wayb

fcfb1b

The results for the poll are almost identical. Without reading too much into it, that tells me the conditions of the test were genuinely random.

There are a few other things I noticed, but this isn’t me trying to promote a Facebook service, and will leave to make your own analysis in combination with the other Facebook poll I blogged about. I just want to highlight that for absolutely nothing, you can get an insight into a market in literally hours.

IBM recently released a report saying that the Internet has overtaken TV, changing the dynamics of the advertising industry, and that they see the role of advertising agencies in the future to go “beyond traditional creative roles to become brokers of consumer insight

Facebook is an amazing company because of the amount of data it holds about the population in various societies. And for a fee – the rest of the world can take advantage of this as well. Welcome Facebook – the world’s most competitive agency for consumer insight.

Google: the ultimate ontology

A big issue with the semantic web is ontologies – the use of consistent definitions to concepts. For those that don’t understand what I’m talking about – essentially, the next evolution of the web is about making content readable by not just humans but also machines. However for a machine to understand something it reads, it needs consistent definitions. Human’s for example, are intelligent – they understand that the word “friend” is also related to the word “acquaintance”, but a computer would treat them to mean two different things. Or do they?

Just casually looking at some of my web analytics, I noticed some people landed on my site by doing a google search for how many acquaintances do people have, which took them to a popular posting of mine about how many friends people have on facebook. I’ve had a lot of visitors because of this posting, and its been an interesting case study for me on how search engines work. However today was something different from other times: I found the word acquaintance weird. I know I didn’t use that word in my posting – and when I went to the Google cache I realised something interesting: because someone linked to me using that word, the search engine replaced the word ‘friend’ with ‘acquaintances’.

acquaintances

Google’s linking mechanism is one powerful ontology generator.

Facebook poll: how many friends do you have?

One of Facebook‘s new features is the ability to create surveys, targeted to certain groups of people within the community site. One caught my eye today, which asked 1,000 random people “How many friends do you have?”. Although I am not sure of the conditions this poll was conducted under (ie, did only Australian’s see it?), 1,000 random people should theoretically be a fairly representative sample of the entire population.

Whilst the results immediately show some interesting information on the typical size of a person’s network (which is a discussion in itself), I am equally fascinated by the specific genders and age breakdown of people who answered the poll and the correlation with their network size. One theory I have of why people spend so much time on the site, is because people ‘collect’ friends. They are constantly discovering old friends through mutual friends – a friend’s list leads a person to another profile where they may discover someone they have lost touch with. Check the results first, before I continue:

Poll on

Facebook poll breakdown

Facebook poll breakdown by age

Some of my interpretations of the results

  • Despite being open to anyone since late last year, university students still dominate the site as over half the survey was answered by people in the 18-24 age bracket
  • About 46% of males and 49% of females have over 200+ people. It’s impossible to have 200 ‘friends’ – no one can physically see 200 friends on a regular basis This tells me Facebook is now more about ‘contacts’ and keeping in touch with people you know. This makes it more than just a closed network of your close friends and more of a networking tool – validating what some commentators have been saying of late. I could spend a whole blog post explaining the implications of this, but basically, this means facebook is ‘the’ social networking site now and it’s only going to get more entrenched due to the law of cumulative advantage.
  • Of people aged 35 and above, 70% have under 99 friends – which is only the case of 41% of people aged 25-34, and 19% of 18-24. This is interesting, because the people in the 24+ age group didn’t have facebook when they were at university (which is why 18-24 is so dominant in this regard). Over time, you would expect the age groups to be fairly synchronised – in fact older people would have much larger networks. This tells me despite all the hype, Facebook is still not mainstream – there is a heck of a lot more growth to occur.
  • …and leading off where I started the blog posting: the fact that more males answered the poll (53%) – despite women generally outnumbering men in Western countries – implies men are more interested in knowing how many friends people have. So if you tie that with my ‘friend collector’ theory means more men spend time ‘collecting’…in other words, men stalk more!

Pricks

If you don’t have a valid e-mail, Facebook forces you to verify it, before it removes those annoying CAPTCHA boxes.It’s a pretty standard thing for websites to do this.

Now, it’s telling me, I have to verify my mobile phone number – even though I have been regularly using the service for eight months.

bastards

This is not about verifying my identity – it’s about forcing me to give up my personal information. Bastards.

« Older posts