Tag Archive for 'USA'

Don’t get the Semantic Web? You will after this

Prior to 2006, I had sort of heard of the Semantic Web. To be honest, I didn’t know much – it was just another buzzword. I’ve been hearing about Microformats for years, and cool but useless initiatives like XFN. However to me it was simply just another web thing being thrown around.

Then in August 2006, I came across Adrian Holovaty’s article where he argues journalism needs to move from a story-centric world to a data-centric world. And that’s when it dawned on me: the Semantic web is some serious business.

I have since done a lot of reading, listening, and thinking. I don’t profess to be a Semantic Web expert – but I know more than the average person as I have (painfully) put myself through videos and audios of academic types who confuse the crap out of me. I’ve also read through a myriad of academic papers from the W3C, which are like the times when you read a novel and keep re-reading the same page and still can’t remember what you just read.

Hell – I still don’t get things. But I get the vision, so that’s what I am going to share with you now. Hopefully, my understanding will benefit the clueless and the skeptical alike, because it’s a powerful vision which is entirely possible

1) The current web is great for humans; useless for machines
When you search for ambiguous terms, at best, search engines can algorithmically predict some sort of answer that partially answers your query. Sometimes not. But the complexity of language, is not something engineers can engineer to deal with. After all, without ambiguity of natural languages, the existence of poetry is impossible.

Fine.

What did you think when you read that? As in: “I’ve had it – fine!” which is like another way of saying ok or agreeing with something. Perhaps you thought about that parking ticket I just got – illegal parking gets you fined. Maybe you thought I am applauding myself by saying that was one fine piece of wordcraftship I just wrote, or said in another context, like a fine wine.

Language is ambiguous, and depending on the context with other words, we can determine what the meaning of the word is. Search start-up company Powerset, which is hoping to kill Google and rule the world, is employing exactly this technique to improve search: intelligent processing of words depending on context. So by me putting in “it’s a fine”, it understands the context that it’s a parking ticket, because you wouldn’t say “it’s a” in front of ‘fine’ when you use it to agree with something (the ‘ok’ meaning above).

But let’s use another example: “Hilton Paris” in Google – the worlds most ‘advanced’ search engine. Obviously, as a human reading that sentence, you understand because of the context of those words I would like to find information about the Hilton in Paris. Well maybe.

Let’s see what Google comes up with: Of the ten search results (as of when I wrote this blog posting), one was a news item on the celebrity; six were on the celebrity describing her in some shape or form, and three results were on the actual Hotel. Google, at 30/70 – is a little unsure.

Why is Paris Hilton, that blonde haired thingy of a celebrity, coming up in the search results?

Technologies like Powerset apparently produce a better result because it understands the order of the words and context of the search query. But the problem with these searches, isn’t the interpretation of what the searcher wants – but also the ability to understand the actual search results. Powerset can only interpret so much of the gazilions of words out there. There is the whole problem of the source data, no just the query. Don’t get what I mean? Keep reading. But for now, learn this lesson

Computers have no idea about the data they are reading. In fact, Google pumping out those search results is based on people linking. Google is a machine, and reads 1s and 0s – machine language. It doesn’t get human language

2) The Semantic web is about making what human’s read, machine readable
Tim Berner’s Lee, the guy that invented the World Wide Web and the visionary behind the Semantic Web, prefers to call it the ‘data web’. The current web is a web of documents – by adding this extra data to content – machines will be able to understand it. Metadata, is data about data.

A practical outcome of having a semantic web, is that Google would know that when it pulls up a web page regardless of the context of the words – it will understand what the content is. Think of every word on the web, being linked to a master dictionary.

The benefit of the semantic web is not for humans – at least immediately. The Semantic Web is actually pretty boring with what it does – what is exciting, is what it will enable. Keep reading.

3) The Semantic web is for machines to interpret, not people
A lot of the skeptics of the semantic web, usually don’t see the value of it. Who cares about adding all this extra meta data? I mean heck – Google still was able to get the website I needed – the Hilton in Paris. Sure, the other 60% of the results on that page were irrelevant, but I’m happy.

I once came across a Google employee and he asked “what’s the point of a semantic web; don’t we already enough metadata?” To some extent, he’s right – there are some websites out there that have metadata. But the point of the semantic web is so that machines once they read the information, can start thinking like how a human would and connecting it to other information. There needs to be across the board metadata.

For example, my friend Michael was recently looking to buy a car. A painful process, because there are so many variables. So many different models, different makes, different dealers, different packages. We have websites, with cars for sale neatly categorised into profile pages saying what model it is, what colour it is, and how much. (Which may I add, are hosted on multiple car sites with different types of profiles). A human painfully reads through these profiles, and computes as fast as a human can. But a machine can’t read these profiles.

Instead of wasting his (and my) weekends driving around Sydney to find his car, a machine could find it for him. So, Mike would enter his profile in – what he requires in a car, what his credit limit is, what his prior history with cars are – everything that would affect his judgement of a car. And then, the computer can query every online website with cars to match the criteria. Because the computer can interpret these websites across the board, it can evaluate and it can go back to Michael and say “this is the car for you, at this dealer – click yes to buy”.

The semantic web is about giving computers the information to be able to interpret data, so that it can do what they do really well – compute.

4) A worldwide database
What essentially Berner’s Lee envisions, is turning the entire world wide web into a database that can be queried. Currently, the web looks like Microsoft Word – one swab of text. However, if that swab of text was neatly categorised in an Excel spreadsheet, you could manipulate that data and do what you please – create reports, reorder them, filter, and do whatever until your heart is content.

At university, I was forced to do an Information Systems subject which was essentially about the theory of databases. Damn painful. I learned only two things from that course. The first thing was that my lecturer, tutor, and classmates spoke less intelligible English than a caterpillar. But the second thing was that I learned what information is and how it differs from data. I am now going to share with you that lesson, and save you three months of your life.

You see, data is meaningless. For example, 23 degrees is data. On its own, it‚Äôs useless. Another piece of data in Sydney. Again, – useless. I mean, you can think all sorts of things when you think of Sydney, but it doesn‚Äôt have any meaning.

Now put together 23 degrees and Sydney, and you have just created information. Information is about creating relationships between data. By creating a relationship, an association, between these two different pieces of data – you can determine it’s going to be a warm day in Sydney. And that is what information is: Relationship building; connecting the dots; linking the islands of data together to generate something meaningful.

The semantic web is about allowing computers to be able to query the sum of human knowledge like one big database to generate information

Concluding thoughts
You are probably now starting to freak out and think “Terminator” images with computers suddenly erupting form under your computer desk, and smashing you against the wall as a battle between humans and computers begins. But I don’t see it like that.

I think about the thousands of hours humans spend trying to compute things. I think of the cancer research, whereby all this experimentation occurring in labs, is trying to connect new pieces of data with old data to create new information. I think about computers being about to query the entire taxation legislation to make sure I don’t pay any tax, because it knows how it all fits together (having studied tax, I can assure you – it takes a lifetime to only understand a portion of tax law). In short, I understand the vision of the Semantic web as a way of linking things together, to enable computers to compute – so that I can sit on my hammock drinking my beer, as I can delegate the duties of my life to the machines.

All the semantic web is trying to do, is making sure everything is structured in a consistent manner, with a consistent dictionary behind the content, so that a machine can draw connections. As Berner’s Lee said on one of the videos I saw: “it’s all about creating links”.

The process to a Semantic Web is boring. But once we have those links, we can then start talking about those hammocks. And that’s when the power of the internet – the global network – will really take off.

You need to be persistently adaptable

Tim Bull has recently written an interesting discussion point on when is the right time to innovate. In a post titled “Steam engine time“, he asks:

If innovation is a process of the right idea, in the right place and at the right time, how do we judge what the right time is and measure what is going on around us to hit the right spot?

Some would say luck has something to do with it, although I believe that is the perception from an outsiders point of view. In my eyes, a core set of attributes are required for innovation.

Consider this quote from Calvin Coolidge, 30th president of USA:

Nothing in the world can take the place of Persistence. Talent will not; nothing is more common than unsuccessful men with talent. Genius will not; unrewarded genius is almost a proverb. Education will not; the world is full of educated derelicts. Persistence and determination alone are omnipotent. The slogan ‘Press On’ has solved and always will solve the problems of the human race.

I think Tim is wrong to ask when is the right time, because innovators understand their environment, adapt to it – and then push until they get there. Persistence and adaptability, in my eyes, are two crucial aspects needed in a person or even a country or company, for it to successfully move forward. However whilst persistence is key – you need determination to push forward despite the barriers you are going to encounter – adaptability is the real secret to successfully innovating.

A case-study: multiculturalism in a flat world
Although I was born and bred in Australia, I have been brought up under a very strong Greek influence. With an Australian-born father, and a fresh-off-the-boat Greek mother – I have lived a life straddled in two cultures. Going to an Anglo-Saxon school, yet at the same time doing Greek classes at 9am Saturday (but leaving early for my schools footy games) – I grew to resent Australia’s multiculturalism policy. Without going into too much detail because this will turn it into a political discussion and detract from the point I wish to make – I disliked the fact that Greeks in Australia refused to integrate into the local culture. The Australian government’s stance of officially supporting Multiculturalism, which does things like pay for that Saturday morning tuition, was to me a stupid policy.

Fast forward to 2005, when I visited the Balkans as part of my nine months traveling around Europe. Serbia’s story is one of the saddest stories in Europe. Walking around the city of Belgrade, interacting with its inhabitants, and just generally experiencing Serbia – you realise you have come across a hidden gem in Europe. Yet once you look at the statistics and talk to some of the educated, you understand otherwise: a basket case situation that has little hope.

Serbia, like a lot of other countries I discovered in my travels, have a cultural problem: they can’t let go of the past. Millions of people have died over differing interpretations of history. The Republic of Macedonia’s identity is entirely staked on the fact they are situated on the lands of Alexander the Great. Identity to the nation states of Europe, is in history. And challenges to that history, and their identity, has led to some stupid wars affecting millions of innocent lives.

So guess what? I now think multiculturalism is the best thing my country could ever do, for the simple fact we can never have a fixed identity – what it meant to be Australian 50 years ago looks very different from what it looks like now. In Europe, identity is based on ethnicity with a fixed identity tied to history, language and a religion. In Australia, our identity isn’t allowed to be based on a certain ethnicity, and forces us to find common ground on what really matters like our way of life. If it wasn’t for the policy of Multiculturalism, we would be turning into one of these static nation states within Europe who become fixed as a certain point of time. The Greeks are still mourning over the Turks capturing the Great City of Constantinople from them in 1453 (which is why Tuesday is the unlucky day of the week for them). Yet for the countries like Australia, who don’t have much of a history – they are not locked – and consequently look forward, rather than back. Multiculturalism is a crucial ingredient to our success, because with all that diversity, it means we are constantly evolving our culture to the times without any one group fixing it. And with a globalised word, Australia’s ability to adapt to circumstances will be a key competitive advantage we have over countries.

If you don’t agree with me, have a read of Thomas Friedman’s The World is Flat – a book a entrepreneur/intrapreneur suggested I read. This guy who told me about the book was a German from Argentina, working for an Indian company to set up the company’s presence in Turkey! He told me that after he read that book, he quit his job and got himself into his current role. He faced the facts, and adapted his career.

Adaptability as success
You’re probably wondering what I am trying to get at, but to tie it back to my point about adaptability, successfully innovators need to constantly adapt to their environment. What happens with people once they get an idea, is that they spend all their time trying to fit it into a world that once existed, only for the world to be a entirely new place. Successful innovators need to constantly evolve their ideas, to the changing circumstances.

In October last year, I made a proposal at my firm to implement a new technology. For the months leading up to that point, people had to some extent talked down my idea and some even flat out rejected it. October however had me find the right person to hear my idea. And yet if I look at what I originally had thought, and what it is now – it is almost a completely different thing. Because when I pitched my idea, I was asked “why” it works and “how” is it different from anything else. It was that ‘why’ question that had me spend countless hours researching and understanding – adapting – my idea to the scenario being presented to me. I successfully made my business case, because I was given the opportunity to reframe my idea and adapt it to the circumstrances I was presented. Had I not adapted my original idea and vision, I wouldn’t be doing what I am doing now.

Of course, I could have summed up the above by mentioning Charles Darwin’s theory of evolution. Survival of the fittest, right? Adapt to the Green forest like that Green lizard that looks like a leaf, and you’ll find some food (rather than being the food yourself). Adaptability in life is a key critical success factor; and with innovation, it is the hidden factor that on the outside and in retrospect by others, gets attributed as luck.

Update 20/6/07: Catching up on some reading, I just came across a great posting by Marc Andreessen, an internet pioneer, who talks about the four types of luck and which nicely complements my thoughts above.

Tangler

This is the second post in a series – wizards of oz – which is to highlight the innovation we have down under, and how the business community needs to wake up and realise the opportunities. I review Tangler, a Sydney-based start-up that has recently released their application to the world as a public beta.

Tangler is a web-service that enables discussions over a network. Think of discussions with the immediacy of Instant Messaging (it’s easy), but with the persistency of a forum (messages are permanently stored). Discussions are arranged into communities of interest (groups), which are further broken down into topic areas. Click here to see a video overview.

Value

1) It’s a network application. Although it’s got a great design, and looks like a funky website, the real power of this web service is what it’s working towards: discussions over a network. Imagine a little widget with the topic “What do you think of Elias Bizannes?” placed on my (external) personal blog, my internal work blog, my myspace/facebook/social networking page, as well as it’s own dedicated forum on the Tangler site. A centralised discussion, in a decentralised manner. That’s big.

2) It’s community has great DNA. Communities are not easy things to build – my own experience on a getting-bigger-by-the-day internal project has shown that it is a complex science, touching everything from understand motivational theory to encouraging the right kind of behaviours (policing without policing). My usage on the site has shown to me that the active community building currently occuring, is on the right track. Anyone can hire a code monkey, wack on some flashy front-end, and say they have a great product. But not anyone can build a strong community – even Google struggles on this (the acquisition of YouTube happened largely because of community, because the YouTube community beat Google’s own service). Tangler’s community is already turning into a powerful asset – the DNA is there – now it just needs exposure, and the law of cumulative advantage will kick in.

3) The founder and staff are responsive to its community. I posted a question on the feedback forum, to prove this point: I got a response in an hour, on a Saturday. The staff at Tangler are super responsive – which in part, is due to the real-time discussion ability of the software – but also because of their commitment. As I state above – the value of Tangler is the community of users it builds – this type of responsiveness is crucial to keep its users satisfied to come back, because it makes them feel valued. Additionally, the community is driving the evolution of the application, and that’s the most powerful way to create something (adapting to where there is a need by the people that use it)

4) It’s a platform. What makes Tangler powerful, is that it encourages discussions around niche content areas. Make that niche content, being created for free. Low cost to produce + highly targeted content = an advertisers dream. Link it with a distributed network across the entire Internet (see 1 above), and you’ve got something special.

Conclusion

Social networks, which is what Tangler is, are characterised by:
1) the existence of a repository of user-generated content and
2) the need of members to communicate.

Tangler’s user-generated content and communications web make them an interesting fit for both media conglomerates and telecommunication companies (but for different reasons). I see a Tangler acquisition as a no-brainer for the big Telco’s. Integrating a social network like Tangler into Telstra, builds on the synergy between the communication needs of social network users and the communications expertise and service infrastructure of the communication companies. Unlike voice calls that are a commodity now, the Telco’s need to take advantage of their network infrastructure and accommodate for text-based discussions, which can be monetised for as long as the content exists (with advertising).

The challenge for Tangler however – as with any other Internet property – is that the scale of the audience of social networks determines the nature of the relationship with a communications company. Micro-sized social networks are not interesting to communication companies. Massive social networks are, but history has shown they would rather be partners than be acquired. To be attractive to the big end of town, Tangler needs to show to have a scale large enough to grow as a business but not too large to dictate the terms of the business.

My observations conclude me to think that they will be a hit once they open up their application to external developers, which will relieve the development bottleneck faced by their resource and time constrained team. However they shouldn’t rush this, as I still think their performance issues are not completely ironed out yet. An open API would be taken up by its enthusiastic community who are technologically orientated. Not too mention the strong relationships the CEO and CMO have forged with the local web entrepreneurial and development community in Australia.

My boss is currently doing a secondment as acting Finance Director at Sensis, Telstra’s media arm. Maybe I need to organise a catch-up with him, before these guys get snatched up by some US conglomerate!