{"id":123,"date":"2007-09-12T19:33:46","date_gmt":"2007-09-12T09:33:46","guid":{"rendered":"http:\/\/Liako.Biz\/2007\/09\/don%e2%80%99t-get-the-semantic-web-you-will-after-this\/"},"modified":"2007-09-12T19:33:46","modified_gmt":"2007-09-12T09:33:46","slug":"dont-get-the-semantic-web-you-will-after-this","status":"publish","type":"post","link":"https:\/\/eliasbizannes.com\/blog\/2007\/09\/dont-get-the-semantic-web-you-will-after-this\/","title":{"rendered":"Don&#8217;t get the Semantic Web? You will after this"},"content":{"rendered":"<p>Prior to 2006, I had sort of heard of the <a href=\"http:\/\/en.wikipedia.org\/wiki\/Semantic_web\">Semantic Web<\/a>. To be honest, I didn\u201a\u00c4\u00f4t know much \u201a\u00c4\u00ec it was just another buzzword. I\u201a\u00c4\u00f4ve been hearing about <a href=\"http:\/\/en.wikipedia.org\/wiki\/Microformats\">Microformats<\/a> for years, and cool but useless initiatives like <a href=\"http:\/\/en.wikipedia.org\/wiki\/XHTML_Friends_Network\">XFN<\/a>. However to me it was simply just another web thing being thrown around.<\/p>\n<p>Then in August 2006, I came across Adrian Holovaty\u201a\u00c4\u00f4s article where he argues <a href=\"http:\/\/www.holovaty.com\/blog\/archive\/2006\/09\/06\/0307\">journalism needs to move from a story-centric world to a data-centric world<\/a>. And that\u201a\u00c4\u00f4s when it dawned on me: the Semantic web is some serious business.<\/p>\n<p>I have since done a lot of reading, listening, and thinking. I don\u201a\u00c4\u00f4t profess to be a Semantic Web expert \u201a\u00c4\u00ec but I know more than the average person as I have (painfully) put myself through videos and audios of academic types who confuse the crap out of me. I\u201a\u00c4\u00f4ve also read through a myriad of academic papers from the W3C, which are like the times when you read a novel and keep re-reading the same page and still can\u201a\u00c4\u00f4t remember what you just read.<\/p>\n<p>Hell \u201a\u00c4\u00ec I still don\u201a\u00c4\u00f4t get things. But I get the vision, so that\u201a\u00c4\u00f4s what I am going to share with you now. Hopefully, my understanding will benefit the clueless and the skeptical alike, because it\u201a\u00c4\u00f4s a powerful vision which is entirely possible<\/p>\n<p><strong>1) The current web is great for humans; useless for machines<\/strong><br \/>\nWhen you search for ambiguous terms, at best, search engines can algorithmically predict some sort of answer that partially answers your query. Sometimes not. But the complexity of language, is not something engineers can engineer to deal with. After all, without ambiguity of natural languages, the <a href=\"http:\/\/www.mi.sanu.ac.yu\/vismath\/igor\/index.html\">existence of poetry is impossible<\/a>.<\/p>\n<p>Fine.<\/p>\n<p>What did you think when you read that? As in: \u201a\u00c4\u00faI\u201a\u00c4\u00f4ve had it \u201a\u00c4\u00ec fine!\u201a\u00c4\u00f9 which is like another way of saying ok or agreeing with something. Perhaps you thought about that parking ticket I just got \u201a\u00c4\u00ec illegal parking gets you fined. Maybe you thought I am applauding myself by saying that was one fine piece of wordcraftship I just wrote, or said in another context, like a fine wine.<\/p>\n<p>Language is ambiguous, and depending on the context with other words, we can determine what the meaning of the word is. Search start-up company <a href=\"http:\/\/en.wikipedia.org\/wiki\/Powerset_%28corporation%29\">Powerset<\/a>, which is hoping to kill <a href=\"http:\/\/en.wikipedia.org\/wiki\/Google\">Google<\/a> and rule the world, is employing exactly this technique to improve search: intelligent processing of words depending on context. So by me putting in \u201a\u00c4\u00fait\u201a\u00c4\u00f4s a fine\u201a\u00c4\u00f9, it understands the context that it\u201a\u00c4\u00f4s a parking ticket, because you wouldn\u201a\u00c4\u00f4t say \u201a\u00c4\u00fait\u201a\u00c4\u00f4s a\u201a\u00c4\u00f9 in front of \u201a\u00c4\u00f2fine\u201a\u00c4\u00f4 when you use it to agree with something (the \u201a\u00c4\u00f2ok\u201a\u00c4\u00f4 meaning above).<\/p>\n<p>But let\u201a\u00c4\u00f4s use another example: \u201a\u00c4\u00faHilton Paris\u201a\u00c4\u00f9 in Google \u201a\u00c4\u00ec the worlds most \u201a\u00c4\u00f2advanced\u201a\u00c4\u00f4 search engine. Obviously, as a human reading that sentence, you understand because of the context of those words I would like to find information about the Hilton in Paris. Well maybe.<\/p>\n<p>Let\u201a\u00c4\u00f4s see what Google comes up with: Of the ten search results (as of when I wrote this blog posting), one was a news item on the celebrity; six were on the celebrity describing her in some shape or form, and three results were on the actual Hotel. Google, at 30\/70 \u201a\u00c4\u00ec is a little unsure.<\/p>\n<p>Why is Paris Hilton, that blonde haired thingy of a celebrity, coming up in the search results?<\/p>\n<p>Technologies like Powerset apparently produce a better result because it understands the order of the words and context of the search query. But the problem with these searches, isn\u201a\u00c4\u00f4t the interpretation of what the searcher wants \u201a\u00c4\u00ec but also the ability to understand the actual search results. Powerset can only interpret so much of the gazilions of words out there. There is the whole problem of the source data, no just the query. Don\u201a\u00c4\u00f4t get what I mean? Keep reading. But for now, learn this lesson<\/p>\n<p><em>Computers have no idea about the data they are reading. In fact, Google pumping out those search results is based on people linking. Google is a machine, and reads 1s and 0s \u201a\u00c4\u00ec machine language. It doesn\u201a\u00c4\u00f4t get human language<\/em><\/p>\n<p><strong>2) The Semantic web is about making what human\u201a\u00c4\u00f4s read, machine readable<\/strong><br \/>\n<a href=\"http:\/\/www.w3.org\/People\/Berners-Lee\/\">Tim Berner\u201a\u00c4\u00f4s Lee<\/a>, the guy that invented the World Wide Web and the visionary behind the Semantic Web, prefers to call it the \u201a\u00c4\u00f2data web\u201a\u00c4\u00f4. The current web is a web of documents \u201a\u00c4\u00ec by adding this extra data to content \u201a\u00c4\u00ec machines will be able to understand it. Metadata, is data about data.<\/p>\n<p>A practical outcome of having a semantic web, is that Google would know that when it pulls up a web page regardless of the context of the words \u201a\u00c4\u00ec it will understand what the content is. Think of every word on the web, being linked to a master dictionary.<\/p>\n<p><em>The benefit of the semantic web is not for humans \u201a\u00c4\u00ec at least immediately. The Semantic Web is actually pretty boring with what it does \u201a\u00c4\u00ec what is exciting, is what it will enable. Keep reading.<\/em><\/p>\n<p><strong>3) The Semantic web is for machines to interpret, not people<\/strong><br \/>\nA lot of the skeptics of the semantic web, usually don\u201a\u00c4\u00f4t see the value of it. Who cares about adding all this extra meta data? I mean heck \u201a\u00c4\u00ec Google still was able to get the website I needed \u201a\u00c4\u00ec the Hilton in Paris. Sure, the other 60% of the results on that page were irrelevant, but I\u201a\u00c4\u00f4m happy.<\/p>\n<p>I once came across a Google employee and he asked \u201a\u00c4\u00fawhat\u201a\u00c4\u00f4s the point of a semantic web; don\u201a\u00c4\u00f4t we already enough metadata?\u201a\u00c4\u00f9 To some extent, he\u201a\u00c4\u00f4s right \u201a\u00c4\u00ec there are some websites out there that have metadata. But the point of the semantic web is so that machines once they read the information, can start thinking like how a human would and connecting it to other information. There needs to be across the board metadata.<\/p>\n<p>For example, my friend Michael was recently looking to buy a car. A painful process, because there are so many variables. So many different models, different makes, different dealers, different packages. We have websites, with cars for sale neatly categorised into profile pages saying what model it is, what colour it is, and how much. (Which may I add, are hosted on multiple car sites with different types of profiles). A human painfully reads through these profiles, and computes as fast as a human can. But a machine can\u201a\u00c4\u00f4t read these profiles.<\/p>\n<p>Instead of wasting his (and my) weekends driving around Sydney to find his car, a machine could find it for him. So, Mike would enter his profile in \u201a\u00c4\u00ec what he requires in a car, what his credit limit is, what his prior history with cars are \u201a\u00c4\u00ec everything that would affect his judgement of a car. And then, the computer can query every online website with cars to match the criteria. Because the computer can interpret these websites across the board, it can evaluate and it can go back to Michael and say \u201a\u00c4\u00fathis is the car for you, at this dealer \u201a\u00c4\u00ec click yes to buy\u201a\u00c4\u00f9.<\/p>\n<p><em>The semantic web is about giving computers the information to be able to interpret data, so that it can do what they do really well \u201a\u00c4\u00ec compute.<\/em><\/p>\n<p><strong>4) A worldwide database<\/strong><br \/>\nWhat essentially Berner\u201a\u00c4\u00f4s Lee envisions, is turning the entire world wide web into a database that can be queried. Currently, the web looks like Microsoft Word \u201a\u00c4\u00ec one swab of text. However, if that swab of text was neatly categorised in an Excel spreadsheet, you could manipulate that data and do what you please \u201a\u00c4\u00ec create reports, reorder them, filter, and do whatever until your heart is content.<\/p>\n<p>At university, I was forced to do an Information Systems subject which was essentially about the theory of databases. Damn painful. I learned only two things from that course. The first thing was that my lecturer, tutor, and classmates spoke less intelligible English than a caterpillar. But the second thing was that I learned what information is and how it differs from data. I am now going to share with you that lesson, and save you three months of your life.<\/p>\n<p>You see, data is meaningless. For example, <em>23 degrees<\/em> is data. On its own, it\u201a\u00c4\u00f4s useless. Another piece of data in <em>Sydney<\/em>. Again, &#8211; useless. I mean, you can think all sorts of things when you think of Sydney, but it doesn\u201a\u00c4\u00f4t have any meaning.<\/p>\n<p>Now put together <em>23 degrees <\/em>and <em>Sydney<\/em>, and you have just created information. Information is about creating relationships between data. By creating a relationship, an association, between these two different pieces of data \u201a\u00c4\u00ec you can determine it\u201a\u00c4\u00f4s going to be a warm day in Sydney. And that is what information is: Relationship building; connecting the dots; linking the islands of data together to generate something meaningful.<\/p>\n<p><em>The semantic web is about allowing computers to be able to query the sum of human knowledge like one big database to generate information<\/em><\/p>\n<p><strong>Concluding thoughts<\/strong><br \/>\nYou are probably now starting to freak out and think \u201a\u00c4\u00faTerminator\u201a\u00c4\u00f9 images with computers suddenly erupting form under your computer desk, and smashing you against the wall as a battle between humans and computers begins. But I don\u201a\u00c4\u00f4t see it like that.<\/p>\n<p>I think about the thousands of hours humans spend trying to compute things. I think of the cancer research, whereby all this experimentation occurring in labs, is trying to connect new pieces of data with old data to create new information. I think about computers being about to query the entire taxation legislation to make sure I don\u201a\u00c4\u00f4t pay any tax, because it knows how it all fits together (having studied tax, I can assure you \u201a\u00c4\u00ec it takes a lifetime to only understand a portion of tax law). In short, I understand the vision of the Semantic web as a way of linking things together, to enable computers to compute \u201a\u00c4\u00ec so that I can sit on my hammock drinking my beer, as I can delegate the duties of my life to the machines.<\/p>\n<p>All the semantic web is trying to do, is making sure everything is structured in a consistent manner, with a consistent dictionary behind the content, so that a machine can draw connections. As Berner\u201a\u00c4\u00f4s Lee said on one of the videos I saw: \u201a\u00c4\u00fait\u201a\u00c4\u00f4s all about creating links\u201a\u00c4\u00f9.<\/p>\n<p>The process to a Semantic Web is boring. But once we have those links, we can then start talking about those hammocks. And that&#8217;s when the power of the internet &#8211; the global network &#8211; will really take off.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Prior to 2006, I had sort of heard of the Semantic Web. To be honest, I didn\u201a\u00c4\u00f4t know much \u201a\u00c4\u00ec it was just another buzzword. I\u201a\u00c4\u00f4ve been hearing about Microformats for years, and cool but useless initiatives like XFN. However to me it was simply just another web thing being thrown around. Then in August [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[408,119,158,169,181,190,412,215,216,265,295,313,417,350,369,382,423,392,396,397],"class_list":["post-123","post","type-post","status-publish","format-standard","hentry","category-semantic-web","tag-business","tag-data","tag-future","tag-google","tag-humans","tag-information","tag-internet","tag-link","tag-links","tag-paris","tag-results","tag-search","tag-semantic-web","tag-sydney","tag-trends","tag-usa","tag-video-blog","tag-web","tag-wiki","tag-wikipedia","post-preview"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/posts\/123","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/comments?post=123"}],"version-history":[{"count":0,"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/posts\/123\/revisions"}],"wp:attachment":[{"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/media?parent=123"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/categories?post=123"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/eliasbizannes.com\/blog\/wp-json\/wp\/v2\/tags?post=123"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}