My Photo

My Other Accounts

April 2008

Sun Mon Tue Wed Thu Fri Sat
    1 2 3 4 5
6 7 8 9 10 11 12
13 14 15 16 17 18 19
20 21 22 23 24 25 26
27 28 29 30      

April 09, 2008

The Economist Confirms: The Semantic Web Is Upon Us

I'd like to draw your attention to this great overview of the status of the semantic web by The Economist. Nice to see that the field is being picked up by such an influential paper, with a readership that's highly attractive for semweb applications. In passing and if you're interested, I added a quick comment that further develops some of the ideas I seeded in my last post (click on Comments after the article). It would be great to take the discussion there.

Metadata even Machines Can Process

In the idealized version of the semantic web, humans don’t deal with metadata, it’s all done under the hood to process information that’s delivered fully baked to us, the knowledge primates. How far are we from that vision? What does “fully baked” mean? Is it even possible?

If you recall, this issue ties back into the fundamental questions I raised in one of my previous posts. More specifically, it ties into the second question: “Who is primarily going to process the labels?” (with labels referring to metadata).

The potential answers were straightforward:

  • Machines (then we need rigorous standards)
  • Humans (then we don’t need standards as rigorous - tagging can do)
  • Both – most likely (then we have two sets of metadata standards… e.g. RDF/OWL and del.icio.us-type tags… and machine progressively work their way up to turn their machine labels into labels that are more useful for us humans)

Most players out there in the semantic web and knowledge management fields are focused on trying to get machines to process more of the metadata than they used to. The key enablers are on the one hand, information standards, and on the other, smart (and less smart) algorithms. Most players tend to place more efforts on one enabler or the other as a way to tackle the problem of automating metadata processing.

On the one hand, we have what I’ll identify as the data homogenizers: those players are driven by standardization trends in data and metadata. Because it’s much easier to process well-defined and recognizable data types, they tend to focus on information that has limited conceptual content and little need for disambiguation and context, such as times, places, people, and organizations. RDF together with OWL standards constitute the technological foundation of choice for this school of thought. The goal is to create labels that can enable machines to process all that information directly, without human input. So those labels are mostly leveraged by machines using simple agents to make associations at the data level within and across the online world, organizations, and private individual domains. Building on that, the hope is to tackle data of increasingly higher complexity through standardization, and the vision is to cover a larger fraction of the information building blocks and then find a way up towards higher levels of the semantic pyramid (see previous post).

Data homogenizers are getting the most press in the semantic web world currently, due to the readiness of the enabling technologies, mainly RDF, OWL, and SPARQL. As mentioned above, this stream is highly dependent on the capacity to standardize data formats. That is a huge limitation. Initiatives to extend this standardization to new formats, beyond the most basic ones we listed such as locations, names etc, must be constrained to either very basic data or very specific universes (e.g. a format is used by libraries but not beyond).

The data homogenizer ecosystem will keep expanding at an interesting pace, and will bring great developments and “mash-ups”, but likely not the wow feeling needed to kick off a gold rush on the user side. Freebase boasts that it can let you search for Jennifer Connelly films with actors who have appeared in a Steven Spielberg movie. Not the type of search I run more than half a dozen times a year. Twine can recognize people, organizations, and dates, and use that to twine data together. It’s a great step forward, but still falls short of solving a critical market problem.  And for the part which holds the highest user value, offering content and connection recommendations by creating metadata for complex information sequences, it still to this date relies greatly on human tagging. Which brings only incremental value to existing processes such as exploring expert bookmarks through tags in del.icio.us.

Problem is, those much-heralded data homogenizers are the least semantic part of the so-called semantic web. They detect things like person’s names and locations on a page, but do not surface what a document, a paragraph or a sentence are about. As such, their value resides mainly in complex search queries and pooling things together. Imagine a database that would match exact entries across documents to connect data together automatically. It would do well if you’re looking for Jennifer Connelly films with actors who have appeared in a Steven Spielberg movie. Not if you’d like to find what dances were in vogue across Latin America in the 50s and compile a summary of those dance techniques, a query that requires a broader range of metadata and analytical processing than currently enabled by RDF, OWL and SPARQL.

Discovering some relevant new information is a big problem and the data homogenizers help with that. But making sense of the knowledge out there, and helping with things like cutting through information overload by delivering just the insights I really care about, are problems of much larger proportions. Organizing my emails automatically, figuring out what part of a webpage is really about Google Earth API capabilities, aggregating several documents into one intelligently, that’s what I’d expect a semantically-enabled application to do.

So let’s look at the second group, and let’s call those the meaning makers. One example of such players would be Endeca, which offers automated content navigation. A number of emerging applications tackling the problem of autotagging also qualify. Those attempt to extract the meaning from group of documents, documents, and structured fragments of documents (the 3 upper levels of the semantic pyramid). They don’t go down to the building-block level to try and identify an author, a location, or a date, because that’s of limited help in extracting the broader meaning. The technologies they use tend to be much more mathematically complex and often mimics the process human brains go through when tagging information. This explains why they have a history of not working very well in practice.

Till today, the problem of these players has been to either rely too much on humans for the conceptual interpretation task, which creates an adoption barrier, or to provide relatively poor results when relying too heavily on algorithms. The semantic agents indeed tend to be weak compared to human reasoning. That’s a key reason why people-powered applications like technorati or del.icio.us continue to exist, frankly. A second problem lies in the output on the other side of the pipe: for now, because of the complexity and lack of standardization of the generated metadata, a lot of that metadata still appears as tags for human users to process. In other words, there is little that machines can do with the extracted metadata apart from feeding users content carrying similar tags. For instance, if a paragraph refers to Hinduism in a document on India, my algorithm will tag it accordingly, and then I may be able to bring up that paragraph as part of a search on Hinduism. Generally, the machine value-add process pretty much stops here. Beyond content retrieval, it won’t use the Hinduism tag much. But the point is, it could, and soon it likely will.

The capabilities these applications develop are that of automatically adding intelligent metadata to relatively unstructured pools of information. Everywhere information lives, such unstructured data tend to abound and prevail, and so it is much more preferable and scalable to explore smart solutions that will make sense of it, rather than to ask that this data be structured using standards that by nature can only be local, transient, and prejudiced. Reasoning capabilities give these applications a long-term edge in covering all levels the metadata pyramid in great depth. Once you’re capable of recognizing, extracting and playing with the concepts in a scientific white paper, little remains in the way of recognizing dates, locations, and people.

In practice, as highlighted previously, this is a much harder endeavor than standardizing data and then adding simple inference agents to process relationships. From a purely pragmatic angle, data homogenizers deliver, whereas the meaning makers tend to get bogged down in execution complexity, which clever players reduce by constraining themselves to B2B verticals. As a result, the market is directing increasing resources to the homogenizers, as a look at the different rounds of Radar Networks, Metaweb or Hakia suffice to demonstrate.

Looking ahead, it’s going to be interesting to see both sides wrestle to define and control the heart of the semantic web market. The data homogenizers are ready and their power is growing fast. They have the advantage of a pragmatic, systematic approach that’s bearing immediate fruits. The meaning makers have years of development behind them and the potential to deliver much more value to users, but they have only a meagre trail of commercially successful innovations to show for it, as their idealistic penchant led them to tried too much too soon, too often.

From a pure market slicing-and-dicing perspective, we could decouple both types of applications since the benefits they provide are quite different; but already, as metadata is deployed and progressively creeps across all levels and types of information and, more importantly, as delivering real users benefits requires both cleaner data and smarter agents, those two schools of thoughts are starting to converge.

Until algorithms are able to overlay and process relevant metadata at all levels of the semantic pyramid, it means that humans will continue to be huge direct contributors and direct processors of metadata, across most Semantic Web plays. For a player in that ecosystem, striking the right balance between automation and the need for human input, and leveraging any resulting metadata to maximize information processing and enrichment right out of the gate, will be critical.

Somehow, I was really looking forward to being spoon-fed enhanced information by the machines... Ultimately, I expect it to take us to our next evolutionary step. Yet it seems now that we’ll have to content ourselves with remaining knowledge primates for yet another little while.

March 26, 2008

The “shadow web” in the limelight

No, no, I haven’t disappeared, in spite of my promise of resuming this blog back in January. In the past few months, I have devoted most of my time and efforts to the start-up I work at. The good news, I think it is coming together nicely. We have gone through a rebranding effort to align ourselves with our market, and are now named Primal Fusion. How do you like it? The bad news is, we are still stealth, so I can’t do much more than direct you to the nice logo on our website for now. But no worries, this is all in line with our plan and timeline, and as things progress we hope to be out of stealth pretty soon. 

So, as you can imagine, I have learned quite a bit about the semantic web and some other adjacent spaces during all that time: there is so much I want to talk about here, but I’ll try to stay focused! The big thing is that the semantic web has come a long way in those past few months, and it now looks close to reaching a tipping point, with much increased public recognition and a landmark announcement by Sir Tim Berners-Lee that it’s now “open for business”. In that respect, I was recently invited by Paul Miller of Talis to participate in the “Semantic Web Gang”, a group of discussion on the semantic web. Both a great honor and a terrific experience. For our first podcast, we commented on the readiness of the semantic web. You can find it here on Paul Miller's blog.  And more information here on ZdNet. It is also syndicated on ReadWriteTalk. We are to repeat the experience monthly, and I will announce our new podcasts on this blog, so do subscribe to the RSS feed if you want to be kept posted. Note: due to vacations, I won't make the next podcast, but I should be back for the one in May.

So what did I learn during that podcast? Well, all of us around the virtual table seemed to agree that the technologies on which to build the semantic web have reached the “1.0” stage. Sure, they will continue to evolve dramatically in the coming months and years in response to market needs, but the point is: the initial foundations are in place. What is needed now is for those technologies to be leveraged further into compelling value propositions for mainstream end users. The recent Yahoo announcement about the incorporation of semantic metadata into their search engine certainly is a key milestone in that direction, and towards a truly “self-organizing” web as TechCrunch put it nicely. There was also the increased openness of Twine, which points to the birth of a mainstream semantic web application, “at last”. Not all is perfect, Twine was welcome with some disappointed reviews, mostly regarding the investment it takes to extract real value from the application. And so were Hakia and Powerset. Yet there is now an optimistic feel that the move towards the semantic, “implicit”, “shadow” web, has become unstoppable. All that is needed for these sparkles to ignite into a full-blown blaze seems to be THE killer value proposition.

I’d bet that the winning application is going to look a lot like a virtual machete and a map. With all the metadata that is going to be generated as a result of the “semantization” of the web, it looks as if the world of information is about to turn way denser. If you liked the forest, get ready for the jungle!

January 02, 2008

Back to the future

I am resuming my blog writing after a few weeks of interruption due to some news I am really excited to share with you... This past December, I started to work for a semantic web start-up called Terapath. The founders have put together a fantastic team of programmers and domain experts, and we have a great opportunity to invent whole new services based on a killer technology. My job there is to take that technology to market, make sure it wows you, and turn it into the commercial BIG BANG it has the potential to be. The start-up is in stealth mode and is expected to get out of it sometime this year. More to come on this blog...

In the meantime, a happy new year to all of you, and may 2008 bring us some great innovations to further catalyze the construction of a truly intelligent web.

December 05, 2007

Top 10 semantic web players

Wondering what the top applications are in the emerging semantic web field? The excellent Read/WriteWeb blog published a top 10 list last week. Freebase, Powerset, Twine, AdaptiveBlue, Hakia,Talis, TrueKnowledge, TripIt (an interesting application of the semantic web concepts to the travel world), Clearforest (bought by Reuters) and Spock all made the list. Several of these apps are still in beta.

I'll make a very risky bet...: the space is going to see lots of new entrants in the coming months. Ok, a riskier one: applications by some big names, like Google and Microsoft, will make it in the top 10 list six months from now.

Ps. As a practical demonstration of the claims I made in my "Read 42 blog comments" post, I encourage you to read the comments posted below Read/WriteWeb's post! Lots of practical insight into people and other companies active or interested in that space...

December 04, 2007

Creating metadata: a task for humans or machines?

Today I’d like to tackle the first “fundamental” question I raised in my November 23rd post: who will label all this data?

A big driver for “who” will label the data is “what” is to be labeled. There is not just one kind of data out there, and for the purpose of metadata creation I’d distinguish between at least 4 types of data: basic building blocks (e.g. sentences in a text document), structured fragments of documents (e.g. a paragraph), self-standing documents (e.g. a speech), and groups of documents (e.g. set of conference speeches). I have synthesized this in the slide below.

Greg_boutin_metadata_types_by_dat_3


Each one of these data types calls for a different type of metadata. Metadata for documents and groups of documents is mostly going to be used as is to organize these documents and return search results to a human user. This metadata needs to be provided in a synthesized format, usually in the form of a few keywords or expressions. Standardization of this metadata can remain relatively limited, as machine only need to match these text strings in a mostly straightforward manner.

On the other hand, metadata for what I dubbed ‘building blocks’, the most basic structured unit in a document, will be highly standardized in order to be processed by algorithms, which will weave blocks together by relying on metadata and, if all goes well, turn all this into ‘intelligent’ answers. This metadata therefore is purely designed for machine use.

Metadata for ‘structured fragments’ lies in between that for documents and for 'building blocks', as it can be leveraged for direct human use or for machine processing, depending on the need. Generally, however, I’d see it more aimed at human use, due to the lack of standardization of the underlying data (the computer will likely need to go down one level and still process the metadata for building blocks to make sense of it all.)

So are machines better equipped than humans to create that lower-level metadata for machine use? Just looking at the cheer volume of metadata to be created, one would hope so. Indeed, the volume of metadata to be generated is inversely proportional to the level of the data it relates to. This is evident: labeling each sentence in a document will generate much larger volumes of metadata than tagging the overall document. See the slide below for illustration.

Greg_boutin_metadata_volumes_by_d_2



Unfortunately, one problem remains with algorithms: accuracy. How accurate are the metadata-weaving algorithms today? Overall, not very. To be accurate, algorithms need to focus on a very small part of the problem. For instance, recognizing addresses, or people, or events, in a document, and generating RDF metadata for them.

But algorithms are fast improving. So I expect machines to progressively climb up the metadata food chain. It is possible that they may not even do this in the anticipated order. Algorithms may emerge that may tag document accurately, before they even overlay metadata on things like sentences accurately.

How fast will all of that metadata automation happen?

Here is where I part way with many out there…

A lot of folks in the space seem to ask themselves optimistically how to best automate the task of building metadata, and not really how much the task can be automated within their relevant timeframe. They work on replacing users input as much as possible through mathematical models, and anticipate them to be ready in six months or a year, when most likely they will require another 5 or 10 years of efforts to get to anywhere practical - if they do get there. By focusing instead on building systems that best stimulate, aggregate and synthesize user inputs (ironically, meta-systems!), they could
within a year or so deliver a working solution, and then build on that potential success to gradually increase the level of automation in their application.

In sum, I suggest here that solutions that (
intelligently) incorporate human input further will perform better over time. We need a healthier balance between human input and automatic metadata production. Given the poor performance of current metadata applications, focusing on algorithms that enhance the collection of user input and learn from it rather than autistically extract metadata from the data itself is a better investment of one’s time.


Will the differential between human performance and machine performance likely remain wide enough to justify the investments in collecting human input for years to come? A multibillion-dollar question, but I’d bet that it will. Because it’s likely that metadata will become increasingly user-driven, dynamic and volatile, in line with the ever- and faster-changing user needs and mental frameworks. As long as the ultimate consumer of all that metadata remains human, algorithms will need our inputs. So building capabilities in the “wisdom of crowds” area today can only help position you better in the space tomorrow.

Of course, it can be said almost with certainty that at some point, user input collection will be fully automated and transparent, and machines will create metadata with higher accuracy and speed levels than would be possible through human processing. As of today though, no algorithm I came across has proved capable of coating metadata accurately and comprehensively without extensive human input. We seem to be years away from  intelligent systems that will "get it". And guess what? Getting to those systems will require the same thing as trying to do without them: focus on better ways to stimulate, aggregate and synthesize user inputs!

In a future post I’ll attempt to look into (1) which "users" will provide those inputs: programmers, experts, mainstream users? (2) how user input can be collected and integrated into metadata-generating solutions.

November 26, 2007

Marketing RDF: theory and practice

Before I build on my recent "four questions" post, I highly recommend listening to the following podcast, as it offers an interesting example of the practical difficulties with marketing RDF, and some interesting thoughts on the future of the semantic web

The secrets to tagging, the semantic web and the universe: read 42 blog comments

Note: the follow-up to my previous post will be posted in a short while - below's post is not directly related

One thing that never fails to surprise me is how much insight you can derive simply by listening to users or, in the present case, by reading blog commentators -- and how few businesses do it in practice… Ask yourself: does your organization read user blogs? Does it act on the findings?

I
recently saw the value of this first-hand. As a heavy user of file management solutions and through a quick survey I had ran previously, I had come up to the realization that desktop tagging does not work yet for most people, but will be big in the future. One of the many things telling me it will be big is that Apple and Microsoft both enabled tagging in their latest OS (and well before that too for Apple). What tells me that it does not work just yet is that the vast majority of OSX users (let alone Windows users) do not tag their files.

Continuing with a wide angle, I thought: the key to tagging is to reduce its cost and to make tagging outputs (search and other possible applications) more rewarding from an end-user perspective. Generalizing further, the same goes for the semantic web: to succeed, reduce the cost of adding metadata and increase the outputs from metadata. Back to tagging, I wondered: how do you do that?

Soon enough, as I was reading through blog comments to detect needs from early adopters, I came across these insight-packed comments by users of tagging applications (in case you wondered, I didn't check whether there are 42 comments):

  • “How nice if WinFS had good metadata possibilities. But then again, I can just switch to Macs.”

If you didn’t know yet that OSX is better at tagging than windows, now you would

  • “I already have a very tightly-controlled tagging system on del.icio.us (it takes a lot for me to consider using a new tag), and I see the awesome implications for file maintenance. If only iTunes let me tag my music files in a del.icio.us-like manner...”

People who tag with del.icio.us want to reuse those tags on their desktop and other web apps too. Who would think? Not del.icio.us or iTunes apparently, who may be slowly missing a huge opportunity

  • “Although I am a new del.icio.us user, I have already found myself using certain tags more often than others. I also find myself often using tags that are 'recommended' or 'popular', which can often lead to more consistent tagging.”

Another del.icio.us user reading about desktop tagging… and showing how much value this user base places on tagging consistency

  • “I wish there was a way to have it auto-complete individual tags as I type them in (like del.icio.us), based on tags I have already used.”

On the importance of an autocomplete feature

  • “I'm finding more and more that tagging beats the heck out of all the other organization systems I've tried”

Here is a clear sign of an early trend with a fantastic asset: a passionate user base that spreads the word

  • “Does this work with smartfolders?”

A suggestion for a new product feature

  • “When I tag a photo with keywords in Picasa, for example, I can't see those keywords when I look at the properties of that file in Windows Explorer. The same is true with a lot of other IPTC metadata that images use. Again, when I tag images in Adobe Bridge, I can't see all the information in Picasa. Anyway, my point is that sometimes metadata does not work. I would like to see a more "unified" approach, where tagging will work across software and operating systems.”

Wow. In other words, a more transversal tagging solution could make tagging “work” for such users

  • “I work in the VFX sector for movies and television. I produce several thousand files a week as rendered images or different versioned scene files (…) To keep two systems running - folders for work and tagging on my workstation (and probably my home machine) - seems like overkill. If you have some inspirational thoughts for me I am all ears.”

How about a tagging system that replaces your folders and works across platforms? Worth exploring, especially if media companies are among the buyers

  • “Ideally, the tagging would take place in the Save window"

One-step tag and save! Think how much more painful it is to tag files after you’ve already been asked where to save them? Unfortunately that’s what every application out there asks users to do… No wonder desktop tagging is so slow in taking off...

In passing, my blog commentators gave me the names of multiple applications that attempt to address some of those needs. A year after most of these comments were posted, though, I still don’t see the killer app on the market. Is anyone reading?

November 23, 2007

Whose semantic web is it anyway?

Over the past few months I have dug deeper and deeper in the concepts surrounding the idea of the "semantic web". I’ll share a bit of this explorative process with you in the upcoming posts, but today I’d like to start with some of the temporary conclusions that I reached, and hopefully help structure the debate going forward through my small contribution.

As they say, wisdom is knowing what you don’t know. Since I’m always striving for wisdom..., let me confess that those insights I developed are nothing else than, well, questions…
I have come to see four questions as the fundamental issues whose answers will shape the web going forward.

(note to semantic web novices: you can double-click on any word to see its encyclopedic definition – don’t be shy…)

As I see it, the semantic web is all about labeling data to describe it at a higher conceptual level and then using that information to process this data (while tying previously unrelated data together as needed). Using a comparison, it’s a bit like we have built houses and roads across the land and we are now starting to set up signposts all over to indicate who lives where and what there is to see and do, all of that in a more or less standardized fashion. So the semantic web is about setting signposts on the web (and most likely on your desktop too, so one could say the semantic web will extend beyond the web!). Here is a picture I like to use to illustrate this:

Semanticwebbygregboutin


To illustrate the practical implications of this for the mainstream computer user, let’s take a web page with car dealer contact information. With the semantic web, this data could be labeled with signs in effect indicating to the computer that “this is an address”, “this is a phone number”, “this is the car dealer's name” etc. Using these labels your browser can send the data to your contact management application and this application will know what to do with it: it will turn it automatically into contacts.

I also talked about tying together previously unrelated data. Using our car dealer example, that would mean that your browser may be able to tie this page to each of the car dealer homepages and their (official) prices, all automatically, to present you with a synthesis of the dealer locations, the prices and the pictures for the car you’re considering buying. The only thing it won’t do just yet is to negotiate it for you.

With the lay of the land completed, I can now introduce you to the four key questions I see as decisive for the future shape of the semantic web:

1.    Who will label all this data?

  • Artificial Intelligence algorithms (i.e. machines)
  • Programmers
  • Expert users
  • Mainstream users
  • A combination of the above (most probable answer! In particular, it is to be expected that algorithms will take a larger role over time)


2.    Who is primarily going to process these labels?

  • Machines (then we need rigorous standards)
  • Humans (then we don’t need standards as rigorous  - tagging can do)
  • Both – most likely (then we have two sets of metadata standards… e.g. RDF/OWL and del.icio.us-type tags… and machine progressively work their way up to turn their machine labels into labels that are more useful for us humans)


3.    To embed or not to embed metadata?

  • Just as HTML embeds display information within actual content, some proposed standards such as microformats embed semantic information directly in the content 
  • The standards most often associated with the semantic web, RDF/OWL, do not embed semantic information into the data. So far it seems that the web authorities are moving forward with this approach, which makes some people worry that will create a "shadow" web. But there is a successful precedent: CSS took the display information outside the HTML document, creating  a set of standard labels that can easily be updated and reused across pages – and today most new sites use CSS

4.    Which format should be used for all these labels?


None of this is trivial matter. I’ll explain it further, but the overarching point to keep in mind as we discuss those issues is this: the answers to questions one, two and three will determine the prevalent labeling technology on the semantic web. And vice –versa. That likely will also determine who is in the driver seat, and who benefits the most from the semantic web. A $52.4bn question according to Mills Davis (of Project 10x – see the post “Mills Davis talks with Talis about multi-billion dollar markets and the Semantic Wave”)

Now I’ll dig into each of these questions in order to clarify what they mean and, practically, what’s at stake for the semantic web users (and consumers --- I’m a proud marketer,
remember?)

But before I go ahead, I’d like to ask the semantic web experts among you: do you see any other fundamental questions?

Continue reading "Whose semantic web is it anyway?" »

November 22, 2007

Blog bling bingo

Welcome to my new blog! I've explored a few blogging solutions and settled on typepad. I hesitated with blogger.com (which is free, but has limited features), and squarespace (which looks very good except on the publicity side). My only reproach to typepad is how design features are limited in the basic version (I don't seem to be allowed to customize my blog heading!), but I like how it integrates with everything that matters out there (Facebook, FeedBurner... and some other f-words...)

So, anyway, I'm starting this new blog here, and my intentions are to write about new technologies, primarily in the computing space and specifically on the semantic web. I may also post a few words on photovoltaics and energy technologies from time to time, as I have acquired decent experience in that space.

If you don't know me and you stumbled upon this, you may be interested in some quick background info. Well, I am active in the tech space, and particularly in the semantic web area (surprise surprise), as an entrepreneur and a marketer / business developer. My goal is to build or help build great companies that aim to shape the future, and act as an evangelist and active product development contributor for them (especially by developing a practical knowledge of users and future potential users). I went to school at Stanford where I got an MBA (wait, wait, see my posts first before assessing whether I know something about technology...) and I worked for a management consulting firm called the Boston Consulting Group (just for 2 years!) prior to jumping back into the tech start-up world.

Before I call this my first post, I wanted to add that I saw a great ppt presentation on the semantic web posted here by Nova Spivack. Now I don't second everything he writes and especially the limited potential he attributes to tags, but that's mainly because I've been working on ways to eliminate the cons he describes regarding tagging - so I'm biased. And since it appears that Nova has been working on RDF/OWL, he's logically supporting these standards in his presentation (no surprise, I'd be too if I worked at Radar Networks) and suggesting that RDF/OWL can constitute the foundations for the semantic web.

Still, Nova's presentation is the best description I've seen so far of the different approaches towards "smarter data", and it's an inspiring job of popularization. I'll be looking at the upcoming news at Radar Networks with interest. And I think that sharing this presentation confirms Nova's position among the forefront runners of the upcoming wave of change. All my congratulations to him for a very nice evangelization job. I'd make sure I've tagged his page if I were you!