Sunday, January 16th, 2005
Technorati really have been busy recently. As well as the new features I mentioned last week, they have now introduce a new tag search facility which allows you to search for posts that have been marked with a Technorati tag:
Technorati then pulls in any blog posts which have been thus tagged or categorised (it treats the categories as tags), along with photos from Flickr and bookmarks from del.icio.us, and collates them on a single page.
The link text does not have to be the same as the tag and could even be made invisible by linking to a space, although having discussed the matter with Kevin Marks, who clears up a couple of questions about tags on his blog, there are good reasons why that is not a good idea.
Firstly, linking to the same text as the tag gives people a link straight to the relevant tag page so that they can see who else is using that given tag, a bit like having the Technorati cosmos link on each post. It gives people clues about how you are tagging your posts, which may in turn encourage them to use similar tags so that your tag search results overlap, e.g. if two people are talking about tagsonomies then by tagging their posts as such, they both show on the same page and become part of the same discussion.
As Stephanie Booth points out, this is a very similar effect to that of TopicExchange, which I have used on occasion and found to be useful. TopicExchange works by collating trackback pings so is easier to use than Technorati tags, but Technorati probably has more reach and a better search facility. However, TopicExchange is run on a central server, whereas the Technorati tags are not only distributed, they are also open to being spidered, freeing up the tag data for other services to use.
Another reason for keeping your tag URI and your tag link text the same is that of spam. It’s fairly obvious that the Technorati system is open to abuse, as pointed out by Nootropic, but they are already aware of that possibility. Kevin Marks is very experienced at identifying spam, and is looking at ways to keep the database clean - one way of spotting tag spam would be if the link text did not match the tag.
All in all, this is an interesting way of using emergent tagsonomies to pull together diverse datastreams in one place. As it happens, I’ve had a number of different conversations recently with friends about such things, and this is a useful first step along the way to creating a single entry point for a variety of sources.
I do, as usual, foresee a couple of small problems. The Technorati tags at the moment need to be inserted manually in the markup by the blogger and, as anyone who has worked with taxonomies and tagsonomies knows, laziness wins every time - many people have a tendency to simply not bother adding metadata, no matter how useful it is. You only have to randomly browse the Flickr to see that not only do a fair proportion of users not tag their photos, many don’t even add meaningful titles.
A way round this would be for the blog tools to provide a specific ‘tag’ field for people to add in their metadata more easily, although it would be better if the software could somehow suggest tags for you. That is, though, getting in to the realms of metadata autodiscovery and automatic classification, which is far harder problem to solve than it first appears to be.
Perhaps another option would be for Technorati to use its keyword search facility as well as the tags, so my ‘Suw‘ page would show the most recent tagged blog posts and a selection of the most recent ‘Suw’ keyword search results. I would also like to see Technorati pull in other data streams, such as Furl bookmarks, (not everyone uses del.icio.us), or 43Things tags.
Another problem is one which bedevils all tagsonomies and that is the issue of synonyms and plurals: Do you tag using ‘tagsonomy’ or ‘tagsonomies’? Perhaps Technorati could use a Google Suggest style solution to this, so when you search for ‘tagsonomy’ it asks you if you would like to also see the results for ‘tagsonomies’ as well. It would also be good if one could use Boolean search operands in the tag search facility so that you can create complex search strings rather than being limited to one word or phrase.