Big week for the a/@rel attribute

by Bob DuCharme

Related link: http://www.google.com/googleblog/2005/01/preventing-comment-spam.html




On Monday I wrote about Technorati's use of the a element's rel attribute to let people link weblog entries to taxonomy entries. I mentioned that this attribute, which was designed to implement link typing, has been all but ignored in its twelve-year history, and that its use by a big-time application would give people more incentive to use it.



Yesterday one of the biggest applications of all announced a use for the same attribute. To fight the practice of comment spam (the addition of irrelevant comments to a weblog entry in order to boost the number of links to the spammer's site), Google has announced that a rel value of "nofollow" on a link will tell their crawlers not to consider this link when calculating the link destination's PageRank. Several weblog applications have already updated their software so that links added in comments by weblog readers will automatically include this attribute setting. This gives perpetrators of comment spam much less incentive to add these worthless comments.
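
To make the weblog-software side of this concrete, here is a minimal sketch in Python of how a comment system might add the attribute to reader-submitted links. The names NofollowRewriter and add_nofollow are made up for illustration, and this is not how any particular weblog application implements it; it only shows the idea of adding "nofollow" to the rel value of every a element while leaving existing rel values in place.

```python
from html import escape
from html.parser import HTMLParser


class NofollowRewriter(HTMLParser):
    """Hypothetical helper: re-emits HTML, adding "nofollow" to the rel
    attribute of every <a> start tag. Comments and declarations in the
    input are dropped, which is a simplification of this sketch."""

    def __init__(self):
        # Keep character references as written instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.out = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            # Pass other start tags through exactly as they were written.
            self.out.append(self.get_starttag_text())
            return
        rel_tokens = []
        others = []
        for name, value in attrs:
            if name == "rel":
                # rel holds a space-separated list of link types.
                rel_tokens.extend((value or "").split())
            else:
                others.append((name, value))
        if "nofollow" not in (token.lower() for token in rel_tokens):
            rel_tokens.append("nofollow")
        others.append(("rel", " ".join(rel_tokens)))
        rendered = "".join(
            f" {name}" if value is None
            else f' {name}="{escape(value, quote=True)}"'
            for name, value in others
        )
        self.out.append(f"<a{rendered}>")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are passed through unchanged.
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")


def add_nofollow(comment_html):
    """Return comment_html with rel="nofollow" on every link."""
    rewriter = NofollowRewriter()
    rewriter.feed(comment_html)
    rewriter.close()
    return "".join(rewriter.out)
```

Calling add_nofollow on a comment body before storing or displaying it would mark every reader-supplied link. A link that already carries a rel value, such as "tag", keeps that value and gains "nofollow" alongside it, since rel is a space-separated list.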



Comments added to my post of Monday (the good kind, not the spam kind) led to a discussion of how allowing people to add data and metadata to web pages that they don't own leads to abuse, and whether the potential abuse renders user-added metadata features useless. This has always been Google's justification for ignoring metadata, so it's nice to see them encouraging the use of link metadata. (It was tempting to title this posting "Newsflash: Google crawlers paying attention to an attribute value besides href!") They get extra credit for doing this with an attribute that's been around for so long, instead of making up a new one, which is what many companies would have done.






6 Comments

daviddeschenes
2005-01-20 11:00:35
Wouldn't a spectrum of link types be better?
I think that Google's decision to ignore "nofollow" links will be a great weapon in the fight against comment spam, but I also think it's only halfway to where we should be. Unfortunately, Google's assumption that a link either confers credibility or doesn't is too simple a model for the real world. A link on a web page may fall anywhere in a broad spectrum in terms of conferring credibility on the linked page. Let's say I've put together a web page which is a collection of links on topic "A" and that half of the links are to what I would consider primary resources on the topic and the other half are links to secondary resources. Wouldn't it be great if Google could understand the difference between the two groups of links? Readers of the web page can understand the difference through visual cues (grouping, highlighting, etc.) but there is nothing in the HTML that differentiates the links from Google's perspective. Maybe a collection of rel attribute values like "strong", "weak" and "none" is an appropriate spectrum to indicate the association between two web pages, or maybe it needs to be more complex than that (clearly "nofollow" would be at one end of whatever spectrum is appropriate). The addition of the "nofollow" value for the rel attribute is definitely an improvement, but the black-or-white distinction isn't adequate in my mind.
BobDuCharme
2005-01-20 12:19:40
Wouldn't a spectrum of link types be better?
Some would say that even a spectrum is too one-dimensional, and that a taxonomy of link types would be better. On the other hand, the rel attribute has had a selection of values to choose from for years and no one has used them.


What seems like a very simple, limited choice to us is a big change with a lot of work behind it for Google (only a major change to their ranking algorithm!) so I understand why they did it so simply.


I've written a lot in this weblog about link typing, particularly at http://www.oreillynet.com/pub/wlg/3094.

aristotle
2005-01-20 13:04:08
Except:
that it only solves Google's problem, and no one else's; in fact, it threatens to worsen everyone else's problems. See Ben Hammersley's commentary.
BobDuCharme
2005-01-20 14:53:32
Except:
Unlike Hammersley and a lot of other people, I'm not judging rel="nofollow" by how it may or may not eventually be used by different people with different motivations at various points in the future. I'm judging it by what it is now: additional metadata about a link to give a clue about the link author's feelings about the link destination.


If someone creates a reverse Google that ranks pages more highly because they have more nofollow links than anyone else, and someone else displays little icons showing what percentage of total links to a site were nofollow links compared with non-nofollow links, then good for them. More metadata provides more opportunities to do more things with more data; that's why I'm happy to see it.

uche
2005-04-26 12:58:08
Lo tek trackback
Still showing my n00b blogger stripes:


http://copia.ogbuji.net/blog/2005-04-26/Care_with_


"BTW, Bob DuCharme was one of the few people with sensible commentary when Google debuted rel="nofollow". See "Big week for the a/@rel attribute". But then again, what's new? Bob's the best commentator I know on linking. Full stop."
