On The Margins

(Masood Mortazavi)



20041001 Friday October 01, 2004

[ Web ] George Soros Starts a Blog

George Soros, financial wizard and author, has just started a blog, where he has professed an eagerness to hear from his readers.

2004-10-01 22:51:48.0

[ Web ] The Failure of Search (or the Fallacy of Abundance)

So, what's going on with web search? Why is it giving such a high rank to material that is marginal at best, such as what this weblog has to say on certain topics? Or, as I asked earlier, why do we even feel that we get anything relevant when we perform a search on the Web? How much better material are we actually missing when we limit ourselves to the findings of a search engine?

It is in asking those sorts of questions that we can arrive at modest discoveries or at least novel explanations of what we see around us.

To further the investigation I reported earlier, I went back to the chapters on search in Hubert Dreyfus' little book, On the Internet. According to Dreyfus, given the immense size of the Net, it is "estimated that search engines can recall at most 2 per cent of the relevant sites." (The number might have changed in the last three years but I don't believe that the changes, if any, would affect the arguments in any drastic way.)

We need to ask why "content" (or "information") retrieval systems are receiving so much hype when they are hardly adequate for finding specific content. How could my weblog entries, even if they are somewhat useful, be ranked as the third most useful or important content on certain scholars whom I've only occasionally quoted and on whose works I still consider myself a novice?

Surely, this sort of system behavior cannot be good if we hope to find important documents or bits of knowledge through search and information retrieval.

To explain the hype regarding search and information retrieval, Dreyfus quotes computer scientist David Blair, who cites information retrieval (IR) pioneer Don Swanson:

IR pioneer Don Swanson observed this phenomenon decades ago, and calls it the "fallacy of abundance". The fallacy of abundance is the mistake a searcher makes when he uses a large IR system and is able to find some useful documents. Swanson pointed out that on a sufficiently large system . . . almost any query will retrieve some useful documents. The mistake is to think that just because you got some useful documents the IR system is performing well. What you don't know is how many better documents the system missed.

And so . . . since my weblogs can be ranked highly by Google for certain subjects, they may be perceived (by some searchers) to be more important than they really are.
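The fallacy of abundance can be put in terms of the two standard IR measures, precision (how much of what was retrieved is relevant) and recall (how much of what is relevant was retrieved). The sketch below is only an illustration: the document ids and counts are made up, chosen so that recall lands near the 2 per cent figure Dreyfus cites, and the function name `precision_recall` is hypothetical.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a set of retrieved document ids."""
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = retrieved & relevant
    # Precision: share of the results that are relevant.
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    # Recall: share of all relevant documents that were found.
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical numbers, for illustration only.
relevant = set(range(1000))                 # 1000 relevant documents exist
retrieved = set(range(18)) | {5000, 5001}   # 20 results, 18 of them relevant

precision, recall = precision_recall(retrieved, relevant)
print(f"precision = {precision:.2f}")   # what the searcher sees
print(f"recall    = {recall:.3f}")      # what the searcher cannot see
```

The searcher only ever sees the result list, so the high precision is visible while the low recall is not. That gap is the situation Swanson describes.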

2004-10-01 11:11:02.0

[ Web ] Do Google Rankings Mean Anything

Simple errors and break-downs often lead to new discoveries . . . The type of break-down is almost immaterial.

Error:

A few days ago, I accidentally removed all records of referers and hits on my weblog, all 106,500 of them.

This simple push-of-a-button dropped me out of the hot list you see at the bottom of blogs.sun.com.

Curiosity:

Before this incident, I'd never thought much about the referers list, but deleting all the hit records made me curious. So, I have now gone back to the gradually accumulating referers list for this weblog and have tried to learn something about the referer URL distribution. A significant majority of the hits on this weblog are direct, and a sizable minority come from searches on Google (and competing search engines).
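A rough tally of this kind can be sketched in a few lines. This is not how the weblog software computes its referers list; it is a minimal sketch that assumes the referers are available as plain strings, that an empty value or "-" means a direct hit, and that the short host list in `SEARCH_HOSTS` (a hypothetical name) is a good enough stand-in for "Google and competing search engines". The sample data is made up.

```python
from collections import Counter
from urllib.parse import urlparse

# Assumed, incomplete list of search engine host fragments.
SEARCH_HOSTS = ("google.", "yahoo.", "msn.")


def classify(referer):
    """Label a referer string as 'direct', 'search', or 'other'."""
    if not referer or referer == "-":
        return "direct"
    host = urlparse(referer).hostname or ""
    # Match "google.com" as well as "www.google.com", but not "notgoogle.com".
    if any(host.startswith(p) or ("." + p) in host for p in SEARCH_HOSTS):
        return "search"
    return "other"


# Made-up sample of referer values.
referers = [
    "",
    "-",
    "http://www.google.com/search?q=oliver+williamson",
    "http://search.yahoo.com/search?p=chester+barnard",
    "http://blogs.sun.com/",
    "",
    "",
]

counts = Counter(classify(r) for r in referers)
for label, n in counts.most_common():
    print(f"{label:7s} {n:3d}  {n / len(referers):.0%}")
```

Matching on host fragments is crude, and a fuller version would also want to pull the query terms out of the search URLs to see which subjects bring visitors in.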

Example:

Although I'm not sure how persistent this sort of system behavior is, this weblog is currently (as of early October, 2004) receiving high (Google and other) search rankings on subjects in which the author is barely a novice.

The rankings this weblog is receiving from Google (as well as other competing search engines) for certain specific queries, for example queries on Oliver Williamson (see Ref.1) and on Chester Barnard (see Ref.2), truly amaze me. As of earlier this week, I've consistently been ranked third on both (and their variant orderings) on Google. See Ref.1.1 and Ref.2.1 for the relevant Google searches.

I may have had the good fortune of having studied with Oliver Williamson for a very short period of time, but I'm still a novice learner of his ideas. I may have studied portions of Chester Barnard's classical book, because Williamson recommended it, but I do not deserve to be read diligently as serious commentary on either. So, why is it that what I have written about both is receiving high Google rankings? Surely, I myself know better-written material on both topics.

Questions:

What's broken down? What's amiss about search, whether of the Google variety or not? Why do we even feel that we get anything relevant when we perform a search on the Web? How much better material are we actually missing if we limit ourselves to the findings of a search engine?

A Modest Discovery:

Web search and information retrieval fail us more often than we know or realize!

2004-10-01 10:28:30.0



Disclaimer

I work at Sun Microsystems. The opinions expressed here are purely my own, and neither Sun nor any other party necessarily agrees with them.


Entry Statistics

Entries: 1246
Comments: 919




Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 2.5 License.
© Masood Mortazavi
This is a personal weblog, I do not speak for my employer.