Metadata and the Semantic Web
Today I've got an automated e-mail from a software called "Link Manager", telling me that a hyperlink inside one of the books I've published on docs.sun.com isn't valid any more and should be replaced or removed.
This is a nice and welcome service, provided by an external (external to my book text) program. I had to register the book right before publishing, so the Link Manager software did know about the book and the hyperlinks within the text.
Now, there are millions of text documents, online and offline, published on the web or in an intranet, or not published at all, and they all contain information that needs to be kept up to date. Wouldn't it be good to have a type of Link Manager service built into every document? A function that automatically cares for all the information inside your text documents?
How often did you think, while writing a paragraph: "This reference certainly will expire some day, so I must come back here regularly and have a look."
Or you might have a comparison chart with data that will be different in some years, still you want to publish the chart now and not be forced to update it manually ever so often.
And the "Link Manager" service can be valuable far beyond caring for hyperlinks. How often do you write a paragraph and think: "Here I must ask person ABC if this is really right, and there I will insert an image of XYZ as soon as I get one that is free to use", and so on.
Currently, you can insert a note into your text, but this doesn't look good, and it is of no big help if your documents contain many thousand notes. What we need is a method of attaching additional structured information to any text. Information that is visible to tools software, or by request of the user, but that is normally hidden from the human reader.
For a hyperlink in your text, you would need additional information about date and time of insertion, date and time of next validation, may be alternative link to be used in case the original expired, source of the information, may be copyright information and license, author contact information, and much more.
With all these information, it would be possible to have some software tool scanning your documents regularly, checking if everything is OK, changing what is allowed to be changed automatically, or reporting problems to your attention.
Of course, those methods only work on open standard formatted documents, as ODF, and the meta data must be attached following an open standard, too.
Fortunately, such open standards exist. The meta data format is proposed by OASIS.
If you are into technical writing, publishing documentations, you will need standardized metadata in your documents. Every document should have a feedback link, and creation data, page count, copyright information, and more. In Writer, just click Tools > Bibliography Database to see how you can input and edit metadata today. The method used in OpenOffice.org to link to the bibliography database was just a very first step, using proven and available technology, on the road to real RDF metadata. Entering and editing metadata will certainly be more comfortable later.
Many software visionaries and engineers are working on metadata standards, for example using Dublin Core technology, or other developments as in SKOS (Simple Knowledge Organization System).
The OpenOffice.org software is already prepared for meta data, see the Blog entry by Svante Schubert and the presentation he held at the OpenOffice.org Conference 2007 in Barcelona.
So, OpenOffice.org is prepared to handle meta data in open standard documents, as proposed in the RDF specification.
As you can see from the comments to Svante's blog, the meta data support is a hot topic. Be prepared to meet some dragons here.
This is because there is enormous potential with metadata. Clever designed metadata can transform your text documents, spreadsheets, presentations into almost everything. Using metadata and special metadata aware hardware and software, your web based documents can be used to automate almost every aspect of your reader's life. The automatic programming of video recorders is just the least intrusive effect. A multitude of services can be developed to build upon the metadata, and the financial aspects might look like the next virtual gold rush.
Time will tell. Click the Semantic Web logo to read more about the future of the web.
A nice magazine article by Tim Berners-Lee, the "inventor of the World Wide Web", and others is here. By the way, that old magazine text shows the need of more intelligent link managers: It names an URL of a web page that contains some meta tags, but if you browse to that URL, you find that this page meanwhile got changed by its owner. It's a pity: The owner had no metadata tag sticked automatically onto his web page reminding him that the URL is used in an example about meta tags. In a perfect world, he would have registered his web page to a metadata maintaining service that would have told him that another registered web page links to this web page.
And be happy that your ODF documents will not become outdated. ODF is an open standard, a good citizen of the World Wide Web.








