Today read RFC 2396 about URI syntax. I found RFC is subtle, for example:
In general, a character is reserved if the semantics of the URI changes if the character is replaced with its escaped US-ASCII encoding. -- section 2.2or
Implementers should be careful not to escape or unescape the same string more than once, since unescaping an already unescaped string might lead to misinterpreting a percent data character as another escaped character, or vice versa in the case of escaping an already escaped string. -- section 2.4.2These can be real bugs in specific implementation! RFC is RFC. If one get to read it, read it very carefully :-)
Posted by Tim Bray on November 09, 2004 at 11:57 PM GMT+08:00 #