Re: Algorithm or ideas wanted for creative text parsing

  • From: "Richard Ji" <richard.c.ji@xxxxxxxxx>
  • To: rjamya@xxxxxxxxx
  • Date: Mon, 10 Apr 2006 13:59:13 -0400

Raj,

.tv is 2 characters yet its' not a ccTLD, you need to get a list of
valid ISO two letter country codes.

Richard Ji

On 4/10/06, rjamya <rjamya@xxxxxxxxx> wrote:
> Thanks SF and all
>
> maybe here is what I can do ...
>
> 1. if the domain is numeric, take it as it is
> 2. if the TLD (i.e. the last piece) is 3 or more characters, you take
> last 2 pieces
>     (this will cover com,org,edu,name,info,museum etc)
> 3. if the last piece is 2 characters (most likely a ccTLD), take last 3 pieces
>     (i.e. il, br, ca, uk etc)
>
> hmmm ... looks promising, am I missing anything?
--
//www.freelists.org/webpage/oracle-l


Other related posts: