View Single Post
  #6  
Old March 11th, 2010, 01:02 PM

Psycho Psycho is offline
Captain
 
Join Date: Jan 2008
Posts: 913
Thanks: 21
Thanked 53 Times in 33 Posts
Psycho is on a distinguished road
Default Re: New HoF brain storm

I tried writing a parser in Ruby some time ago to parse the thread and get the HoF statistics, but it proved to be not so easy. The problem is that you don't have a standard format for posts. Also some player names are written in two or three different variants - with and without capital letters, with and without spaces, etc. You'll probably have to enter a lot of data manually.

And if you're doing it, well then XML is probably the best way. You can easily write an XSLT to transform it to a human readable html file for example.
Reply With Quote