Grub is currently working away on many people's computers and is quickly filling up the servers with data. According to Jeremie on the mailing list, we will soon get our first download of results, though I'm not sure how soon. However, Grub apparently isn't the only crawler being used. According to this edit, Heritrix is currently being tested. Heritrix is the open source crawler used by Archive.org for their Wayback Machine.
More coming soon.
(PS – want access to post on the blog? Email me: newsmarkie+WP@gmail.com)