[TriLUG] wget puzzle

crimsun at fungus.sh.nu crimsun at fungus.sh.nu
Fri May 2 14:45:05 EDT 2003


On Fri, May 02, 2003 at 02:23:54PM -0400, Andrew Perrin wrote:
> wget -r www.ussg.iu.edu/hypermail/linux/kernel
> 
> but for some reason it downloads only the directories containing messages
> from 2002 on (0201.*-->0305.*).  The directories are present and readable
> on the original site. Any ideas why this might happen?

There's a robots.txt with this section:

User-agent: *
Disallow: /hypermail/linux/kernel/95
Disallow: /hypermail/linux/kernel/96
Disallow: /hypermail/linux/kernel/97
Disallow: /hypermail/linux/kernel/98
Disallow: /hypermail/linux/kernel/99
Disallow: /hypermail/linux/kernel/00
Disallow: /hypermail/linux/kernel/01

Hope that helps.

-Dan

-- 
Dan Chen                crimsun at fungus.sh.nu
GPG key: www.unc.edu/~crimsun/pubkey.gpg.asc



More information about the TriLUG mailing list