[TriLUG] Data manipulation over Samba

Jim Tuttle jtuttle at prairienet.org
Tue May 22 07:30:42 EDT 2007


So, this has been bothering me. I'm hoping someone has an answer and,
perhaps, a reference.

Ok, I'm running some python data processing scripts against an
orthophoto collection residing on a disk array in the basement.  There
are about 4,300 images each about 76MB.  There are several smaller files
with each image.  Part of the processing includes copying each file to
another partition on our 14TB ATABeast.  The question is this: Is any of
this data moving over the network to my machine?

The processing is taking forever.  215 images in 8 hours.  I wondered if
the images are being read into memory by my machine then written to the
other partition on the array.  I have this fantasy that python tells the
processor on the disk array to do the copying, but I imagine that isn't
true.  To make matters worse, there are several connections through
which this data traverses.  The array is mounted via fiber channel to a
Solaris cluster which offers it to a linux machine in the cube next to
me via NFS and I'm mounting that via samba on my desktop.

I could have and probably should have run this on the intermediate
machine, but wasn't thinking last night.  Neither the ATABeast nor the
Solaris cluster have python installed and that's a non-starter.

Thanks,
Jim
-- 
--
---Jim Tuttle
------------------------------------------------------
url: http://www.prairienet.org/~jtuttle/
PGP key: http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x69B69B08




More information about the TriLUG mailing list