[TriLUG] OT: Standardized Data Crunching

MG mgmonza at gmail.com
Mon Feb 11 16:05:55 EST 2008



Tom Roche wrote:
> Ah, yes: a central catalog of data. Good luck with that :-) Phil
> Rhodes is quite correct that RDF et al provide the means to provide
> standardized global-scale repositories; unfortunately they don't
> provide the funding to do that.


Just to give you an idea of what's involved with one application that 
has gathered and presented one type of data, here's the EPA's watershed 
database:


http://www.epa.gov/storet/

There are plans for other types, but only the watershed data seems to be 
there in any quantity.


Note that the EPA imposes a standard format on the data before it's 
uploaded from its public sources.  The public users, who are also the 
source of the data, do the work of pre-formatting it before it goes into 
the network.  While that's doable for the users at each node, it would 
be a gargantuan job at the main database end.  Standardizing 
pre-existing data is one of the biggest hurdles to this type of project.


MG


