[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: XML data sets with (known) data quality problems
> In order to test exhaustively this library, we need to have XML data sets > that have data quality problems known a priori. > By data quality problems, we mean: missing values, misspellings, synonyms, > values out of domain, approximate duplicates, etc. Government data: http://data.gov.uk/data I did a short contract for 'LinkedGov' a while back (http://linkedgov.org/), it's their goal to make the data clean and usable, so you might want to get in touch with them. -- Andrew Welch http://andrewjwelch.com
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] |
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|