| Philip's profileNarnainBlogLists | Help |
|
|
October 19 Cleaning data is not always a good thingI came across a post from Claudia Imhoff. I totally agree that in some cases cleaning data also destroys some of information that can be derived from dirty data, and can destroy the audit trail. I usually insist on the option of minimal duplicates, this is always useful when converting things like dates and numbers from flat files, especially as the flat file data type is usually character based and that conversion process is rarely 100% reliable either due to dirty data such as 31 Feb 2007, or poor meta-data, is 05/04/2007 the 5th March 2007 or 4th May 2007. Depending on the localisation of the tool generating the data it could be either. I really like the sound of ABC tables and would like to hear from anyone with experience of implementing this, or if anyone knows of any tools for achieving such a solution. |
|
|