<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.3" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Some rare common sense</title>
	<link>http://andyonenterprisesoftware.com/2006/07/some-rare-common-sense/</link>
	<description>Andy Hayler, founder of Kalido and The Information Difference, gives his views on the enterprise software market. Issues covered include data warehousing, master data management, business intelligence and data quality.</description>
	<pubDate>Tue, 14 Oct 2008 10:54:38 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.3</generator>

	<item>
		<title>by: Andy Hayler</title>
		<link>http://andyonenterprisesoftware.com/2006/07/some-rare-common-sense/#comment-110</link>
		<pubDate>Thu, 20 Jul 2006 10:32:04 +0000</pubDate>
		<guid>http://andyonenterprisesoftware.com/2006/07/some-rare-common-sense/#comment-110</guid>
					<description>You raise an important distinction.  You should NEVER lose track of valid data that has simply become out of date e.g. when a reorganisation happens you want to expire the old relationship and add a new one.  Ideally all relationships should be time-stamped in thsi way, allowing you to recreate past history (this is the way that Kalido does it, for example). There is no reason why this cannot be achieved in custom warehouses also.

What I meant was data that was not yet validated as being correct i.e. has yet to be loaded into the warehouse.  In the Kalido philosophy at least, data should be put into a staging area prior to loading, and then validated against the business rules of the warehouse. Genuinely bad data should never be entered into the warehouse or it will destroy its integrity and credibility with the business users.  If they see a report where the numbers add up wrongly then they will be reluctant to trust the warehouse again.  This &quot;incomplete&quot; or &quot;unvalidated&quot; data does have a place in a master data repository though, as improving it to &quot;golden copy&quot; quality is part of the master data management process. Only when master data is validated should it be fed into the warehouse.</description>
		<content:encoded><![CDATA[<p>You raise an important distinction.  You should NEVER lose track of valid data that has simply become out of date e.g. when a reorganisation happens you want to expire the old relationship and add a new one.  Ideally all relationships should be time-stamped in thsi way, allowing you to recreate past history (this is the way that Kalido does it, for example). There is no reason why this cannot be achieved in custom warehouses also.</p>
<p>What I meant was data that was not yet validated as being correct i.e. has yet to be loaded into the warehouse.  In the Kalido philosophy at least, data should be put into a staging area prior to loading, and then validated against the business rules of the warehouse. Genuinely bad data should never be entered into the warehouse or it will destroy its integrity and credibility with the business users.  If they see a report where the numbers add up wrongly then they will be reluctant to trust the warehouse again.  This &#8220;incomplete&#8221; or &#8220;unvalidated&#8221; data does have a place in a master data repository though, as improving it to &#8220;golden copy&#8221; quality is part of the master data management process. Only when master data is validated should it be fed into the warehouse.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Vincent McBurney</title>
		<link>http://andyonenterprisesoftware.com/2006/07/some-rare-common-sense/#comment-109</link>
		<pubDate>Thu, 20 Jul 2006 08:21:17 +0000</pubDate>
		<guid>http://andyonenterprisesoftware.com/2006/07/some-rare-common-sense/#comment-109</guid>
					<description>Andy, when you talk about the DW only having validating data, how do you get a complete view of data in the DW if you drop off unclean data? For example what happens when a key item such as a customer is unclean and keeping that out of the DW loses all history, transactions and revenue from that customer.

I have been on some DW projects where the data had to go through. Incomplete records had default or dummy values set. Missing referential integrity meant augmenting key tables with dummy records. The users wanted all the measures to be present in the reports and that meant loading unclean data to support those measures.</description>
		<content:encoded><![CDATA[<p>Andy, when you talk about the DW only having validating data, how do you get a complete view of data in the DW if you drop off unclean data? For example what happens when a key item such as a customer is unclean and keeping that out of the DW loses all history, transactions and revenue from that customer.</p>
<p>I have been on some DW projects where the data had to go through. Incomplete records had default or dummy values set. Missing referential integrity meant augmenting key tables with dummy records. The users wanted all the measures to be present in the reports and that meant loading unclean data to support those measures.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>
