<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Data Architecture - Notes from the Trenches</title>
	<atom:link href="http://rob123xyz.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://rob123xyz.wordpress.com</link>
	<description>A Data Architecture warrior's war stories and battle scars.</description>
	<lastBuildDate>Sun, 15 Feb 2009 19:24:40 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='rob123xyz.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Data Architecture - Notes from the Trenches</title>
		<link>http://rob123xyz.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://rob123xyz.wordpress.com/osd.xml" title="Data Architecture - Notes from the Trenches" />
	<atom:link rel='hub' href='http://rob123xyz.wordpress.com/?pushpress=hub'/>
		<item>
		<title>The Changing Face of Data Warehouse Design</title>
		<link>http://rob123xyz.wordpress.com/2009/02/14/datawarehouse/</link>
		<comments>http://rob123xyz.wordpress.com/2009/02/14/datawarehouse/#comments</comments>
		<pubDate>Sat, 14 Feb 2009 16:35:05 +0000</pubDate>
		<dc:creator>rob123xyz</dc:creator>
				<category><![CDATA[Data Architecture]]></category>
		<category><![CDATA[Data Warehouse]]></category>
		<category><![CDATA[Data Warehouse Architecture]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[Data Warehouse architecture is no longer strictly a top-down versus bottom-up strategy decision. In today's economic climate, and with mergers and acquisitions and restructuring and re-organizing becoming the rule rather than the exception, projects to add subject areas to the so-called enterprise data warehouse more and more often are involving moving data to the warehouse platform before understanding all (if any) of the business requirements for doing so.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rob123xyz.wordpress.com&amp;blog=6583385&amp;post=1&amp;subd=rob123xyz&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Unless your organization is swimming cash (and whose is these days!), you&#8217;ve probably noticed that data warehouse and BI project timelines and budgets have been shrinking faster than the global headcount. It wasn&#8217;t very long ago that we data architects (and other professionals working on so-called enterprise data warehouses) actually had what seemed a reasonable amount of time at the beginning of a project to meet with business analysts and (gasp) real business people, to talk about the business and do a pretty adequate job of gathering and documenting analytical information requirements and then translating them into a logical data architecture.  In many data warehouse shops, those days are gone.</p>
<p>The paradigm has been shifting to moving new sources of data onto the data warehouse platform and figuring out how to integrate and use them later on. To a data warehouse architect who&#8217;s been in the business for more than a year or two, working under these prerogatives is akin to hearing fingernails being raked across a chalkboard. The holy mantram of data warehousing has always been that the integration is the message. We&#8217;ve been dedicating our careers to eliminating stovepipe analytical reporting databases and providing the business with an integrated solution to their business intelligence needs. We&#8217;ve debunked off-the-shelf packaged data warehouses as not providing significant competitive advantage. We&#8217;ve talked about the risks of federated data warehouses. Stovepipe data marts? Blasphemy!  I don&#8217;t know about anyone else, but every data warehousing book I&#8217;ve read, and every seminar I&#8217;ve ever attended has pounded home the point that this approach is a surefire recipe for disaster. And yet here we are, being asked by the full chain of command, from project management to the CIO, to essentially do things we&#8217;ve been trained to fight against &#8212; to get data loaded and then figure out how to integrate it and how the business might use it later on. I reckon that too many years of a perceived &#8220;ready, aim, aim, aim&#8221; approach has swung the pendulum to the &#8220;ready, fire, aim&#8221; end of the spectrum at many organizations.</p>
<p>So after you&#8217;ve taken your dings for expressing your concerns about resources and timelines, and after you&#8217;ve pulled out a more than a few hairs trying to figure out how to match up legacy keys coming from five different transaction systems (while still waiting for a CDI implementation that&#8217;s been years in the works already) you grab a triple latte, and sit down and roll up your sleeves and try to figure out where you can bend the lines on what you know are data warehouse architecture best practices.</p>
<p>Some of the notable changes you may see on such a project (aside from the frighteningly compressed timeline, and absence of business people at requirements sessions) &#8212; and yes I am being just a touch cynical in my portrayal:</p>
<ul>
<li>Mega design workshops where the IT business analysts, data architects, DBA&#8217;s and ETL developers are all in the same room at the same time to save the effort of translating business requirements into logical models and then physical designs.</li>
<li>The project leader is in attendance at a lot of meetings, simply to urge the group to &#8220;move on&#8221; when discussion of a difficult issue goes on for more than ten minutes.</li>
<li>Standards tend to become guidelines, and things like mis-named objects are quickly written off as &#8220;sweating the small stuff&#8221;. The old rule about commenting everything that goes into the database changes to commenting when you know what something is without having to call someone up or send out an email and wait for a reply.</li>
<li>The project manager tends to talk about fixing design anomalies in the &#8220;semantic layer&#8221;. Making sense of the design and communicating it to the users becomes a metadata issue.</li>
</ul>
<p>Some things you absolutely should not budge on:</p>
<ul>
<li>The need to have a common business model forming the architectural framework. This entails constructing conceptual and logical and physical data models prior to the first table being built or the first line of ETL (or in this case ELT) code being written. Without such a framework, you most surely are destined to have your name and reputation attached to what is known in the biz as a &#8220;data outhouse&#8221;.</li>
<li>The tendency of the DBA team to &#8220;move on&#8221; and modify the physical database, upon learning things from the source data &#8212; and not feeding this back into the conceptual and logical data models. When the data models become throw-away design artifacts and the DBMS catalogue and DDL scripts become the only up-to-date design metadata &#8212; be very afraid &#8212; afraid of managing change in the future!</li>
<li>Sometimes you just have to push to have the schedule relaxed when it is just too unrealistic. It might risk having you branded as a pessimist &#8212; a negative thinker, but in the long run you must go on the record at the very least in communicating your issues and concerns to project management. Once you are on the record, they know that they will have to share the burden of blame if the end result is a useless bucket-o-data. The risk of not identifying the issues is that your reputation as a competent data architect can be badly tarnished.</li>
</ul>
<p>One final bit of advice: Don&#8217;t let the pressures get to you. I&#8217;ve had to learn to accept that one of your responsibilities as data architect is to present issues and concerns about both the overall architecture as well as the specifics of the project to management. Ultimately, management makes the calls and you have to do the best you can with what you&#8217;ve got. As long as they understand the implications of first assembling that &#8216; bucket-o-data&#8217;, and then trying to organize and integrate it later on, and agree to that approach at the outset, there&#8217;s no point in getting overly stressed out over it.  Trust me, I&#8217;ve been down that road. Go home and play with the kids or the dog, get out on the sportbike and carve some twisting backroads, go for a jog, play your guitar, go out for a nice dinner with your better half. Remind yourself that it&#8217;s great to have a job with so many intellectual challenges. Thinking of the work as &#8220;bringing the operational data stores onto the data warehouse platform&#8221; can help to rationalize it. And then there&#8217;s always that miraculous semantic layer and wonderous metadata repository to fix everything up right as rain.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rob123xyz.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rob123xyz.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rob123xyz.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rob123xyz.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rob123xyz.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rob123xyz.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rob123xyz.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rob123xyz.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rob123xyz.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rob123xyz.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rob123xyz.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rob123xyz.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rob123xyz.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rob123xyz.wordpress.com/1/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=rob123xyz.wordpress.com&amp;blog=6583385&amp;post=1&amp;subd=rob123xyz&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://rob123xyz.wordpress.com/2009/02/14/datawarehouse/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/adb39a01225794ba8f9adecebe9a90e7?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rob123xyz</media:title>
		</media:content>
	</item>
	</channel>
</rss>
