Jill Dyche on 2012

In part 3 of the series for predictions for 2012, here is Jill Dyche, Baseline Consulting/DataFlux.

Part 2 was Timo Elliot, SAP at http://www.decisionstats.com/timo-elliott-on-2012/ and Part 1 was Jim Kobielus, Forrester at http://www.decisionstats.com/jim-kobielus-on-2012/

Ajay: What are the top trends you saw happening in 2011?

 

Well, I hate to say I saw them coming, but I did. A lot of managers committed some pretty predictable mistakes in 2011. Here are a few we witnessed in 2011 live and up close:

 

1.       In the spirit of “size matters,” data warehouse teams continued to trumpet the volumes of stored data on their enterprise data warehouses. But a peek under the covers of these warehouses reveals that the data isn’t integrated. Essentially this means a variety of heterogeneous virtual data marts co-located on a single server. Neat. Big. Maybe even worthy of a magazine article about how many petabytes you’ve got. But it’s not efficient, and hardly the example of data standardization and re-use that everyone expects from analytical platforms these days.

 

2.       Development teams still didn’t factor data integration and provisioning into their project plans in 2011. So we saw multiple projects spawn duplicate efforts around data profiling, cleansing, and standardization, not to mention conflicting policies and business rules for the same information. Bummer, since IT managers should know better by now. The problem is that no one owns the problem. Which brings me to the next mistake…

 

3.       No one’s accountable for data governance. Yeah, there’s a council. And they meet. And they talk. Sometimes there’s lunch. And then nothing happens because no one’s really rewarded—or penalized for that matter—on data quality improvements or new policies. And so the reports spewing from the data mart are still fraught and no one trusts the resulting decisions.

 

But all is not lost since we’re seeing some encouraging signs already in 2012. And yes, I’d classify some of them as bona-fide trends.

 

Ajay: What are some of those trends?

 

Job descriptions for data stewards, data architects, Chief Data Officers, and other information-enabling roles are becoming crisper, and the KPIs for these roles are becoming more specific. Data management organizations are being divorced from specific lines of business and from IT, becoming specialty organizations—okay, COEs if you must—in their own rights. The value proposition for master data management now includes not just the reconciliation of heterogeneous data elements but the support of key business strategies. And C-level executives are holding the data people accountable for improving speed to market and driving down costs—not just delivering cleaner data. In short, data is becoming a business enabler. Which, I have to just say editorially, is better late than never!

 

Ajay: Anything surprise you, Jill?

 

I have to say that Obama mentioning data management in his State of the Union speech was an unexpected but pretty powerful endorsement of the importance of information in both the private and public sector.

 

I’m also sort of surprised that data governance isn’t being driven more frequently by the need for internal and external privacy policies. Our clients are constantly asking us about how to tightly-couple privacy policies into their applications and data sources. The need to protect PCI data and other highly-sensitive data elements has made executives twitchy. But they’re still not linking that need to data governance.

 

I should also mention that I’ve been impressed with the people who call me who’ve had their “aha!” moment and realize that data transcends analytic systems. It’s operational, it’s pervasive, and it’s dynamic. I figured this epiphany would happen in a few years once data quality tools became a commodity (they’re far from it). But it’s happening now. And that’s good for all types of businesses.

 

About-

Jill Dyché has written three books and numerous articles on the business value of information technology. She advises clients and executive teams on leveraging technology and information to enable strategic business initiatives. Last year her company Baseline Consulting was acquired by DataFlux Corporation, where she is currently Vice President of Thought Leadership. Find her blog posts on www.dataroundtable.com.

Decisionstats Update

A List of All Decisionstats Interviews Till Now-

http://goo.gl/Xpo5y

 

Date Name of Interviewee Designation and Organization Url
6/26/2011 Elissa Fink VP Tableau Software http://decisionstats.com/interview-elissa-fink-vp-tableau-software/
6/14/2011 Gaurav Vohra Jigsaw Academy http://decisionstats.com/interview-gaurav-vohra-jigsaw-academy/
5/27/2011 Rob La Gesse Chief Disruption Officer ,Rackspace Hosting. http://decisionstats.com/interview-with-rob-la-gesse-chief-disruption-officer-rackspace/
5/24/2011 Sandro Saitta Data Mining Blog http://decisionstats.com/interview-top-data-mining-blogger-on-earth-sandro-saitta/
2011 2/21/2011 Anne Milley Senior Director JMP http://decisionstats.com/interview-anne-milley-jmp/
2/11/2011 David Katz Senior Analyst , Dataspora http://decisionstats.com/interview-david-katz-dataspora-david-katz-consulting/
1/18/2011 Carole Anne Matignon Sparkling Logic http://decisionstats.com/carole-ann%E2%80%99s-2011-predictions-for-decision-management/
1/12/2011 Luis Torgo Author,Data Mining with R http://decisionstats.com/interview-luis-torgo-author-data-mining-with-r/
1/8/2011 Ajay Ohri Decisionstats http://decisionstats.com/interview-ajay-ohri-decisionstats-com-with-dmr/
2010 11/29/2010 Timo Elliott SAP, Business Objects http://decisionstats.com/brief-interview-timo-elliott/
11/27/2010 Jill Dyche founder ,Baseline Consulting (acquired by DataFlux/SAS Institute) http://decisionstats.com/short-interview-jill-dyche/
11/25/2010 James Kobielus Senior Analyst, Forrester. http://decisionstats.com/brief-interview-with-james-g-kobielus/
11/22/2010 Jamie Nunnelly Communications Director of National Institute of Statistical Sciences http://decisionstats.com/interview-jamie-nunnelly-niss/
11/14/2010 James Dixon founder ,Pentaho http://decisionstats.com/pentaho/
10/12/2010 John F Moore CEO The Lab http://decisionstats.com/interview-john-f-moore-ceo-the-lab/
10/5/2010 Michael J. A. Berry Data Miners, Inc http://decisionstats.com/interview-michael-j-a-berry-data-miners-inc/
9/30/2010 Dean Abbott Abbott Analytics http://decisionstats.com/interview-dean-abbott-abbott-analytics/
8/23/2010 Stephanie McReynolds Director Product Marketing, AsterData http://decisionstats.com/stephanie/
8/3/2010 David Smith VP, Revolution Analytics http://decisionstats.com/q-a-with-david-smith-revolution-analytics/
6/29/2010 Bob Muenchen Author, R For Stata http://decisionstats.com/interview-r-for-stata-users/
3/13/2010 Interview Jeanne Harris Co-Author -Analytics at Work and Competing with Analytics
1/12/2010 Interview Hadley Wickham R Project Data Visualization Guru
1/6/2010 Audio Interviews -Dr. Colleen McCue National Security Expert
12/30/2009 Interview Sarah Blow – Girly Geekdom Founder
12/7/2009 Interview Donald Farmer Microsoft
11/24/2009 M2009 Interview Peter Pawlowski AsterData
11/19/2009 Interview Phil Rack WPS Consultant and Developer
11/10/2009 Data Mining 2009 Interviews- Terry Whitlock, BlueCross BlueShield of TN
11/2/2009 Audio Interview Anne Milley , Part 1
10/21/2009 Interview Carole Jesse Experienced Analytics Professional
10/5/2009 Interview Michael Zeller,CEO Zementis on PMML
10/5/2009 Interview Ken O Connor Business Intelligence Consultant
10/1/2009 Interview Shawn Kung Sr Director Aster Data
9/28/2009 Interview Thomas C. Redman Author Data Driven
9/25/2009 Interview Augusto Albeghi (Straycat) —Founder Straysoft
9/20/2009 Interview Evan Levy Baseline Consulting
9/18/2009 Interview James Taylor Decision Management Expert (Updated)
9/16/2009 Interview Timo Elliott SAP
9/14/2009 Interview Professor John Fox Creator R Commander
9/10/2009 Interview Stephen Baker Author The Numerati
9/9/2009 Interview Jeff Bass, Bass Institute (Part 2)
9/9/2009 Interview Neil Raden Founder of Hired Brains Inc
8/29/2009 Interview Dylan Jones DataQualityPro.com
8/13/2009 Interview Gregory Piatetsky KDNuggets.com
8/13/2009 Interview Tasso Argyros CTO Aster Data Systems
8/13/2009 Interview Steve Sarsfield Author The Data Governance Imperative
8/11/2009 Interview Dr Usama Fayyad Founder Open Insights LLC
7/28/2009 Interview Karen Lopez Data Modeling Expert
7/28/2009 Interview John Sall Founder JMP/SAS Institute
2009 7/16/2009 Jim Harris OCDQ Blog http://www.decisionstats.com/2009/07/16/interview-jim-harris-data-quality-evangelist/
7/14/2009 Eric Siegel Founder, Predictive Analytics World http://www.decisionstats.com/2009/07/14/interview_eric-siege/
7/10/2009 Gary D Miner Author ‘Handbook of Statistical Analysis and Data Mining Applications’ http://www.decisionstats.com/2009/07/10/interview-gary-d-miner-author-and-professor/
7/3/2009 John F Moore CTO, Swimfish http://www.decisionstats.com/2009/07/03/interview-john-moore-cto-swimfish/
7/2/2009 Peter J Thomas Award Winning BI Expert http://www.decisionstats.com/2009/07/02/peter-james-thomas-bi/
6/30/2009 Alison Bolen Editor- in- Chief SAS.COM http://www.decisionstats.com/2009/06/30/interview-alison-bolen-sas-com/
6/30/2009 Jill Dyche Co- Founder, Baseline Consulting http://www.decisionstats.com/2009/06/30/interview-jill-dyche-baseline-consulting/
6/18/2009 Gary Cokins Senior Leader, Performance Management SAS Institute http://www.decisionstats.com/2009/06/18/interview-gary-cokins-sas-institute/
6/9/2009 Karl Rexer President, Rexer Analytics http://www.decisionstats.com/2009/06/09/interview-karl-rexer-rexer-analytics/
6/5/2009 Jim Daves CMO, SAS Institute http://www.decisionstats.com/2009/06/05/interview-jim-davis-sas-institute/
6/4/2009 Paul van Eikeren President and CEO, Blue Reference http://www.decisionstats.com/2009/06/04/inference-for-r/
5/29/2009 David Smith Director of Community, REvolution Computing http://www.decisionstats.com/2009/05/29/interview-david-smith-revolution-computing/
5/17/2009 Dominic Pouzin CEO, Data Applied http://www.decisionstats.com/2009/05/17/interview-dominic-pouzin-data-applied/
5/11/2009 Bruno Delahaye VP, KXEN http://www.decisionstats.com/2009/05/11/interview-kxen-bruno-delahaye/
5/4/2009 Ron Ramos Director, Zementis http://www.decisionstats.com/2009/05/07/interview-ron-ramos-zementis/
4/30/2009 Oliver Jouve VP, SPSS Inc http://www.decisionstats.com/2009/04/30/interview-spss-olivier-jouve/
4/21/2009 Fabian Dill Co- Founder, Knime.com http://www.decisionstats.com/2009/04/21/interview-knime-fabian-dill/
4/18/2009 Alicia Mcgreevey Head Marketing, Visual Numerics http://www.decisionstats.com/2009/04/18/interview-visual-numerics-alicia-mcgreevey/
3/27/2009 Francoise Soulie Fogelman VP, KXEN http://www.decisionstats.com/2009/03/27/interview-franoise-soulie-fogelman-kxen/
3/17/2009 Jon Peck Principal Software Engineer, SPSS Inc http://www.decisionstats.com/2009/03/17/interview-jon-peck-spss/
3/6/2009 Anne Milley Director of product marketing, SAS Institute http://www.decisionstats.com/2009/03/06/interview-with-anne-milley-sas-ii/
3/4/2009 Anne Milley Director of product marketing, SAS Institute http://www.decisionstats.com/2009/03/04/interview-anne-milley-sas-part-1/
2/3/2009 Phil Rack Creator, Bridge to R,and CEO Minequest http://www.decisionstats.com/2009/02/03/interview-phil-rack/
2/3/2009 Michael Zeller CEO, Zementis http://www.decisionstats.com/2009/02/03/interview-michael-zeller-ceozementis/
1/31/2009 Richard Schultz CEO, Revolution Computing http://www.decisionstats.com/2009/01/31/interviewrichard-schultz-ceo-revolution-computing/
1/21/2009 Bob Muenchen Author, R for SAS and SPSS Users http://www.decisionstats.com/2009/01/21/r-for-sas-and-spss-users/
1/13/2009 Dr Graham Williams Creator, Rattle GUI for R http://www.decisionstats.com/2009/01/13/interview-dr-graham-williams/
1/5/2009 Roger Haddad CEO, KXEN http://www.decisionstats.com/2009/01/05/interview-roger-haddad-founder-of-kxen-automated-modeling-software/
2008 9/26/2008 June Dershewitz VP, Semphonic http://www.decisionstats.com/2008/09/26/online-analytics-june-dershewitz/
9/4/2008 Vincent Granville Head, Analyticbridge http://www.decisionstats.com/2008/09/04/the-worlds-largest-analytics-networker/

 

 

Short Interview Jill Dyche

Here is brief one question interview with Jill Dyche , founder Baseline Consulting.

 

In 2010.

 

  • It was more about consciousness-raising in the executive suite—
  • getting C-level managers to understand the ongoing value proposition of BI,
  • why MDM isn’t their father’s database, and
  • how data governance can pay for itself over time.
  • Some companies succeeded with these consciousness-raising efforts. Some didn’t.

 

But three big ones in 2011 would be:

  1. Predictive analytics in the cloud. The technology is now ready, and so is the market—and that includes SMB companies.
  2. Enterprise search being baked into (commoditized) BI software tools. (The proliferation of static reports is SO 2006!)
  3. Data governance will begin paying dividends. Until now it was all about common policies for data. In 2011, it will be about ROI.

I do a “Predictions for the coming year” article every January for TDWI,

Note- Jill ‘s January TDWI article seems worth waiting for in this case.

About-

Source-http://www.baseline-consulting.com/pages/page.asp?page_id=49125

Partner and Co-Founder

Jill Dyché is a partner and co-founder of Baseline Consulting.  She is responsible for key client strategies and market analysis in the areas of data governance, business intelligence, master data management, and customer relationship management. 

Jill counsels boards of directors on the strategic importance of their information investments.

Author

Jill is the author of three books on the business value of IT. Jill’s first book, e-Data (Addison Wesley, 2000) has been published in eight languages. She is a contributor to Impossible Data Warehouse Situations: Solutions from the Experts (Addison Wesley, 2002), and her book, The CRM Handbook (Addison Wesley, 2002), is the bestseller on the topic. 

Jill’s work has been featured in major publications such as Computerworld, Information Week, CIO Magazine, the Wall Street Journal, the Chicago Tribune and Newsweek.com. Jill’s latest book, Customer Data Integration (John Wiley and Sons, 2006) was co-authored with Baseline partner Evan Levy, and shows the business breakthroughs achieved with integrated customer data.

Industry Expert

Jill is a featured speaker at industry conferences, university programs, and vendor events. She serves as a judge for several IT best practice awards. She is a member of the Society of Information Managementand Women in Technology, a faculty member of TDWI, and serves as a co-chair for the MDM Insight conference. Jill is a columnist for DM Review, and a blogger for BeyeNETWORK and Baseline Consulting.

 

Interview Ken O Connor Business Intelligence Consultant

Here is an interview with an industry veteran of Business Intelligence, Ken O Connor.

Ajay- Describe your career journey across the full development cycle of Business Intelligence.

Ken- I started my career in the early 80’s in the airline industry, where I worked as an application programmer and later as a systems programmer. I took a computer science degree by night. The airline industry was one of the first to implement computer systems in the ‘60s, and the legacy of being an early adaptor was that airline reservation systems were developed in Assembler. Remarkable as it sounds now, as application programmers, we wrote our own file access methods. Even more remarkable, as systems programmers, we modified the IBM supplied Operating System, originally known as the Airline Control Program (ACP), later renamed as Transaction Processing Facility (TPF). The late ‘80s saw the development of Global “Computer Reservations Systems” (CRS systems) including AMADEUS and GALILEO. I moved from Aer Lingus, a small Irish airline, to work in London on the British Airways systems, to enable the British Airways systems share information and communicate with the new Global CRS systems.

I learnt very important lessons during those years.

* The criticality of standards

* The drive for interoperability of systems

* The drive towards information sharing

* The drive away from bespoke development

In the 90’s I returned to Dublin, where I worked as an independent consultant with IBM on many data intensive projects. On one project I was lead developer in the IBM Dublin Laboratory on the development of the Data Replication tool called “Data Propagator NonRelational”. This tool automatically propagates updates made on IMS databases to DB2 databases. On this project, we successfully piloted using the Cleanroom Development Method, as part of IBM’s derive towards Six Sigma quality.

In the past 15 years I have moved away from IT towards the business. I describe myself as a Hybrid. I believe there is a serious communications gap between business users and IT, and this is a frequent cause of project failures. I seek to bridge that gap. I ensure that requirements are clear, measurable, testable, and capable of being easily understood and signed off by business owners.

One of my favorite programmes was Euro Changeover, This was a hugely data intensive programme. It was the largest changeover undertaken by European Financial Institutions. I worked as an independent consultant with the IBM Euro Centre of Competence. I developed changeover strategies for a number of Irish Enterprises, and was the End to End IT changeover process owner in a major Irish bank. Every application and every data store holding currency sensitive data (not just amounts, but currency signs etc.) had to be converted at exactly the same time to ensure that all systems successfully switched to euro processing on 1st January 2002.

I learnt many, many lasting lessons about data the hard way on Euro Changeover programmes, such as:

* The extent to which seemingly separate applications share operational data – often without the knowledge of the owning application.

* The extent to which business users use (abuse) data fields to hold information never intended for the data field.

* The critical distinction between the underlying data (in a data store) and the information displayed to a business user.

I have worked primarily on what I call “End of food chain” projects and programmes, such as Single View of Customer, data migrations, and data population of repositories for BASEL II and Anti Money Laundering (AML) systems. Business Intelligence is another example of an “End of food chain” project. “End of food-chain” projects share the following characteristics:

* Dependent on existing data

* No control over the quality of existing data they depend on

* No control over the data entry processes by which the data they require is captured.

* The data required may have been captured many years previously.

Recently, I have shared my experience of “Enterprise wide data issues” in a series of posts on my blog, together with a process for assessing the status of those issues within an Enterprise (more details). In my experience, the success of a Business Intelligence programme and the ease with which an Enterprise completes “End of food chain” data dependent programmes directly depends on the status of the common Enterprise Wide data issues I have identified.

Ajay -Describe the educational scene for science graduates in Ireland. What steps do you think governments and universities can do to better teach science and keep young people excited about it?

Ken- I am not in a position to comment on the educational scene for science graduates in Ireland. However, I can say that currently there are insufficient numbers of school children studying science in primary and 2nd level education. There is a need to excite young people about science. There is a need for more interactive science museums, like W5 in Belfast which is hugely successful. Kids love to get involved, and practical science can be great fun.

Ajay- What are some of the key trends in business intelligence that you have seen-

Ken- Since the earliest days of my career, I have seen an ever increasing move towards standards based interoperability of systems, and interchange of data. This has accelerated dramatically in recent years. This is the good news. Further good news is the drive towards the use of external reference databases to verify the accuracy of data, at point of data entry (See blog post on Upstream prevention by Henrik Liliendahl Sørensen). One example of this drive is cloud based verification services from new companies like Ireland based Clavis Technology.

The harsh reality is that “Old hardware goes into museums, while old software goes into production every night”. Enterprises have invested vast amounts of money in legacy applications over decades. These legacy systems access legacy data in legacy data stores. This legacy data will continue to pose challenges in the delivery of Business Intelligence to the Business community that needs it. These challenges will continue to provide opportunities for Data Quality professionals.

Ajay- What is going to be the next fundamental change in this industry in your opinion?

Ken- The financial crisis will result in increased regulatory requirements. This will be good news for the Business Intelligence / Data Quality industry. In time, it will no longer be sufficient to provide the regulator with ‘just’ the information requested. The regulator will want to see the process by which the information was gathered; the process controls, and evidence of the quality the underlying data from which the information was derived. This move will result in funding for Data Governance programmes, which will lead to increased innovation in our industry.

Ajay- Describe your startup Map My Business, your target customer and your vision for it.

Ken- I started MapMyBusiness.com as a “recession buster”. Ireland was hit particularly hard by the financial crisis. I had become over dependent on the financial services industry, and a blanket ban on the use of external consultants left me with no option but to reinvent myself. MapMyBusiness.com helps small businesses to attract clients, by getting them on Google page one. Having been burnt by an over dependence on one industry, my vision is to diversify. I believe that Data Governance is industry independent, and I am focussing on increasing my customer base for my Data Governance consultancy skills, via my company Professional IT Personnel Ltd.

Ajay- What do you do when not working with customers or blogging on your website?

Ken- I try to achieve a reasonable work/life balance. I am married with two children aged 12 and 10, and like to spend time with them, especially outdoors, walking, hiking, playing tennis etc. I am involved in my community, lobbying for improved cycling infrastructure in our area (more details). Ireland, like most countries, is facing an obesity epidemic, due to an increasingly sedentary lifestyle. Too many people get little or no exercise, and don’t have the time, willpower, or perhaps money, to regularly work out in a gym. By including “Active Travel” in our daily lives – by walking or cycling to schools and local amenities, we can get enough physical exercise to prevent obesity, and obesity related health problems. We need to make our cities, towns and villages more pedestrian and cyclist friendly, to encourage “active travel”. My voluntary work in this area introduced me to mapping (see example), and enabled me to set up MapMyBusiness.com.

Biography-

Ken O’Connor is an independent IT Consultant with almost 30 years of work experience. He specialises in Data: Data Migration, Data Population, Data Governance, Data Quality, Data Profiling…His  company is called Professional IT Personnel Ltd.

Ken started his blog (Ken O’ Connor Data Consultant) to share his experience and to learn from the experience of others.   Dylan Jones, editor of dataqualitypro, describe Ken as a “grizzled veteran”, with almost 30 years experience across the full development lifecycle.

Interview Steve Sarsfield Author The Data Governance Imperative

Here is an interview with Steve Sarsfield, data quality evangelist and author of Data Quality Imperative.


Ajay- Describe your early career to the present point. At what point did you decide to specialize or focus on data quality and data governance? What were the causes for it?


Steve- When I was growing up, not many normal people had aspirations of becoming data management professionals. Back in those days, we had aspirations to be NFL wide receivers, writers, and engineers,and lawyers.  Data management careers tend to find you.

My career path has wandered through technical support, technical writer and managing editor, consulting,and product management for Lotus development. I’ve been working for the past nine years at a major data quality vendor – the longest job I’ve had to date. The good news is that this latest gig has given me a chance to meet with a LOT of people who have been implementing data quality and data governance projects.

When you get involved with the projects, you’ll begin to realize the power it has. You begin to love data governance for the efficiencies it brings, and for the impact it will have on your organization as it becomes more competitive.


Ajay- Some people think data quality is a boring job and data governance is an abstract philosophy. How would you interest a young high school /college student, with the right aptitude, in taking a business intelligence career and be focused on it.


Steve- In my opinion if you promote a geeky view of data governance the message will tend to fall flat. If there’s one thing I have written most about, it is about bridging the gap between technology and business.Those who succeed in this field now and in the future will be people who are a bit of a jack-of-all-trades.

You need to be a good technologist, critical thinker, marketer, and strategist, and you need to use those skills every day to succeed. Leadership skills are also important, especially if you are trying to bootstrap a data governance program at your corporation. Those job attributes are not boring, they are challenging and exciting.

In terms of being persuasive about getting involved in a data career, it’s clear that data is not likely to decrease in volume in the coming years, quite the contrary, so your job will have a reasonable amount of security.  Nor will there be less of a need in the future for developing accurate business metrics from the data.

In my book, I talk about the fact that the decision of a corporation to move toward data governance is really a choice between optimism and fear. Your company must decide to either be haunted by a never-ending vision that there will only will be more data, more mergers and more complexity in the years to come, orthey will decide to take charge for a more hopeful future that will bring more opportunity, more efficiency and a more agile working environment. When you choose data governance as a career, you choose to provide that optimism for your employer.


Ajay-What are the salient points in your book Data Governance Imperative. Do you think data governance is an idea whose time has come.


Steve-The book is about the increasing importance of data to a business. As your company collects more and more data about customers, products, suppliers, transactions and billing, it becomes more difficult to accurately maintain that information without a centralized approach and a team devoted to the data management mission.

The book comes from discussions with folks in the business who are trying to get a data governance program started in their corporation.  They are the data champions who “get it”, but are yet to convince their management that data is crucial to the success of the company.

The fact is, there are metrics you can follow, processes that you can put in place, conversations that you can have, and technology that you can implement in order to make your managers and co-workers see the importance of data governance.  We know this because it has worked for so many companies who are far more advanced in managing their data than most.

The most evolved companies will have support from executive management and the entire company to define reusable processes for data governance and a center of excellence is formed around it. Much of the book is about garnering support and setting up the processes to prove enterprise data’s importance.  Only when you do that will your company evolve its data governance strategy.


Ajay- Garbage Data In and Garbage Data Analysis Out. What percentage of a BI installation budget goes to input data quality at data entry center. What is the kind of budget you would like it to be.


Steve- I’m sure this varies depending upon many factors, including the number of sources, age and quality of the source data, etc. Anecdotally, the percentage of budget five years ago was near zero. You really only saw realization of the problem LATE in the project, after the first data warehouse loading occurred. What has happened over the years is that we’ve gotten a lot smarter about this, perhaps as a result of our past failures. In the past, if the data worked well in the source systems it was assumed that it would work in the target.

A lot of those projects failed because the team incorrectly scoped the project with regard to the data integration. Today we have the wisdom and experience to know that this is not true.  In order to really assess our needs for data quality, we know we need to profile the data as one of the first tasks in the process.  This will help us create a more accurate timeline and budget and ensure management that weknow what we’re doing with regard to data integration and business intelligence.


Ajay- Do you think Federal Governments can focus stimulus spending smarter with better input data quality?


Steve- Believe it or not, I’m encouraged by the US Government’s plan on data quality. To varying degrees,Presidents Clinton, Bush and Obama have all supported plans for greater transparency and openness. To accomplish that, you have to govern data. In Washington, many government agencies now have a Chief Information Officer. The government is recruiting leading universities like MIT to work toward better data governance in government.  The sheer number of databases even within a single US government agencywill be a huge challenge, but the direction is good.

This year’s MIT Information Quality Symposium, for example, had a very solid government track with speakers from the Army, Air Force, Department of Defense, EPA, HUD, and National Institute of Health to name just a few.

Other than the US, it gets even cloudier.  There are governments ahead of the US, like UK and Germany, and those who still need to catch up.


Ajay- Name some actual anecdotes in which 1) bad data quality led to disaster 2) good data quality gave great insights


Steve- There are certainly plenty of typical examples I always like the unusual examples, like:

A major motorcycle manufacturer used data quality tools to pull out nicknames from their customer records. Many of the names they had acquired for their prospect list were from motorcycle events and contests where the entries were, shall we say, colorful. The name fields contained data like “John the Mad Dog Smith” or “Frank Motor-head Jones”. The client used the tool to separate the name from the nickname, making it a more valuable marketing list.

One major utility company used data quality tools to identify and record notations on meter-reader records that were important to keep for operational uses, but not in the customer billing record. Upon analysis of the data, the company noticed random text like “LDIY” and “MOR” along with the customer records. After somework with the business users, they figured out that LDIY meant “Large Dog in Yard” which was particularly important for meter readers. MOR meant “Meter in Right, which was also valuable. The readers were given their own notes field, so that they could maintain the integrity of the name and address while also keeping this valuable data. IT probably saved a lot of meter readers from dog bite situations.

Financial organizations have used data quality tools to separate items like “John and Judy Smith/221453789 ITF George Smith”. The organization wanted to consider this type of record as three separate records “John Smith” and “Judy Smith” and “George Smith” with obvious linkage between the individuals. This type of data is actually quite common on mainframe migrations.

A food manufacturer standardizes and cleanses ingredient names to get better control of manufacturing costs. In data from their worldwide manufacturing plants, an ingredient might be “carrots” “chopped frozen carrots” “frozen carrots, chopped” “chopped carrots, frozen” and so on. (Not to mention all the possible abbreviations for the words carrots, chopped and frozen.) Without standardization of these ingredients, there was really no way to tell how many carrots the company purchased worldwide.

There was no bargaining leverage with the carrot supplier, and all the other ingredient suppliers, until the data was fixed.In terms of disasters, I’d recommend the IAIDQ’s web site – IQ Trainwrecks.http://www.iqtrainwrecks.com/ The IAIDQ does a great job and I contribute when I can.


Ajay- What are the essential 5 things a CEO should ask his CTO to ensure good data quality in an enterprise.


Steve- What a great question. I can think of more than five, but let’s start with:


1) What is poor quality data costing us?
This should inspire your CTO to go out and seek problem areas in partnership with the business and ways to improve processes.

2) Do I have to make decisions on gut-feel, or should I trust the business intelligence you give our employees?  What confidence level do you have in our BI?

The CEO should be confident in the metrics delivered with BI and he should make sure the CTO has the same concerns.

3) Are we in compliance with all laws regarding our governance of data?

CEOs are often culpable for non-compliance, so he/she should be concerned about any laws that govern the company’s industry. Even in unregulated industries, organizations must comply with spam laws and “do not mail” laws for marketing.

4) Are you working across business units to work towards data governance, or is data quality done in silos?

When possible data quality should be a process that is reusable and able to be implemented in similar manner across business units.

5) Do you have the access to data you need?

The CEO should understand if any office politics are getting in the way of ensuring data quality and this question opens the door to that discussion.

Ajay- What does Steve Sarsfield do when not writing blogs and books.


Steve-These days, when I’m not thinking about data or my blog, I’m thinking about my fantasy football team and the upcoming season. I’ve got a ticket to the New England Patriots opening game vs the Buffalo Bills and I’m looking forward to it. On the weekends, you may find me playing a game of mafia wars on Facebook or cooking up a big pot of chili for the family.


Biography-


Steve Sarsfield is a Data governance business expert, speaker, author of The Data Governance Initiative ( at http://www.itgovernance.co.uk/products/2446 ) and blogger at http://data-governance.blogspot.com/. Product marketing professional  at a major data quality vendor and author of the book “The Data Governance Imperative”.He was Guest speaker at MIT Information Quality Symposium (July 2007 and July 2008),  at the International Association for Information and Data Quality (IAIDQ) Symposium (December 2006) and at SAP CRM 2006 summit.