logo

Research and Practical Experiences in the Use of Multiple Data Sources

Abstract

Executive Summary

Integrating data sources within one organization: benefits, barriers, and lessons

Integrating data sources across multiple organizations: benefits, barriers, and lessons

Introduction

The Research Issues: Using Integrated Multiple Data Sources

Integration Efforts Involving Different Data Sources within One Organization

Integration Efforts Involving Different Data Sources across Multiple Organizations

A Summary of Integrating Information from Diverse Data Sets

References

A Summary of Integrating Information from Diverse Data Sets


(1) Efforts involving organization-wide data integration: benefits, barriers, and lessons


Benefits
Organization-wide data integration tends to lead to the following benefits in the context of enterprise-level planning and decision making:


Data integration is necessary for data to serve as a common language for communication within an organization. Without data integration there will be increased processing costs and ambiguity between sub-units or divisions. Without data integration, there will be delays and decreased levels of communication, reductions in the amount of summarization, and greater distortion of meaning (Huber, 1982). Data integration facilitates the collection, comparison, and aggregation of data from various parts of an organization, leading to better understanding (Goodhue, et al., 1992), and improved enterprise-level planning and decision making when there are complex, interdependent problems.

Barriers
Data integration can have a positive impact on reducing costs by reducing redundant design efforts (Goodhue, et al., 1992). However, because multiple sub-units or divisions are involved, data integration can also increase costs by increasing the size and complexity of the design problem or increasing the difficulty in getting agreement from all concerned parties. These barriers were summarized by Goodhue et al. (1992) as:


Organization-wide data integration may result in a loss of local autonomy in the design and use of data. In addition, it may also involve a loss of local effectiveness. Over time, different sub-units may face different task complexity and environmental challenges of unanticipated local events (Sheth and Larson, 1990).

Lessons
The following lessons have been derived from the cases of organization-wide data integration (Goodhue, et al., 1992; Sheth and larson, 1990; Robertson, 1997):


(2) Efforts involving data integration across multiple organizations: benefits, barriers, and lessons


Benefits
Like the efforts involved in single organization, the use of multiple data sources provide improved communications and coordination across different organizations, both in the same and different sectors of the economy. In addition, these efforts involved in the cases also tend to lead to the following benefits (Clark County Recorder's Office, 1998; QMAS Report, 1997; SEI's MassCHIP Program):


Barriers
While there are clearly advantages, using an integrated approach across multiple organizations also presents a number of challenges. For example, obtaining data from other agencies is often difficult, and in many cases will be impossible. Culhane and Metraux (1998) summarized these limitations as:


Other barriers are also identified from the cases discussed above (QMAS Report, 1997):


Lessons
Based on the experiences in the cases where organizations are either in the same or multiple sectors of the economy, the following important lessons regarding the implementation of a comprehensive data integration project are identified (QMAS Report, 1997):


In addition, there are some important questions regarding the use of multiple data sources from external organizations (QMAS Report, 1997):


All these questions should be carefully addressed in the data integration across multiple organizations.

It is worth noting that several cases in this paper are in health care field, it probably indicates that health care is a leader in data integration efforts. It also seems to us that organization-wide data integration is done for operational reasons, while data integration across multiple organizations (at least in the cases) is done for research and evaluation purposes.