Category Archives: Data Integration

The Challenge of Removing Duplicate Data

Data duplication is not a problem that is specific to any particular industry.  Data often gets duplicated for a variety of reasons: manual entry errors, redundant entries in different source systems, missing data constraints, etc.  However, before data can be successfully de-duplicated, you must first define the criteria which identify which 2 records are considered [...]

Also posted in Application, Business Rules, Data Lifecycle Management, Data Migration, Data Migration, Data Processing, Data Transformation, Data Warehousing, Data Warehousing, Ease-of-use, expressor Studio, Open Source ETL, Traditional ETL | 1 Comment
 

Laboratory Data Deluge

Having just attended the LabVantage Customer Training and Educational Conference (CTEC) in Charleston, SC, I have far more respect and appreciation for professionals whose job it is to manage the fast growing flood of data generated by laboratory clinicians, technicians, devices, systems and notebooks. The laboratory is hallowed ground. These folks are not trying to [...]

Also posted in expressor, Partners | Tagged , | 1 Comment
 

expressor / Teradata fast load benchmark results

With expressor 2.4, we recently introduced support for Teradata® Parallel Transporter (TPT) utilities and performed a brief benchmark to demonstrate the high performance characteristics of expressor’s new Teradata PT Load interface. Using just a single disk and a direct network connection to a Teradata 2580 appliance, our product was able to demonstrate load speeds in [...]

Also posted in Connectivity, Data Processing, expressor, Partners | Tagged , , , | Leave a comment
 

why data integration is hard

I just read Robin Bloor’s “10 reasons why data integration is hard” article referred to in a recent Lorraine Lawson blog and there isn’t anything he says I fundamentally disagree with.  Robin highlights various application and technology reasons ranging from “there’s no metadata warehouse” to “we never standardized on a single database product”, which have [...]

Also posted in Competition, expressor, Semantic Integration, Traditional ETL | 1 Comment