{"id":10421,"date":"2016-08-24T09:10:12","date_gmt":"2016-08-24T13:10:12","guid":{"rendered":"http:\/\/www.iri.com\/blog\/?p=10421"},"modified":"2017-11-03T14:45:04","modified_gmt":"2017-11-03T18:45:04","slug":"data-consolidation-wizard-for-data-quality","status":"publish","type":"post","link":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/","title":{"rendered":"Data Consolidation Wizard for Data Quality"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">Given the amount of data businesses garner daily from human interaction, it is easy to understand how their sources become rife with redundant or erroneous entries. For the sake of data quality and efficiency, master data management (MDM), and improved customer experiences, data architects are interested in a single, standard representation of their data.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">If \u201cJohn Smith\u201d is entered into a sales database, for example, and later he is entered again as \u201cJon Smith,\u201d there will be two instances of the same person. As this occurs many times with many different customers, the database gets clogged with redundant data, and processes like profiling, querying, and loading are slowed down. 
Beyond that, customer service representatives and other stakeholders in the business will not have a reliable view of the customer, misidentifying them, sending multiple catalogs to their home, etc.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In short, <\/span><a href=\"http:\/\/www.iri.com\/blog\/iri\/business\/the-importance-of-a-unified-view-of-the-customer-2\"><span style=\"font-weight: 400;\">data unification<\/span><\/a><span style=\"font-weight: 400;\"> is important to the continued health of your data and your business.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/1-Unified-View-of-Customer.jpeg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10422\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/1-Unified-View-of-Customer.jpeg\" alt=\"Unified View of Customer\" width=\"600\" height=\"345\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/1-Unified-View-of-Customer.jpeg 761w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/1-Unified-View-of-Customer-300x172.jpeg 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><\/p>\n<p style=\"text-align: center;\"><i><span style=\"font-weight: 400;\">Source: Tata Consultancy Services<\/span><\/i><\/p>\n<p><a href=\"http:\/\/www.iri.com\/products\/voracity\"><span style=\"font-weight: 400;\">IRI Voracity<\/span><\/a><span style=\"font-weight: 400;\"> users can access a new Data Consolidation wizard in <\/span><a href=\"http:\/\/www.iri.com\/products\/workbench\"><span style=\"font-weight: 400;\">IRI Workbench<\/span><\/a><span style=\"font-weight: 400;\"> to identify and correct redundant records in files and databases. 
The wizard accepts the following data formats as inputs: CSV, Delimited, LDIF, ODBC, or XML.<\/span><\/p>\n<h3><b>Source Selection<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">After designating the location of the project on the setup page, the first part of the wizard involves source selection. If your sources are only tables in a connection profile, you can check the appropriate checkbox. <\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/2-Data-Consolidation-Job.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10423\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/2-Data-Consolidation-Job.png\" alt=\"Data Consolidation Job\" width=\"600\" height=\"166\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/2-Data-Consolidation-Job.png 678w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/2-Data-Consolidation-Job-300x83.png 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><span style=\"font-weight: 400;\"><br \/>\nSelecting this box \u00a0and clicking\u00a0<\/span><b>Next <\/b><span style=\"font-weight: 400;\">opens an input page like the one below where you can choose the tables to be included:<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/3-Data-Consolidation-Job-Table-Selection.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10424\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/3-Data-Consolidation-Job-Table-Selection.png\" alt=\"Data Consolidation Job-Table Selection\" width=\"600\" height=\"469\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/3-Data-Consolidation-Job-Table-Selection.png 679w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/3-Data-Consolidation-Job-Table-Selection-300x235.png 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">If the checkbox is not 
selected, you can add files or ODBC sources in the same input screen. On this type of input page, you will also need to add the metadata for each source. In this example, an XML file and an Oracle table have been added.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/4-Data-Consolidation-Job-Data-Sources.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10425\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/4-Data-Consolidation-Job-Data-Sources.png\" alt=\"Data Consolidation Job-Data Sources\" width=\"600\" height=\"479\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/4-Data-Consolidation-Job-Data-Sources.png 679w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/4-Data-Consolidation-Job-Data-Sources-300x239.png 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Once you have added the desired sources, click <\/span><b>Next<\/b><span style=\"font-weight: 400;\">.<\/span><\/p>\n<h3><b>Create Output Fields<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">On this page, you choose which fields will be part of the output. Drag and drop the desired fields from the Input Field List to the Output Field List. For example, to create a new customer table containing master IDs and names, you could select just the ID and Name fields. 
To use all the fields for a particular source, select the data source name and drag it to the Output Field List.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/5-Data-Consolidation-Job-Create-Output-Fields.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10426\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/5-Data-Consolidation-Job-Create-Output-Fields.png\" alt=\"Data Consolidation Job-Create Output Fields\" width=\"600\" height=\"435\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/5-Data-Consolidation-Job-Create-Output-Fields.png 679w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/5-Data-Consolidation-Job-Create-Output-Fields-300x217.png 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><span style=\"font-weight: 400;\"><br \/>\nYou can also rename the output fields here by clicking\u00a0<\/span><b>Rename<\/b><span style=\"font-weight: 400;\"> or double-clicking the parent field and then clicking another item in the tree to save it. Once all the desired fields are in the Output Field List, click <\/span><b>Next<\/b><span style=\"font-weight: 400;\">.<\/span><\/p>\n<h3><b>Create Grouping Methods<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Here you create and commit grouping methods the wizard will use to determine if your data has redundancy. 
This page also displays a bar graph of the redundancy groups your data contains.<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Displays the name of the field in your output file.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Selects the matching type for redundancy comparison.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Activates if a fuzzy comparison option is selected for the matching type and allows you to select the threshold for the fuzzy comparison.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Commits the corresponding method.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Shows the redundancy grouping graph.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-weight: 400;\"><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/6-Data-Consolidation-Job-Create-Grouping-Methods.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10427\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/6-Data-Consolidation-Job-Create-Grouping-Methods.png\" alt=\"Data Consolidation Job-Create Grouping Methods\" width=\"600\" height=\"659\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/6-Data-Consolidation-Job-Create-Grouping-Methods.png 679w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/6-Data-Consolidation-Job-Create-Grouping-Methods-273x300.png 273w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><\/span><\/p>\n<p>Each field created on the previous page is displayed here so that you can select a grouping method for it. 
For example:<\/p>\n<ul>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">To check for potentially redundant records in customer tables, select an exact match for ID and an approximate match for Name.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">For matching, choose Exact, Dice, or Levenshtein, or leave the field blank.<\/span>\n<ul>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">The Exact option checks if both fields are exactly the same.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Dice divides each word into adjacent character pairs and compares how many pairs the two words share. It is faster than Levenshtein but less accurate on short words.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Levenshtein counts the minimum number of single-character edits (insertions, deletions, or substitutions) needed to turn one word into the other. This is useful for catching spelling mistakes and common typos, but it is the slowest option.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Leaving the matching field blank will mean no grouping is performed on the data.<\/span><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Once you set all the options for a method, select <\/span><b>Commit Method<\/b><span style=\"font-weight: 400;\"> and the method will be applied to the data. This process can take time depending on the size of the data source. Once it is complete, the graph will update, showing the different groups and the number of conflicts in each group. In this example, there are 111 records with no conflicts and another group where 9 records conflict.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Once you have all the methods created and committed, click <\/span><b>Next<\/b><span style=\"font-weight: 400;\">. 
See our <a href=\"http:\/\/www.iri.com\/blog\/vldb-operations\/data-quality-and-fuzzy-searching\/\">prior article<\/a> on fuzzy matching algorithms for data-centric examples of their use.<\/span><\/p>\n<h3><b>Resolve Conflicts<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">This page displays the conflicts in your output, so you can choose the record that is the best representation of the (master) output data.<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Displays all conflicts in your data source.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">All conflicting records in the group.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Field names of the data in the conflict including the data source.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Record selected as the unique record.<\/span><\/li>\n<\/ol>\n<p><span style=\"font-weight: 400;\"><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10428\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png\" alt=\"Data Consolidation Job-Resolve Conflicts\" width=\"600\" height=\"659\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png 679w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts-273x300.png 273w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><\/span><\/p>\n<p>For each item in the Conflicts list, you must choose one or more of the conflicting records to be a unique record:<\/p>\n<ul>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Select an item in the Conflicts list to populate the bottom table with the field names and conflicting 
records.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">For each record that represents a different data entity, select it to mark it as unique. To undo, click the record again and it will be unmarked as unique.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Once you are sure that you selected all of the records that should be unique in that group, select <\/span><b>Commit Unique Records<\/b><span style=\"font-weight: 400;\">. This will save those records and prevent the others from being added to the output.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">If you want to save all the unique records found, you can select <\/span><b>Commit All Unique Records<\/b><span style=\"font-weight: 400;\">. This button is handy when using the Exact method and you want to save all unique occurrences.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">If you want to ignore a set of records, you can click <\/span><b>Ignore Records<\/b><span style=\"font-weight: 400;\"> and that group of records will not be saved. <\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">If you want to ignore all conflicting records, you can click <\/span><b>Ignore All Records<\/b><span style=\"font-weight: 400;\">.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">You must repeat these steps for each item in the Conflicts list (unless you selected either of the All buttons). Once it displays that there are no more conflicts, click <\/span><b>Next<\/b><span style=\"font-weight: 400;\">.<\/span><\/p>\n<h3><b>Target Selection<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Select the type of target for your unified data: either a database or flat file. 
If a flat file is chosen, enter the name and type of file (either CSV or XML).<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/8-Data-Consolidation-Job-Target-Selection.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10429\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/8-Data-Consolidation-Job-Target-Selection.png\" alt=\"Data Consolidation Job-Target Selection\" width=\"600\" height=\"170\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/8-Data-Consolidation-Job-Target-Selection.png 676w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/8-Data-Consolidation-Job-Target-Selection-300x85.png 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><span style=\"font-weight: 400;\"><br \/>\nIf sending the output to a database, enter the applicable information in\u00a0<\/span><b>Connection profile<\/b><span style=\"font-weight: 400;\">, <\/span><b>Schema<\/b><span style=\"font-weight: 400;\">, and <\/span><b>Table name<\/b><span style=\"font-weight: 400;\">. The table must not already exist. 
Then, click <\/span><b>Target Field Layout<\/b><span style=\"font-weight: 400;\"> to fine-tune the details of the fields in the new table.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/9-Data-Consolidation-Job-Target-Field-Layout.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10430\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/9-Data-Consolidation-Job-Target-Field-Layout.png\" alt=\"Data Consolidation Job-Target Field Layout\" width=\"600\" height=\"262\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/9-Data-Consolidation-Job-Target-Field-Layout.png 679w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/9-Data-Consolidation-Job-Target-Field-Layout-300x131.png 300w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Once you have the details entered, click <\/span><b>Finish<\/b><span style=\"font-weight: 400;\">. The wizard will then write your unified data to your chosen target, ready to use.<\/span><\/p>\n<p><b>Sample Input and Output<\/b><\/p>\n<p><span style=\"font-weight: 400;\">In this screenshot, the two input tables are shown at the top, while flat-file and table versions of the output are shown at the bottom. 
All names that were misspelled between the original tables have been reconciled in the wizard so that only one \u201cgolden record\u201d name for each ID is added to the output.<br \/>\n<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/10-Sample-Input-and-Output.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-10431\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2016\/08\/10-Sample-Input-and-Output.png\" alt=\"Sample Input and Output\" width=\"600\" height=\"323\" srcset=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/10-Sample-Input-and-Output.png 1150w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/10-Sample-Input-and-Output-300x162.png 300w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/10-Sample-Input-and-Output-768x414.png 768w, https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/10-Sample-Input-and-Output-1024x552.png 1024w\" sizes=\"(max-width: 600px) 100vw, 600px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Given the amount of data businesses garner daily from human interaction, it is easy to understand how their sources become rife with redundant or erroneous entries. For the sake of data quality and efficiency, master data management (MDM), and improved customer experiences, data architects are interested in a single, standard representation of their data. 
If<\/p>\n<div><a class=\"btn-filled btn\" href=\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\" title=\"Data Consolidation Wizard for Data Quality\">Read More<\/a><\/div>\n","protected":false},"author":43,"featured_media":10428,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"footnotes":""},"categories":[363,776,232],"tags":[906,611,895,782,902,901,898,907,899,789,850,903,905,851,869,896,904,550],"class_list":["post-10421","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-quality","category-etl","category-master-data-management","tag-correct","tag-csv","tag-data-unification","tag-databases","tag-dice","tag-exact","tag-files","tag-flat-file","tag-fuzzy-comparison","tag-iri-voracity","tag-iri-workbench","tag-levenshtein","tag-locate","tag-master-data-metadata-management","tag-metaphone","tag-redundant-records","tag-scan","tag-xml"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v23.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Data Consolidation Wizard for Data Quality - IRI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data Consolidation Wizard for Data Quality - IRI\" \/>\n<meta property=\"og:description\" content=\"Given the amount of data businesses garner daily from human interaction, 
it is easy to understand how their sources become rife with redundant or erroneous entries. For the sake of data quality and efficiency, master data management (MDM), and improved customer experiences, data architects are interested in a single, standard representation of their data. IfRead More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\" \/>\n<meta property=\"og:site_name\" content=\"IRI\" \/>\n<meta property=\"article:published_time\" content=\"2016-08-24T13:10:12+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-11-03T18:45:04+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png\" \/>\n\t<meta property=\"og:image:width\" content=\"679\" \/>\n\t<meta property=\"og:image:height\" content=\"746\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Claudia Irvine\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Claudia Irvine\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\"},\"author\":{\"name\":\"Claudia Irvine\",\"@id\":\"https:\/\/beta.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0\"},\"headline\":\"Data Consolidation Wizard for Data Quality\",\"datePublished\":\"2016-08-24T13:10:12+00:00\",\"dateModified\":\"2017-11-03T18:45:04+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\"},\"wordCount\":1181,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png\",\"keywords\":[\"correct\",\"csv\",\"data unification\",\"databases\",\"Dice\",\"Exact\",\"files\",\"flat file\",\"fuzzy comparison\",\"IRI Voracity\",\"IRI Workbench\",\"Levenshtein\",\"locate\",\"MDM\",\"Metaphone\",\"redundant records\",\"scan\",\"xml\"],\"articleSection\":[\"Data Quality (DQ&#041;\",\"ETL\",\"Master Data 
Management\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\",\"url\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\",\"name\":\"Data Consolidation Wizard for Data Quality - IRI\",\"isPartOf\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png\",\"datePublished\":\"2016-08-24T13:10:12+00:00\",\"dateModified\":\"2017-11-03T18:45:04+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage\",\"url\":\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflic
ts.png\",\"contentUrl\":\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png\",\"width\":679,\"height\":746,\"caption\":\"Data Consolidation Job-Resolve Conflicts\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/beta.iri.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data Consolidation Wizard for Data Quality\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/beta.iri.com\/blog\/#website\",\"url\":\"https:\/\/beta.iri.com\/blog\/\",\"name\":\"IRI\",\"description\":\"Total Data Management Blog\",\"publisher\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/beta.iri.com\/blog\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/beta.iri.com\/blog\/#organization\",\"name\":\"IRI\",\"url\":\"https:\/\/beta.iri.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/beta.iri.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"contentUrl\":\"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"width\":750,\"height\":206,\"caption\":\"IRI\"},\"image\":{\"@id\":\"https:\/\/beta.iri.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/beta.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0\",\"name\":\"Claudia 
Irvine\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/beta.iri.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g\",\"caption\":\"Claudia Irvine\"},\"url\":\"https:\/\/beta.iri.com\/blog\/author\/claudiai\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Data Consolidation Wizard for Data Quality - IRI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/","og_locale":"en_US","og_type":"article","og_title":"Data Consolidation Wizard for Data Quality - IRI","og_description":"Given the amount of data businesses garner daily from human interaction, it is easy to understand how their sources become rife with redundant or erroneous entries. For the sake of data quality and efficiency, master data management (MDM), and improved customer experiences, data architects are interested in a single, standard representation of their data. IfRead More","og_url":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/","og_site_name":"IRI","article_published_time":"2016-08-24T13:10:12+00:00","article_modified_time":"2017-11-03T18:45:04+00:00","og_image":[{"width":679,"height":746,"url":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png","type":"image\/png"}],"author":"Claudia Irvine","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Claudia Irvine","Est. 
reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#article","isPartOf":{"@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/"},"author":{"name":"Claudia Irvine","@id":"https:\/\/beta.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0"},"headline":"Data Consolidation Wizard for Data Quality","datePublished":"2016-08-24T13:10:12+00:00","dateModified":"2017-11-03T18:45:04+00:00","mainEntityOfPage":{"@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/"},"wordCount":1181,"commentCount":0,"publisher":{"@id":"https:\/\/beta.iri.com\/blog\/#organization"},"image":{"@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage"},"thumbnailUrl":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png","keywords":["correct","csv","data unification","databases","Dice","Exact","files","flat file","fuzzy comparison","IRI Voracity","IRI Workbench","Levenshtein","locate","MDM","Metaphone","redundant records","scan","xml"],"articleSection":["Data Quality (DQ&#041;","ETL","Master Data 
Management"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/","url":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/","name":"Data Consolidation Wizard for Data Quality - IRI","isPartOf":{"@id":"https:\/\/beta.iri.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage"},"image":{"@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage"},"thumbnailUrl":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png","datePublished":"2016-08-24T13:10:12+00:00","dateModified":"2017-11-03T18:45:04+00:00","breadcrumb":{"@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#primaryimage","url":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png","contentUrl":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidat
ion-Job-Resolve-Conflicts.png","width":679,"height":746,"caption":"Data Consolidation Job-Resolve Conflicts"},{"@type":"BreadcrumbList","@id":"https:\/\/beta.iri.com\/blog\/master-data-metadata-management\/master-data-management\/data-consolidation-wizard-for-data-quality\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/beta.iri.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Data Consolidation Wizard for Data Quality"}]},{"@type":"WebSite","@id":"https:\/\/beta.iri.com\/blog\/#website","url":"https:\/\/beta.iri.com\/blog\/","name":"IRI","description":"Total Data Management Blog","publisher":{"@id":"https:\/\/beta.iri.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/beta.iri.com\/blog\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/beta.iri.com\/blog\/#organization","name":"IRI","url":"https:\/\/beta.iri.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/beta.iri.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","contentUrl":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","width":750,"height":206,"caption":"IRI"},"image":{"@id":"https:\/\/beta.iri.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/beta.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0","name":"Claudia Irvine","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/beta.iri.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g","caption":"Claudia 
Irvine"},"url":"https:\/\/beta.iri.com\/blog\/author\/claudiai\/"}]}},"jetpack_featured_media_url":"https:\/\/beta.iri.com\/blog\/wp-content\/uploads\/2016\/08\/7-Data-Consolidation-Job-Resolve-Conflicts.png","_links":{"self":[{"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/posts\/10421"}],"collection":[{"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/users\/43"}],"replies":[{"embeddable":true,"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/comments?post=10421"}],"version-history":[{"count":13,"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/posts\/10421\/revisions"}],"predecessor-version":[{"id":11295,"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/posts\/10421\/revisions\/11295"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/media\/10428"}],"wp:attachment":[{"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/media?parent=10421"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/categories?post=10421"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/beta.iri.com\/blog\/wp-json\/wp\/v2\/tags?post=10421"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}