
Metadata Management: Lineage Tracking in Git (Part 3 of…

  • by Allison Aguirre

This article is the third in a four-part series on managing metadata assets in IRI Workbench using Git. It focuses on the value of Git in tracking metadata lineage. Other articles in the series cover the use of Git as a metadata asset hub, for version control, and for metadata security.

In the previous article, I demonstrated the submission of IRI project metadata assets through Git in IRI Workbench, built on Eclipse™. This article explains the importance of metadata lineage analysis, and how to view the history of a project stored in a Git repository.

Metadata lineage shows us the origin of our data and how it has been transformed during processing to reach its current state. Understanding and retaining knowledge of data’s origins, changes, and destinations is fundamental to data governance. Lineage also improves confidence in the metadata itself, in the data and processes the metadata defines, and in the decisions we make from the results of that processing.

Along with documenting the flow of data, metadata lineage can help us manage and secure our data more appropriately, since lineage also records the location and time of changes, and the data managers who made them.

Using Git, lineage can be presented conveniently in the context of metadata version control. The history of each version of your asset can be viewed and analyzed, with a log of the changes and who made them. To access this view, right-click on your project and go to Team > Show in History.

[Screenshot: the Team > Show in History menu item]

With this selected, the project history appears and lists each commit made over time. “HEAD” marks the commit your repository currently points to, and “master” is the name of the main branch of your project.

[Screenshot: project history listing each commit]
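The same history can also be read outside the Workbench with the `git log` command. The following is a minimal sketch, assuming Git is installed and on your PATH. The names `Commit`, `parse_log`, and `project_history` are hypothetical helpers invented for this illustration and are not part of IRI Workbench or EGit.

```python
import subprocess
from typing import NamedTuple

# Tab-separated format: short hash, author, date, ref names, subject.
LOG_FORMAT = "\t".join(["%h", "%an", "%ad", "%D", "%s"])


class Commit(NamedTuple):
    short_hash: str
    author: str
    date: str
    refs: str      # e.g. "HEAD -> master" on the newest commit, else ""
    subject: str


def parse_log(text: str) -> list[Commit]:
    """Turn `git log --pretty=format:LOG_FORMAT` output into Commit records."""
    commits = []
    for line in text.splitlines():
        if line:
            # Split at most 4 times so tabs in the subject stay intact.
            commits.append(Commit(*line.split("\t", 4)))
    return commits


def project_history(repo_path: str, path: str = ".") -> list[Commit]:
    """Run git log for one path inside a repository (requires git on PATH)."""
    result = subprocess.run(
        ["git", "-C", repo_path, "log", "--date=iso",
         f"--pretty=format:{LOG_FORMAT}", "--", path],
        capture_output=True, text=True, check=True,
    )
    return parse_log(result.stdout)
```

Calling `project_history` with the path to your own repository returns one record per commit, newest first, so the who, when, and what of each change can be filtered or reported on in a script.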

At any time, you can compare the current and past state of a file. To do so, right-click on the file you wish to analyze and select Compare With > Local History; then select the revision for comparison. The comparison shows the exact changes between the two versions. The following is an example of a field (column) name change in a SortCL job script managed by an IRI CoSort user:

[Screenshot: comparison of two versions of a SortCL job script showing a field name change]
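To illustrate what such a comparison reports, here is a small sketch that uses Python's standard `difflib` module to diff two revisions of a text file. The function name `describe_change` and the file name `job.scl` are hypothetical, and the script lines are made-up placeholders loosely modeled on a field definition, not verified SortCL syntax.

```python
import difflib


def describe_change(old_text: str, new_text: str, name: str = "job.scl") -> str:
    """Return a unified diff between two revisions of the same text file."""
    diff = difflib.unified_diff(
        old_text.splitlines(keepends=True),
        new_text.splitlines(keepends=True),
        fromfile=f"{name} (earlier revision)",
        tofile=f"{name} (current)",
    )
    return "".join(diff)


# Placeholder content only: one field is renamed between revisions.
old = "/INFILE=customers.csv\n/FIELD=(cust_name)\n/FIELD=(city)\n"
new = "/INFILE=customers.csv\n/FIELD=(customer_name)\n/FIELD=(city)\n"

print(describe_change(old, new))
```

Lines prefixed with `-` exist only in the earlier revision and lines prefixed with `+` exist only in the current one, which is the same information the compare editor shows side by side.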

Changes to metadata and other assets are easy to see in this view, as is who made the change (and when). Having those changes in the repository will allow others to deliberate over the implications. Searching (see below) for all instances of the string across your projects will expose their impact beyond the immediate application.

Beyond the metadata lineage information that Git provides, IRI Workbench users can use the extensive file and text search features of Eclipse to analyze lineage, along with the impact of changes (metadata impact analysis). Search for metadata within or across projects; for example, column names that appear in one or more data definition files (.DDF repositories), or transformation rules (like conditions) in IRI job scripts.

Search window results allow you not only to see occurrences at a glance, but also to drill down and open each asset where the item occurs (e.g., where a given field name is specified). This gives you and your team better insight (and foresight) into the impact of changes to that metadata asset before (and after) they are made.

[Screenshot: search results listing occurrences across projects]
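The kind of cross-project search described above can be approximated in a script when you want a quick impact report outside Eclipse. This is a rough sketch, not the Workbench search feature itself. The names `Hit` and `find_references` are hypothetical, and the `.scl` suffix for job scripts is an assumption (the `.ddf` suffix follows the data definition files mentioned earlier).

```python
from pathlib import Path
from typing import Iterator, NamedTuple


class Hit(NamedTuple):
    path: Path
    line_no: int
    text: str


def find_references(root, term: str,
                    suffixes=(".ddf", ".scl")) -> Iterator[Hit]:
    """Yield every line under `root` that contains `term`.

    Only files whose suffix is in `suffixes` are read; the default
    suffixes are assumptions about how metadata files are named.
    """
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in suffixes:
            with path.open(encoding="utf-8", errors="replace") as fh:
                for line_no, line in enumerate(fh, start=1):
                    if term in line:
                        yield Hit(path, line_no, line.rstrip("\n"))
```

Pointing `find_references` at a workspace folder and a field name lists each file and line where that name appears, which gives a first estimate of what a rename would touch.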

For more on this series about IRI metadata management using Git, see part two, on version control. You can also read about using Git for asset security in part four.

Metadata Management: Version Control in Git (Part 2 of 4)
Metadata Management: Asset Security in Git (Part 4 of 4)

© 2025 Innovative Routines International (IRI), Inc., All Rights Reserved | Contact