Skip to content
IRI Logo
Solutions Products
  • Solutions
  • Products
  • Blog
  • BI
  • Big Data
  • DQ
  • ETL
  • IRI
    • IRI Business
    • IRI Workbench
  • Mask
  • MDM
    • Master Data Management
    • Metadata Management
  • Migrate
    • Data Migration
    • Sort Migration
  • Test Data
  • Transform
  • VLDB
  • VLOG

Data Class File Masking

  • by Claudia Irvine

The Data Class File Masking Job wizard in IRI Workbench protects large numbers of structured file sources that have been previously classified. While it was possible to use the data classes in many of the existing wizards, if the data class library included a lot of classified fields, selection of the classified sources was cumbersome. Additionally, only one file at a time could be protected. This wizard allows selection of multiple sources. 

While this example will not show all the prerequisite steps, here is an overview:

  1. Set up data classes in preferences.
  2. Create field rules.
  3. Create a data class library (in this example, using the Directory Data Class Search wizard). Make sure that data classes are mapped to fields.
  4. Assign default rules to data classes in the data class library.

In this library, there are three csv sources. There are six data classes that are used in eleven data class mappings. There are different rules assigned to the data classes.

Begin using the wizard by right-clicking the data class library in the Project Explorer and selecting Mask Included File Sources. 

On the setup page, enter the job details. There is an optional summary page; however, it is not recommended to display it if there are a large number of sources as it may take a while to load. In this example, check the box for the summary page to be shown. Select an output type. Same will overwrite the existing file. Different will create a new file in a location selected on the target page. Click Next.

The filter page allows the inclusion of selected data classes only. A warning will be shown if any of the selected data classes do not have a default rule assigned. Click Next.

The source page is populated with the data sources that are referenced in a data class map, even if in different locations. In this case, all sources in the library had a mapping and are listed below. Select the data sources to be protected by this job. Click Next.

The target page is where the target details are entered. Enter the location where the output files will be created. Click OK.

The summary page displays the data class rules and which fields will be using that rule. Click Finish.

After the wizard closes, a Flow Diagram of the job is opened. It displays the components produced for the designed output. It includes three transform mapping blocks representing scripts. The files are contained in the project folder and also include an executable script to run the job.

Below is one of the scripts that is produced by the wizard. This particular source had four classified columns which were transformed using the two different rules. The outline displays a different icon for the four fields that are now protected.

This wizard saves time by creating multiple task scripts, masking many different structured files in the same job.

A Splunk Phantom Playbook for Masking Sensitive Data
Securing FieldShield Passphrases in Azure Key Vault
data class data class masking data masking Eclipse IRI Workbench

Related articles

DarkShield PII Discovery & Masking…
Masking Flat Files in the…
Directory Data Class Search Wizard
Masking PII in a Relational…
IRI Data Class Map
Schema Data Class Search
Training NER Models in IRI…
Masking NoSQL DB PII in…
Masking RDB Data in the…
IRI DarkShield-NoSQL RPC API
Find & Mask File PII…

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Categories

  • Big Data 66
  • Business Intelligence (BI) 77
  • Data Masking/Protection 163
  • Data Quality (DQ) 41
  • Data Transformation 94
  • ETL 122
  • IRI 229
    • IRI Business 86
    • IRI Workbench 162
  • MDM 37
    • Master Data Management 12
    • Metadata Management 25
  • Migration 65
    • Data Migration 60
    • Sort Migration 6
  • Test Data 102
  • VLDB 78
  • VLOG 40

Tracking

© 2025 Innovative Routines International (IRI), Inc., All Rights Reserved | Contact