Data Management Glossary nnn
-
A
- Active Storage
- Adaptive Data Management
- AI Agents
- AI and Corporate Data
- AI Compute
- AI Data Extraction
- AI Data Governance
- AI Data Ingestion
- AI Data Leakage
- AI Data Management
- AI Data Pipelines
- AI Data Preparation
- AI Data Workflows
- AI Inferencing
- AI Infrastructure
- Air Gap
- Alternate Data Streams (ADS)
- Amazon (AWS) S3 Intelligent Tiering
- Amazon FSx
- Amazon Glacier (AWS Glacier)
- Amazon S3 (AWS S3)
- Amazon S3 Glacier Instant Retrieval
- Amazon Tiering
- Analytics-driven Data Management
- Application Programming Interface (API)
- Archival Storage
- Archiving
- Artificial Intelligence (AI)
- AWS DataSync
- AWS Lambda
- AWS Snowball
- AWS Storage
- Azure Data Box
- Azure NetApp Files
- Azure Storage
- Azure Tiering
-
C
- Capacity Planning
- Carbon footprint
- Carbon Usage Effectiveness
- Chain of Custody
- Chargeback
- Checksum
- Cloud Archiving
- Cloud Cost Optimization
- Cloud Costs
- Cloud Data Analytics
- Cloud Data Growth Analytics
- Cloud Data Management
- Cloud Data Migration
- Cloud Data Storage
- Cloud File Storage
- Cloud Migration
- Cloud NAS
- Cloud Object Storage
- Cloud Storage Gateway
- Cloud Tiering
- CloudPools
- Cold Data
- Common Internet File System (CIFS)
- Compression
-
D
- Dark Data
- Data Analytics
- Data Archiving
- Data Backup
- Data Center Consolidation
- Data Center Emissions
- Data Classification
- Data Curation
- Data Governance
- Data Hoarding
- Data Indexing
- Data Lake
- Data Lakehouse
- Data Lifecycle Management
- Data Lineage
- Data Literacy
- Data Management
- Data Management for AI
- Data Management Policy
- Data Migration
- Data Migration Chain of Custody
- Data Migration Plan
- Data Migration Software
- Data Migration Warm Cutover
- Data Mobilization
- Data Orchestration
- Data Protection
- Data Retention
- Data Retrieval
- Data Services
- Data Silos
- Data Sprawl
- Data Storage
- Data Storage Costs
- Data Storage Management Services (DSMS)
- Data Storage Optimization
- Data Storage Tags
- Data Tagging
- Data Tiering
- Data Transfer
- Data Virtualization
- Deduplication
- Deep Analytics
- Dell PowerScale
- Dell PowerScale SmartPools
- Department Showback
- Digital Business
- Digital Pathology Data Management
- Direct Data Access
- Director (Komprise Director)
- Disaster Recovery
- Dynamic Data Analytics
- Dynamic Links
-
E
-
F
-
G
-
H
-
I
-
K
-
M
-
N
-
O
-
P
-
R
-
S
- S3
- S3 Data Migration
- S3 Intelligent Tiering
- Scale-Out Grid
- Scale-Out Storage
- Secondary Storage
- Sensitive Data Detection
- Shadow AI
- Shadow IT
- Sharding
- Shared-Nothing Architecture
- Showback
- Smart Data Workflows
- SmartPools
- SMB Data Migration
- SMB protocol (Server Message Block)
- Solid State Drives (SSDs)
- Storage Area Network (SAN)
- Storage Array
- Storage as a Service
- Storage as a Service (STaaS)
- Storage Assessment
- Storage Costs
- Storage Efficiency
- Storage Insights
- Storage Metrics
- Storage Pool
- Storage Reclamation
- Storage Refresh
- Storage Tiering
- Stubs
- Sustainable Data Management
- Symbolic Link
- System Metadata
-
U
- Unstructured Data
- Unstructured Data AI
- Unstructured Data Analytics
- Unstructured Data Classification
- Unstructured Data Governance
- Unstructured Data Management
- Unstructured Data Migration
- Unstructured Data Preparation
- Unstructured Data Storage
- Unstructured Data Tiering
- Unstructured Data Workflows
- Unstructured Metadata
Regex (Regular Expressions)
What is Regex?
Regex, short for regular expressions, is a sequence of characters that defines a search pattern. It is commonly used in text processing to match, locate, and manage specific strings or data formats (e.g., email addresses, ID numbers, dates). For an overview of regex history and examples visit the Wikipedia definition here.
Why Does Regex Matter for Sensitive Data Detection?
Regex is essential for detecting sensitive or custom data formats because it goes beyond simple keyword matching. With regex, organizations can:
- Identify standard PII such as credit card numbers, social security numbers, and email addresses.
- Detect custom identifiers unique to an organization, such as employee IDs, patient record numbers, machine IDs, or project codes.
- Reduce the risk of data leakage and breaches by automating detection across massive volumes of unstructured data.
What is the Komprise Solution for Regex Keyword Search?
Komprise delivers built-in regex keyword search within its Smart Data Workflows automation. This enables customers to:
- Scan for both predefined PII types and custom data formats defined by regex patterns.
- Classify and tag sensitive information across unstructured data sets. (See Unstructured Data Classification)
- Automate governance workflows to mitigate risk, enforce compliance, and prepare AI data pipelines without exposing protected data.
Related Terms
Getting Started with Komprise:
- Learn about Intelligent Data Management
- Schedule a demonstration with our team
- Read the latest State of Unstructured Data Management Report
