Data Management Glossary nnn
-
A
- Active Storage
- Adaptive Data Management
- AI Agents
- AI and Corporate Data
- AI Compute
- AI Data Extraction
- AI Data Governance
- AI Data Ingestion
- AI Data Leakage
- AI Data Management
- AI Data Pipelines
- AI Data Preparation
- AI Data Workflows
- AI Inferencing
- AI Infrastructure
- Air Gap
- Alternate Data Streams (ADS)
- Amazon (AWS) S3 Intelligent Tiering
- Amazon FSx
- Amazon Glacier (AWS Glacier)
- Amazon S3 (AWS S3)
- Amazon S3 Glacier Instant Retrieval
- Amazon Tiering
- Analytics-driven Data Management
- Application Programming Interface (API)
- Archival Storage
- Archiving
- Artificial Intelligence (AI)
- AWS DataSync
- AWS Lambda
- AWS Snowball
- AWS Storage
- Azure Data Box
- Azure NetApp Files
- Azure Storage
- Azure Tiering
-
C
- Capacity Planning
- Carbon footprint
- Carbon Usage Effectiveness
- Chain of Custody
- Chargeback
- Checksum
- Cloud Archiving
- Cloud Cost Optimization
- Cloud Costs
- Cloud Data Analytics
- Cloud Data Growth Analytics
- Cloud Data Management
- Cloud Data Migration
- Cloud Data Storage
- Cloud File Storage
- Cloud Migration
- Cloud NAS
- Cloud Object Storage
- Cloud Storage Gateway
- Cloud Tiering
- CloudPools
- Cold Data
- Common Internet File System (CIFS)
- Compression
-
D
- Dark Data
- Data Analytics
- Data Archiving
- Data Backup
- Data Center Consolidation
- Data Center Emissions
- Data Classification
- Data Curation
- Data Governance
- Data Hoarding
- Data Indexing
- Data Lake
- Data Lakehouse
- Data Lifecycle Management
- Data Lineage
- Data Literacy
- Data Management
- Data Management for AI
- Data Management Policy
- Data Migration
- Data Migration Chain of Custody
- Data Migration Plan
- Data Migration Software
- Data Migration Warm Cutover
- Data Mobilization
- Data Orchestration
- Data Protection
- Data Retention
- Data Retrieval
- Data Services
- Data Silos
- Data Sprawl
- Data Storage
- Data Storage Costs
- Data Storage Management Services (DSMS)
- Data Storage Optimization
- Data Storage Tags
- Data Tagging
- Data Tiering
- Data Transfer
- Data Virtualization
- Deduplication
- Deep Analytics
- Dell PowerScale
- Dell PowerScale SmartPools
- Department Showback
- Digital Business
- Digital Pathology Data Management
- Direct Data Access
- Director (Komprise Director)
- Disaster Recovery
- Dynamic Data Analytics
- Dynamic Links
-
E
-
F
-
G
-
H
-
I
-
K
-
M
-
N
-
O
-
P
-
R
-
S
- S3
- S3 Data Migration
- S3 Intelligent Tiering
- Scale-Out Grid
- Scale-Out Storage
- Secondary Storage
- Sensitive Data Detection
- Shadow AI
- Shadow IT
- Sharding
- Shared-Nothing Architecture
- Showback
- Smart Data Workflows
- SmartPools
- SMB Data Migration
- SMB protocol (Server Message Block)
- Solid State Drives (SSDs)
- Storage Area Network (SAN)
- Storage Array
- Storage as a Service
- Storage as a Service (STaaS)
- Storage Assessment
- Storage Costs
- Storage Efficiency
- Storage Insights
- Storage Metrics
- Storage Pool
- Storage Reclamation
- Storage Refresh
- Storage Tiering
- Stubs
- Sustainable Data Management
- Symbolic Link
- System Metadata
-
U
- Unstructured Data
- Unstructured Data AI
- Unstructured Data Analytics
- Unstructured Data Classification
- Unstructured Data Governance
- Unstructured Data Management
- Unstructured Data Migration
- Unstructured Data Preparation
- Unstructured Data Storage
- Unstructured Data Tiering
- Unstructured Data Workflows
- Unstructured Metadata
Data Management for AI
Data Management for AI (artificial intelligence) is the process of gathering and storing data in a way that can be used by AI and machine learning models to generate insights, make predictions and drive research and innovation initiatives. AI models require significant amounts of data to train and improve their accuracy, most of which is unstructured data. However, this data is not simple rows and columns. It is files, objects, semi-structured and structured data, all of which can be messy and difficult to manage.
In late 2022, Komprise cofounder and CEO Kumar Goswami noted:
“Enterprises need to be ready for this wave of change and it starts by getting unstructured data prepped, as this data is the critical ingredient for AI/ML.”
He published this post in early 2023: The AI/ML Revolution: Data Management Needs to Evolve, making the following recommendations:
- Get full visibility so you can optimize and leverage your data
- If you aren’t indexing your data today, that’s a problem
- Make new uses of data while still being cost-efficient
- Collaborate with departments on data needs
SPOG: Data Management Requirements for AI
With so much discussion about ChatGPT, generative AI, AI regulations and the opportunities and threats posed by rapid AI innovation, Komprise cofounder and COO Krishna Subramanian tied the discussion back to data management for AI summarizing the need for strategies and policies focused on data security, data privacy, data ownership, data lineage and data governance.
Read:
- SPLOG: The Data Management Issues with Generative AI
- Komprise’s Krishna Subramanian on Generative AI and Data Management

Related Terms
Getting Started with Komprise:
- Learn about Intelligent Data Management
- Schedule a demonstration with our team
- Read the latest State of Unstructured Data Management Report