Data defined storage (also referred to as a data centric approach) is a marketing term for managing, protecting, and realizing value from data by combining application, information and storage tiers.[1]

This is achieved through a process where users, applications, and devices gain access to a repository of captured metadata that allows them to access, query and manipulate the relevant data to transform it into information, while providing a flexible and scalable platform for storage of the underlying data. The technology abstracts the data entirely from the storage, allowing fully transparent access to users.

Core technology

Data defined storage focuses on metadata with an emphasis on the content, meaning and value of information over the media, type and location of data. Data centric management enables organizations to take a single, unified approach to managing data across large, distributed locations which includes the use of content and metadata indexing. The technology pillars include:

  1. Media Independent Data Storage: Data defined storage removes media centric data storage boundaries within and across solid-state drive, hard disk drive, cloud storage and tape storage platforms, enables linear scale out functionality through a grid based Map Reduce architecture that leverages enterprise object storage technology and provides transparent data access across globally distributed repositories for high volume storage performance.
  2. Data Security & Identity Management: Data defined storage allows organizations to gain end-to-end identity management down to the individual user and device level to address growing enterprise mobility requirements and enhanced data security and information governance.
  3. Distributed metadata repository: Data defined storage enables organizations to virtualize aggregate file systems into a single global namespace. At ingestion; file, full text index and custom metadata is collected and stored in a distributed metadata repository. This repository is leveraged to enable speed and accuracy of search and discovery, and to extract value leading to informed business decisions and analytics.

Data defined storage builds on the benefits of both object storage and software-defined storage technologies, however, object and software-defined storage can only be mapped to media independent data storage, which enables a media agnostic infrastructure - utilizing any type of storage, including low cost commodity storage to scale out to petabyte-level capacities. Data defined storage unifies all data repositories and exposes globally distributed stores through the global namespace, eliminating data silos and improving storage utilization.

Usage

The first marketing using this term was by Tarmin, in its GridBank product. The data defined storage term might have been mentioned in 2013.[2] Tamrin issued press releases about customers.[3][4]

The data defined storage term was used for object storage with open protocol access for file system virtualization, such as CIFS, NFS, FTP as well as REST APIs and other cloud protocols such as Amazon S3, CDMI and OpenStack.

See also

  • Big data analytics – Extremely large or complex datasets
  • Block storage – Sequence of bits or bytes of a maximum predetermined size
  • Cloud storage – Model of computer data storage
  • Content-addressable storage – data storage mechanism
  • Enterprise search – Software for finding documents
  • Information governance – overall strategy for information at an organization which balances the risk that information presents with the value that it provides
  • Metadata management – Managing metadata about content data in digital media
  • Object storage – Computer data storage architecture that manages data as objects
  • Scale out storage – Ability of a system to handle an increasing amount of work
  • Software defined storage – Type of computer data storage

References

  1. Peters, Mark. "Unlocking the Power of Data with Data-Defined Storage" (PDF). ESG. Archived from the original (PDF) on 2014-11-29. Retrieved 30 June 2013.
  2. Goyal, Ambuj. "Edge2013 General Session Keynote Speech". IBM Edge. Archived from the original on 2016-04-13. Retrieved 2016-11-27.
  3. Miller, Dan (12 July 2013). "Tarmin and IBM help Premier Oil manage rapidly growing unstructured data". PR Newswire. Archived from the original on 31 July 2013. Retrieved 1 August 2013.
  4. Miller, Dan (17 December 2012). "Leading U.K. MSP brightsolid sees a shining future with Tarmin". PR Newswire. Archived from the original on 29 November 2014. Retrieved 1 August 2013.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.