Utilizing Client Side De-Duplication
No Thumbnail Available
Date
2015-07
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Addis Ababa University
Abstract
According to a recent survey by Iternational Data Corporation [63], 75% of today’s digital
data are duplicated copies. To reduce the unnecessarily redundant copies, the storage servers
would handle duplication (either at a file level or chunks of data sized 4KB and larger). Deduplication
can be managed both at the server-side and the client-side. In order to identify
duplicated copies, it is required that files be un-encrypted. However users may be worried
about the security of their files and may want their data to be encrypted. However encryption
makes cipher text indistinguishable from theoretically random data, i.e., encrypted data are
always distributed randomly, so identical plaintext encrypted by randomly generated
cryptographic keys will very likely have different cipher texts which cannot be de-duplicated.
In this research, a method that resolves the conflict between de-duplication and encryption
is presented.
Keywords - Cloud Storage, Client-side de-duplication, Proof of Ownership, File Server,
Security, Secure Hash Standard, Advanced Encryption Standard, Windows Communication
Foundation, Windows Presentation Foundation, and Dot Net Framework
Description
Keywords
Cloud Storage, Client-side de-duplication, Proof of Ownership, File Server, Security, Secure Hash Standard, Advanced Encryption Standard, Windows Communication Foundation, Windows Presentation Foundation, Dot Net Framework