ZFS Data Integrity and Authentication

ZFS provides end-to-end data integrity by storing checksums for each data block in the parent block's pointer rather than with the data block itself. This provides fault isolation between the data and checksum. When data and checksum disagree, the checksum can be trusted because it is validated by the parent block. ZFS uses these checksums to detect and correct silent data corruption by determining the correct copy of data when disks return bad data, and repairing damaged copies. The blocks in a ZFS storage pool form a Merkle tree where each block validates its children cryptographically, providing authentication for the entire storage pool.


Friday Dec 09, 2005

ZFS End-to-End Data Integrity


The job of any filesystem boils down to this: when asked to read a block, it should return the same data that was previously written to that block. If it can't do that -- because the disk is offline or the data has been damaged or tampered with -- it should detect this and return an error.

Incredibly, most filesystems fail this test. They depend on the underlying hardware to detect and report errors. If a disk simply returns bad data, the average filesystem won't even detect it.

Even if we could assume that all disks were perfect, the data would still be vulnerable to damage in transit: controller bugs, DMA parity errors, and so on. All you'd really know is that the data was intact when it left the platter. If you think of your data as a package, this would be like UPS saying, "We guarantee that your package wasn't damaged when we picked it up." Not quite the guarantee you were looking for.

In-flight damage is not a mere academic concern: even something as mundane as a bad power supply can cause silent data corruption.

Arbitrarily expensive storage arrays can't solve the problem. The I/O path remains just as vulnerable, but becomes even longer: after leaving the platter, the data has to survive whatever hardware and firmware bugs the array has to offer. And if you're on a SAN, you're using a network designed by disk firmware writers. God help you.

What to do? One option is to store a checksum with every disk block. Most modern disk drives can be formatted with sectors that are slightly larger than the usual 512 bytes -- typically 520 or 528. These extra bytes can be used to hold a block checksum. But making good use of this checksum is harder than it sounds: the effectiveness of a checksum depends tremendously on where it's stored and when it's evaluated.

In many storage arrays (see the Dell|EMC PowerVault paper for a typical example with an excellent description of the issues), the data is compared to its checksum inside the array. Unfortunately this doesn't help much. It doesn't detect common firmware bugs such as phantom writes (the previous write never made it to disk) because the data and checksum are stored as a unit -- so they're self-consistent even when the disk returns stale data. And the rest of the I/O path from the array to the host remains unprotected. In short, this type of block checksum provides a good way to ensure that an array product is not any less reliable than the disks it contains, but that's about all.

NetApp's block-appended checksum approach appears similar but is in fact much stronger. Like many arrays, NetApp formats its drives with 520-byte sectors. It then groups them into 8-sector blocks: 4K of data (the WAFL filesystem blocksize) and 64 bytes of checksum. When WAFL reads a block it compares the checksum to the data just like an array would, but there's a key difference: it does this comparison after the data has made it through the I/O path, so it validates that the block made the journey from platter to memory without damage in transit.

This is a major improvement, but it's still not enough. A block-level checksum only proves that a block is self-consistent; it doesn't prove that it's the right block. Reprising our UPS analogy, "We guarantee that the package you received is not damaged. We do not guarantee that it's your package."

The fundamental problem with all of these schemes is that they don't provide fault isolation between the data and the checksum that protects it.

ZFS Data Authentication

End-to-end data integrity requires that each data block be verified against an independent checksum, after the data has arrived in the host's memory. It's not enough to know that each block is merely consistent with itself, or that it was correct at some earlier point in the I/O path. Our goal is to detect every possible form of damage, including human mistakes like swapping on a filesystem disk or mistyping the arguments to dd(1). (Have you ever typed "of=" when you meant "if="?)

A ZFS storage pool is really just a tree of blocks. ZFS provides fault isolation between data and checksum by storing the checksum of each block in its parent block pointer -- not in the block itself. Every block in the tree contains the checksums for all its children, so the entire pool is self-validating. [The uberblock (the root of the tree) is a special case because it has no parent; more on how we handle that in another post.]

When the data and checksum disagree, ZFS knows that the checksum can be trusted because the checksum itself is part of some other block that's one level higher in the tree, and that block has already been validated.
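To make the fault-isolation idea concrete, here is a minimal sketch of a block store whose parents hold their children's checksums. It is not ZFS source; the names checksum, BlockPointer, Disk, and read_verified are purely illustrative, and SHA-256 stands in for whichever checksum function the pool happens to use.

```python
# Minimal sketch (not ZFS source) of "checksum in the parent block pointer".
import hashlib

def checksum(data: bytes) -> bytes:
    """Stand-in for a ZFS block checksum (e.g. fletcher2 or SHA-256)."""
    return hashlib.sha256(data).digest()

class BlockPointer:
    """A parent's reference to a child block: address plus the child's checksum."""
    def __init__(self, address: int, child_checksum: bytes):
        self.address = address
        self.child_checksum = child_checksum

class Disk:
    """Toy block store; a real disk could silently return stale or corrupt data."""
    def __init__(self):
        self.blocks = {}

    def write(self, address: int, data: bytes) -> BlockPointer:
        self.blocks[address] = data
        # The checksum is handed back so the *parent* block can record it.
        return BlockPointer(address, checksum(data))

    def read(self, address: int) -> bytes:
        return self.blocks[address]

def read_verified(disk: Disk, bp: BlockPointer) -> bytes:
    """Read a child block and verify it against the checksum stored in its
    (already-validated) parent. Because data and checksum live in separate
    blocks, a stale or misdirected child cannot be self-consistent."""
    data = disk.read(bp.address)
    if checksum(data) != bp.child_checksum:
        raise IOError(f"checksum mismatch at block {bp.address}")
    return data

# Usage: simulate a phantom write -- the new data never reached the disk.
disk = Disk()
bp = disk.write(7, b"new contents")
disk.blocks[7] = b"old contents"        # disk silently kept the stale block
try:
    read_verified(disk, bp)
except IOError as e:
    print("detected:", e)
```

Note that a checksum stored alongside the stale block would still match it; the detection here works only because the expected checksum travels with the parent, which was validated first.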

ZFS uses its end-to-end checksums to detect and correct silent data corruption. If a disk returns bad data transiently, ZFS will detect it and retry the read. If the disk is part of a mirror or RAID-Z group, ZFS will both detect and correct the error: it will use the checksum to determine which copy is correct, provide good data to the application, and repair the damaged copy.

As always, note that ZFS end-to-end data integrity doesn't require any special hardware. You don't need pricey disks or arrays, you don't need to reformat drives with 520-byte sectors, and you don't have to modify applications to benefit from it. It's entirely automatic, and it works with cheap disks.

But wait, there's more! The blocks of a ZFS storage pool form a Merkle tree in which each block validates all of its children. Merkle trees have been proven to provide cryptographically-strong authentication for any component of the tree, and for the tree as a whole. ZFS employs 256-bit checksums for every block, and offers checksum functions ranging from the simple-and-fast fletcher2 (the default) to the slower-but-secure SHA-256. When using a cryptographic hash like SHA-256, the uberblock checksum provides a constantly up-to-date digital signature for the entire storage pool.

Which comes in handy if you ask UPS to move it.
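As a rough illustration of the self-healing behavior described above -- again a sketch under assumptions, not ZFS code, with purely illustrative names -- the following reads one logical block from a two-way mirror, using the checksum taken from the parent block pointer to decide which copy is good, and then repairs the damaged copy.

```python
# Minimal sketch (not ZFS source) of a self-healing mirrored read.
import hashlib

def checksum(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def mirrored_read(copies: list, address: int, expected: bytes) -> bytes:
    """Read one logical block from a list of mirror sides (dicts of address -> bytes).
    'expected' is the checksum recorded in the parent block pointer. Return a copy
    that matches it, then rewrite any sides that did not."""
    good = None
    bad_sides = []
    for side in copies:
        data = side.get(address)
        if data is not None and checksum(data) == expected:
            good = data
        else:
            bad_sides.append(side)
    if good is None:
        raise IOError(f"all copies of block {address} failed checksum")
    for side in bad_sides:              # self-healing: repair the damaged copies
        side[address] = good
    return good

# Usage: one side silently returns garbage; the read still succeeds and heals it.
side_a = {42: b"important data"}
side_b = {42: b"garbage bits!!"}
expected = checksum(b"important data")            # held by the parent block pointer
print(mirrored_read([side_a, side_b], 42, expected))   # b'important data'
print(side_b[42])                                      # repaired in place
```

With a cryptographic hash such as SHA-256 in this role, the checksum recorded at the root of the tree (the uberblock) transitively covers every block beneath it, which is what makes it usable as the pool-wide digital signature described above.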
