/root

Category: Storage

StorJ price adjustment

June 7th, 2026

Starting July 1, StorJ will introduce a minimum payment of 50 dollars per month, irrespective of storage usage!

As of now I run weekly S3 backups with restic from my TrueNAS to StorJ. I may need to migrate to another solution, e.g. Hetzner.

https://forum.storj.io/t/important-update-upcoming-storj-customer-pricing-adjustments-and-sno-impact/31827
ZFS checksum self-heal

May 24th, 2026
The setup

In April I expanded my main pool from four-wide RAIDZ2 to five-wide by adding a single 10 TB Seagate IronWolf to four existing 4 TB WD Red Plus drives. OpenZFS 2.3+ supports RAIDZ expansion: the new column gets added, the existing data keeps its old parity layout until rewritten, and the pool stays online throughout. The expansion completed normally.

About five weeks later, a scrub showed the following CKSUM error:
$ sudo zpool status zfs_tank
  pool: zfs_tank
 state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
        attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
config:

        NAME                                      STATE   READ WRITE CKSUM
        zfs_tank                                  ONLINE     0     0     0
          raidz2-0                                ONLINE     0     0     0
            6a169351-6031-41d5-ad2a-9681142190c5  ONLINE     0     0     0
            a006e053-c865-4330-8861-e21e4a3e37a6  ONLINE     0     0     0
            b7d78b79-cb70-4afe-b9d0-8e4b2282fb18  ONLINE     0     0     0
            707c32af-4e4e-4fc7-b000-dd5b52f75158  ONLINE     0     0     1
            ff3c3a00-9f71-4b6b-87e6-c56deb4c6854  ONLINE     0     0     1

errors: No known data errors
One checksum error on each of two disks. The pool-level row shows zero errors, and the last line reads "errors: No known data errors", so no data was lost.
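To spot such rows without reading the whole table, a small filter can help. This is only a sketch: the function name cksum_errors is made up for illustration, and it assumes the five-column config layout shown above with plain numeric counters (zpool status -p prints exact numbers instead of abbreviated ones like 1.2K).

```sh
# Hypothetical helper: reads `zpool status` output on stdin and prints
# "<device> <count>" for every config row whose CKSUM column is non-zero.
cksum_errors() {
    # A config row has exactly five fields: NAME STATE READ WRITE CKSUM.
    # The header row is skipped because its fifth field is not a number.
    awk 'NF == 5 && $2 ~ /^[A-Z]+$/ && $5 ~ /^[0-9]+$/ && $5 > 0 { print $1, $5 }'
}
```

Usage would look like: sudo zpool status -p zfs_tank | cksum_errors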

RAIDZ2 carries two parity columns. The scrub detected the bad blocks and ZFS reconstructed them from parity, so the pool status remains healthy. But zpool status only tells you that an error happened, not whether (or how much) data was actually rewritten.

So where is the healing actually recorded?

What zpool status shows, and what it doesn’t

The four columns in zpool status map directly to four fields in the vdev_stat_t structure (include/sys/fs/zfs.h), three error counters plus the state:
- vs_read_errors
- vs_write_errors
- vs_checksum_errors
- vs_state (shown as STATE)
zpool status parses each leaf vdev’s stats and prints those four numbers. It does not print any of the other ~30 fields in the structure — including this one:
uint64_t vs_self_healed; /* total bytes self-healed */
vs_self_healed is incremented in vdev_stat_update() whenever ZFS issues a write with the ZIO_FLAG_SELF_HEAL flag set, which happens after a successful parity reconstruction. The kernel knows exactly how many bytes were healed on each disk. It just doesn’t tell you via the standard zpool status output.

Three places the heal counter does surface

1. Raw kstats (Linux only)

The OpenZFS Linux module exposes every leaf vdev’s full vdev_stat_t under /proc/spl/kstat/zfs/<pool>/. The filenames use the leaf vdev GUID, so listing the directory shows them:
$ sudo ls /proc/spl/kstat/zfs/zfs_tank/ | head
io
state
txgs
vdev_395717205876781294
vdev_4003307236673040230
vdev_7306733904703790705
vdev_803393823450321549
vdev_9021081546382363770
The 9021... and 4003... files are the two disks with errors. Inside:
$ sudo cat /proc/spl/kstat/zfs/zfs_tank/vdev_9021081546382363770
...
name              type  data
vdev_state        3     7
vdev_guid         4     9021081546382363770
read_errors       4     0
write_errors      4     0
checksum_errors   4     1
self_healed       4     4096
...
self_healed is 4096 bytes, i.e. 4 KiB. The pool’s ashift is 12 (2^12 = 4096-byte sectors), so exactly one sector-sized block was reconstructed and rewritten on this disk. The other affected disk shows the same value.
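The bytes-to-blocks arithmetic can be scripted. The sketch below assumes the three-column "name type data" layout shown in the kstat output above; the function name healed_sectors is hypothetical, and the ashift has to be passed in by hand.

```sh
# Hypothetical helper: reads a vdev kstat file on stdin and prints how many
# ashift-sized sectors the self_healed byte counter corresponds to.
healed_sectors() {
    ashift=$1
    # The third column holds the value; one sector is 2^ashift bytes.
    awk -v ashift="$ashift" '$1 == "self_healed" { print $3 / (2 ^ ashift) }'
}
```

For example: sudo cat /proc/spl/kstat/zfs/zfs_tank/vdev_9021081546382363770 | healed_sectors 12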

2. The TrueNAS middleware API

If you’re on TrueNAS, the same field comes back as JSON from the pool.query endpoint:
{
  "name": "707c32af-4e4e-4fc7-b000-dd5b52f75158",
  "stats": {
    "checksum_errors": 1,
    "self_healed": 4096,
    "read_errors": 0,
    "write_errors": 0
  }
}
This is how I first saw the number. The middleware just unpacks vdev_stat_t into JSON.

3. zpool events — the actual heal log

Counters tell you how much. To see when and where, look at the ZFS event ring buffer:
$ sudo zpool events -v zfs_tank | grep -A 30 'ereport.fs.zfs.checksum'
May 23 2026 05:42:14.823145112 ereport.fs.zfs.checksum
        class = "ereport.fs.zfs.checksum"
        ena = 0x...
        detector = (embedded nvlist)
                version = 0x0
                scheme = "zfs"
                pool = 0x6794d4c...
                vdev = 0x7d24a0...
        (end detector)
        pool = "zfs_tank"
        pool_guid = 0x6794d4cc9d3a8916
        vdev_guid = 0x7d24a0aaa18cb6ba
        vdev_type = "disk"
        vdev_path = "/dev/disk/by-partuuid/707c32af-4e4e-4fc7-b000-dd5b52f75158"
        zio_err = 0
        zio_offset = 0x...
        zio_size = 0x1000
        zio_objset = 0x...
        zio_object = 0x...
        zio_blkid = 0x...
        cksum_expected = ...
        cksum_actual = ...
Each event names the affected disk, the byte offset on that disk, the size (here 0x1000 = 4 KiB), the dataset and object, and both checksums. With this you can work out exactly which file (if any) the block belonged to. The buffer holds a limited number of events (zfs_zevent_len_max, 512 by default in current OpenZFS releases), so old events roll out unless ZED has persisted them to /var/log/zfs/zed.log.

This is the closest thing ZFS has to a “self-heal log.”
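To condense that verbose output into one line per checksum event, something like the following could work. It is a sketch under assumptions: the function name summarize_checksum_events is invented here, the parsing relies on the "key = value" layout shown above, and device paths containing spaces are not handled.

```sh
# Hypothetical helper: reads `zpool events -v` output on stdin and prints
# "<vdev_path> <size in bytes>" for each ereport.fs.zfs.checksum event.
summarize_checksum_events() {
    awk '
        # Track whether the current event is a checksum ereport.
        $1 == "class" { in_cksum = ($3 == "\"ereport.fs.zfs.checksum\"") }
        in_cksum && $1 == "vdev_path" { gsub(/"/, "", $3); path = $3 }
        in_cksum && $1 == "zio_size"  { print path, $3 }
    ' | while read -r path hexsize; do
        # printf converts the hexadecimal zio_size (e.g. 0x1000) to decimal.
        printf '%s %d\n' "$path" "$hexsize"
    done
}
```

For example: sudo zpool events -v zfs_tank | summarize_checksum_events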

What I actually had

Two disks, one block each, both healed. Different manufacturers (WD Red Plus 4 TB / Seagate IronWolf 10 TB), different uptime (6124 h / 3724 h), so a shared hardware fault was unlikely. SMART on both was clean:
$ sudo smartctl -a /dev/sde | grep -E '^( 5|197|198|199) '
  5 Reallocated_Sector_Ct   0x0033  200  200  140  Pre-fail  Always   -  0
197 Current_Pending_Sector  0x0032  200  200  000  Old_age   Always   -  0
198 Offline_Uncorrectable   0x0030  100  253  000  Old_age   Offline  -  0
199 UDMA_CRC_Error_Count    0x0032  200  200  000  Old_age   Always   -  0
Same story on the Seagate. No reallocated sectors, no pending sectors, no UDMA CRC errors.

UDMA_CRC_Error_Count is the SATA-link error counter. If a cable, backplane, or HBA channel is marginal, this is where it shows up. With both disks at zero, a fault in the data path between disk and controller is very unlikely.

What it doesn’t rule out is RAM. This system runs 32 GB of non-ECC DDR5. A single bit-flip in a write buffer leaves a bad block on disk that scrubs will keep detecting and healing on every pass. The block can stay wrong because the heal write takes the (correct) reconstructed buffer from the same RAM, which may flip again. Without ECC, you can neither fully exclude this nor measure it.

zpool clear vs zpool scrub

The counters in zpool status and vs_self_healed are cumulative since the last zpool clear (or since pool creation). A scrub does not reset them.

So when I ran a scrub after the original event, the 1s in the CKSUM column did not go away — they were the same 1s from before.
$ sudo zpool clear zfs_tank
$ sudo zpool status zfs_tank
  pool: zfs_tank
 state: ONLINE
config:

        NAME      STATE   READ WRITE CKSUM
        zfs_tank  ONLINE     0     0     0
        ...

errors: No known data errors
The status line and the per-disk counters reset together. vs_self_healed resets too. After that, the next scrub starts from zero — if the same blocks show up healed again, you know the corruption is persistent on-disk (and the suspicion shifts toward RAM); if they don’t, the original event was probably a one-shot.
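The clear-then-rescrub check can be wrapped in a small function. This is a rough sketch, not a tested procedure: recheck_pool is a hypothetical name, it needs root, and zpool scrub -w blocks until the scrub finishes, which can take many hours on a large pool.

```sh
# Hypothetical helper: reset the counters, run a fresh scrub, then report
# whether any device shows checksum errors again.
recheck_pool() {
    pool=$1
    zpool clear "$pool" || return 2
    zpool scrub -w "$pool" || return 2   # -w waits for the scrub to complete
    # Count config rows with a non-zero CKSUM column (-p prints exact numbers).
    bad=$(zpool status -p "$pool" |
        awk 'NF == 5 && $5 ~ /^[0-9]+$/ && $5 > 0 { n++ } END { print n + 0 }')
    if [ "$bad" -gt 0 ]; then
        echo "persistent: $bad device(s) show new checksum errors"
        return 1
    fi
    echo "one-shot: no new checksum errors"
}
```

Called as recheck_pool zfs_tank, a "persistent" result would shift the suspicion toward RAM, as described above.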

After the zpool clear
How I almost lost all of my data!

March 13th, 2026
This is a cautionary tale about how I nearly lost everything on my external SSD because of a moment of carelessness.

What Happened

I wanted to create a bootable USB with Ventoy to run a Linux or FreeBSD ISO. Simple enough — I’ve done it a hundred times. The problem was that I also had my external SSD connected at the same time.

I somehow selected the wrong disk. Instead of formatting the USB stick, I formatted my external SSD. Just like that — all my data was gone.

That sinking feeling when you realize what you’ve done is something I wouldn’t wish on anyone.

Recovery with PhotoRec

Thankfully, I was able to recover most of my data using PhotoRec, a free and open-source data recovery tool (currently at version 7.2, with 7.3 in beta as of January 2026). PhotoRec ignores the filesystem and goes after the underlying data, so it works even after formatting. It can recover over 480 file formats.

Install it (it comes with TestDisk):
```
# Arch Linux
sudo pacman -S testdisk

# FreeBSD
pkg install testdisk
```
Run it:
```
sudo photorec /dev/sdX
```
PhotoRec will scan the disk and recover files into a directory of your choice. It recovered most of my files, though filenames and directory structure were lost — everything gets sorted by file type.

Lessons Learned
1. Always double-check the target disk. Run lsblk before any destructive operation. Verify the disk size and partitions match what you expect. Ventoy (currently at v1.1.10) shows disk names and sizes — take the extra second to verify.
2. Disconnect drives you don’t need. If you’re formatting a USB, unplug your external drives first. It takes 5 seconds and can save you hours of recovery.
3. Follow the 3-2-1 backup rule: 3 copies, 2 different media, 1 offsite. If I had a proper backup, the accidental format would have been a minor inconvenience instead of a disaster.
4. ZFS snapshots are your friend. On my TrueNAS server, I now run automatic snapshots. Even if something goes wrong, I can roll back instantly.
5. Keep PhotoRec/TestDisk installed. You never know when you’ll need it. Better to have it ready than to scramble in a panic.
Don’t be like me. Disconnect your drives, check twice, and back up your data. Your future self will thank you.
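Lesson 1 can be turned into a guard that runs before any destructive command. The sketch below is only an illustration: check_target_size is a made-up name, and the size limit is something you pick yourself. It compares sizes rather than the transport type because an external SSD and a USB stick both show up as USB devices.

```sh
# Hypothetical guard: refuse to continue if the target disk is larger than
# the USB stick you expect to be writing to.
check_target_size() {
    dev=$1
    max_gib=$2
    # -b: size in bytes, -d: the disk itself (no partitions), -n: no header
    bytes=$(lsblk -b -d -n -o SIZE "$dev" | tr -d '[:space:]')
    [ -n "$bytes" ] || return 2
    limit=$((max_gib * 1024 * 1024 * 1024))
    if [ "$bytes" -gt "$limit" ]; then
        echo "refusing: $dev is larger than ${max_gib} GiB" >&2
        return 1
    fi
    echo "ok: $dev is within ${max_gib} GiB"
}
```

For a 64 GB stick, check_target_size /dev/sdX 128 would pass, while a 1 TB external SSD on the same device name would be rejected.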
SeaweedFS 4.06

January 12th, 2026
What’s Changed
- Helm Chart
  - chart: Set admin metrics port to http port by @sheyabernstein in #7936
  - fix: Invalid volume mount conditional in filer template by @nichobi in #7992
- S3 API
  - Fix S3 list objects marker adjustment for delimiters by @chrislusf in #7938
  - fix: directory incorrectly listed as object in S3 ListObjects by @chrislusf in #7939
  - Refine Bucket Size Metrics: Logical and Physical Size by @chrislusf in #7943
  - Fix AWS SDK Signature V4 with STS credentials (issue #7941) by @chrislusf in #7944
  - fix: correcting S3 nil cipher dereference in filer init by @tjasko in #7952
  - Support AWS standard IAM role ARN formats (issue #7946) by @chrislusf in #7948
  - s3api: fix authentication bypass and potential SIGSEGV (Issue #7912) by @chrislusf in #7954
  - store S3 storage class in extended atrributes #7961 by @ravenschade in #7962
  - fix: handle range requests on empty objects (size=0) by @chrislusf in #7963
  - Fix trust policy wildcard principal handling by @chrislusf in #7970
  - Support Policy Attachment for Object Store Users by @chrislusf in #7981
  - Fix STS identity authorization by populating PolicyNames (#7985) by @chrislusf in #7986
  - Fix: ListObjectVersions delimiter support by @chrislusf in #7987
  - Fix STS authorization in streaming/chunked uploads by @chrislusf in #7988
  - fix(s3api): ensure S3 configuration persistence and refactor authorization tests by @chrislusf in #7989
- Misc
  - Standardize -ip.bind flags to default to empty and fall back to -ip by @chrislusf in #7945
  - Fix unaligned 64-bit atomic operation on ARM32 (#7958) by @aimmac23 in #7959
  - Fix flaky EC integration tests by collecting server logs on failure by @chrislusf in #7969
  - test: fix EC integration test needle blob mismatch by @chrislusf in #7972
  - chore: execute goimports to format the code by @promalert in #7983
- Filer
  - fix(gcs): resolve credential conflict and improve backup logging by @chrislusf in #7951
  - Fix jwt error in Filer pod (k8s) by @MorezMartin in #7960
  - Fix chown Input/output error on large file sets by @chrislusf in #7996
- Admin
  - fix: EC UI template error when viewing shard details by @chrislusf in #7955
  - Fix special characters in admin-generated secret keys by @chrislusf in #7994
- FUSE Mount
  - Fix: prevent panic when swap file creation fails by @LeeXN in #7957
  - Enable writeback_cache and async_dio FUSE options by @chrislusf in #7980
- Mini
  - feat: add flags to disable WebDAV and Admin UI in weed mini by @chrislusf in #7971
- Volume Server
  - storage/needle: add bounds check for WriteNeedleBlob buffer by @chrislusf in #7973
  - opt: reduce ShardsInfo memory usage with bitmap and sorted slice by @chrislusf in #7974
https://github.com/seaweedfs/seaweedfs/releases/tag/4.06
Meme I found in the TrueNAS forums on September 20th

December 25th, 2025
November 5th, 2025
Brace yourselves for an incoming rise in SSD and HDD prices

October 31st, 2025

https://www.tomshardware.com/pc-components/storage/expect-hdd-ssd-shortages-as-ai-rewrites-the-rules-of-storage-hierarchy-multiple-companies-announce-price-hikes-too
StorJ Backup Task on TrueNAS

October 14th, 2025

I joined StorJ, a distributed cloud storage platform, to back up approximately 1 TB of data. StorJ uses strong encryption and distributes pieces of each file across multiple nodes worldwide.

The first step is to set up an iX-StorJ account and create a bucket. The Starter Pack plan costs $150 per year and includes up to 5 TB of storage.

To back up your data to the bucket, you need to create an access key and then import it as a Cloud Credential.

https://storj.dev/
October 7th, 2025
RAID 5 vs RAID 6 ON YOUR NAS – PROs and CONs

September 21st, 2025
ZFS Basecamp

September 13th, 2025

An initiative by Klara Inc. to launch a webinar with the most experienced developers in the ZFS storage industry.
Synology to TrueNAS backup

September 3rd, 2025

A very useful video on setting up an rsync task to back up your Synology to a TrueNAS server.
Storage media life expectancy

August 25th, 2025
Refurbished disks?

August 23rd, 2025
30 TB HDDs are a curse?

August 12th, 2025