
virtio/fs/macos: preload directory entries to avoid calling telldir() #548

Merged

slp merged 1 commit into containers:main from pftbest:preload-dir-entries on Feb 19, 2026

Conversation

Contributor

@pftbest pftbest commented Feb 13, 2026

This PR depends on #544 and includes its commits. I can rebase when 544 is merged.

I was doing some performance measurements on filesystem operations and noticed that the telldir() call is very heavy on macOS.

In some of my synthetic tests it could take up to 40% of total CPU cycles on the host:
[Screenshot 2026-02-12 at 22:02:57]

The proposed fix is to avoid calling telldir() at all and preload all dir entries ahead of time on the first call to do_readdir().

The memory impact: in the worst case each entry would be at most 1K in size, so a plain directory with 100k files would require 100M of host memory.

However, each call to telldir() also allocates memory internally via malloc() and stores it inside the DIR stream. We call it for each dir entry, so the actual difference should be smaller in practice.
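The indexing idea can be sketched in safe Rust. This is a minimal sketch, not the PR's code: the actual patch walks the raw DIR* via libc and also caches inode and type information; the names DirStream, CachedEntry, entry_at, and the demo paths are all illustrative.

```rust
use std::fs;
use std::io;

/// Cached directory entry; the real code also stores inode and
/// type info (struct and field names here are illustrative).
struct CachedEntry {
    name: Vec<u8>,
}

/// All entries are read once up front; a readdir "offset" is then
/// just an index into this vector, so telldir() is never needed.
struct DirStream {
    entries: Vec<CachedEntry>,
}

impl DirStream {
    fn load(path: &std::path::Path) -> io::Result<Self> {
        let mut entries = Vec::new();
        for entry in fs::read_dir(path)? {
            entries.push(CachedEntry {
                name: entry?.file_name().into_encoded_bytes(),
            });
        }
        Ok(DirStream { entries })
    }

    /// Serve the entry at `offset`; None marks end of directory.
    fn entry_at(&self, offset: usize) -> Option<&CachedEntry> {
        self.entries.get(offset)
    }
}

/// Build a tiny directory, preload it, and return the entry count.
fn demo() -> io::Result<usize> {
    let dir = std::env::temp_dir().join("preload_demo");
    fs::create_dir_all(&dir)?;
    fs::write(dir.join("a.txt"), b"")?;
    fs::write(dir.join("b.txt"), b"")?;

    let stream = DirStream::load(&dir)?;
    // Offsets are plain indexes: revisiting an entry costs no syscall.
    assert!(stream.entry_at(0).is_some());
    assert!(stream.entry_at(stream.entries.len()).is_none());
    Ok(stream.entries.len())
}

fn main() -> io::Result<()> {
    println!("cached {} entries", demo()?);
    Ok(())
}
```

Unlike a live DIR stream, the vector can be indexed repeatedly and out of order without any telldir()/seekdir() bookkeeping, which is the property the patch exploits.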

Unfortunately, after I made this change I did not see significant time savings on the guest side. Yes, the host cycles were reduced, but most of the time is spent waiting on communication between host and guest, so I only see a 5-10% improvement on the guest side.

For example, running `time make clean` on the Linux kernel source code went from around 12s to 11s.

Trace for `make clean` without this patch; do_readdir is at 24% of cycles:

[Screenshot 2026-02-13 at 12:44:08]

Trace for `make clean` with this patch; do_readdir is at 9% of cycles:

[Screenshot 2026-02-13 at 12:39:36]

telldir() is a very heavy call on APFS, but we have to invoke it
on each entry in the directory to handle offsets correctly.

The alternative is to preload all dir entries at the start and use
normal indexes as offsets.

The memory impact: In the worst-case scenario each entry would be
around 1K in size. A plain directory with 100k files would require
100M in host memory.

However each call to telldir() also allocates memory internally via
malloc() and we call it on each dir entry, so the actual difference
should be smaller than that in practice.

Signed-off-by: Vadzim Dambrouski <pftbest@gmail.com>
@pftbest pftbest force-pushed the preload-dir-entries branch from f3f2d49 to 9c59116 on February 17, 2026 at 14:20
Collaborator

@dorindabassey dorindabassey left a comment
LGTM, Thanks!

Collaborator

@mtjhrc mtjhrc left a comment

LGTM too, thanks!

Collaborator

mtjhrc commented Feb 18, 2026

/gemini review

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request aims to improve filesystem performance on macOS by pre-loading directory entries to avoid expensive telldir() calls. The implementation replaces the previous directory stream handling with a caching mechanism, which is a sound approach to address the identified performance bottleneck. The code is generally well-written, but I have identified a couple of areas for improvement related to lock contention and code clarity in an unsafe block. My review comments provide specific suggestions to address these points.

```rust
name.push(*c as u8);
if !ds.ready {
    // Fill the cache on first call
    if let Err(e) = ds.fill_from_fd(data.file.write().unwrap().as_raw_fd()) {
```

Severity: high

The fill_from_fd function only needs a file descriptor to read directory entries; it doesn't modify the file itself. Using a read lock (read()) instead of a write lock (write()) is sufficient and avoids unnecessary lock contention, which can improve performance.

Suggested change:

```diff
-if let Err(e) = ds.fill_from_fd(data.file.write().unwrap().as_raw_fd()) {
+if let Err(e) = ds.fill_from_fd(data.file.read().unwrap().as_raw_fd()) {
```

Comment on lines +126 to +135:

```rust
let name = unsafe {
    let name_len = usize::from((*dentry).d_namlen);
    let name_ptr = (*dentry).d_name.as_ptr().cast();
    let name = std::slice::from_raw_parts(name_ptr, name_len);

    if name == b"." || name == b".." {
        continue;
    }
    Box::<[u8]>::from(name)
};
```

Severity: medium

For better maintainability, consider refactoring this block to minimize the scope of unsafe code. The unsafe block is only needed to create a slice from the raw pointer; the subsequent logic for filtering and boxing the name can be performed in safe code.

Suggested change:

```diff
-let name = unsafe {
-    let name_len = usize::from((*dentry).d_namlen);
-    let name_ptr = (*dentry).d_name.as_ptr().cast();
-    let name = std::slice::from_raw_parts(name_ptr, name_len);
-    if name == b"." || name == b".." {
-        continue;
-    }
-    Box::<[u8]>::from(name)
-};
+let name = {
+    let name_slice = unsafe {
+        let name_len = usize::from((*dentry).d_namlen);
+        let name_ptr = (*dentry).d_name.as_ptr().cast();
+        std::slice::from_raw_parts(name_ptr, name_len)
+    };
+    if name_slice == b"." || name_slice == b".." {
+        continue;
+    }
+    Box::<[u8]>::from(name_slice)
+};
```

Collaborator

slp commented Feb 18, 2026

I think both of Gemini's suggestions are quite reasonable.

Contributor Author

pftbest commented Feb 18, 2026

@slp Sorry, but I don't agree with its conclusion on the first one. I specifically rely on the write lock being held for the full duration of the fill_from_fd function, because it updates the internal position of the original fd. All duped fds share the same position internally, so running 2 instances of fill_from_fd on the same fd would lead to corrupted data.

I'm not quite sure whether libkrun uses a single thread or multiple threads to handle fs requests, but if it's single-threaded then read() vs write() doesn't matter, and if it's multithreaded then write() is correct here.

The second suggestion I can apply, but I will need to move the safety comment too.
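The shared-position behavior behind this argument can be checked with a small standalone program. This is a hedged sketch using std::fs on a regular file rather than the PR's directory fds; the file name is illustrative:

```rust
use std::fs::File;
use std::io::{self, Read};

/// Read two bytes through the original handle, then two more
/// through a dup()ed handle, and return both chunks.
fn demo() -> io::Result<([u8; 2], [u8; 2])> {
    let path = std::env::temp_dir().join("shared_offset_demo.txt");
    std::fs::write(&path, b"abcdef")?;

    let mut orig = File::open(&path)?;
    // try_clone() calls dup(2): both handles share one open file
    // description in the kernel, including the current offset.
    let mut dup = orig.try_clone()?;

    let mut first = [0u8; 2];
    orig.read_exact(&mut first)?; // advances the shared offset to 2

    let mut second = [0u8; 2];
    dup.read_exact(&mut second)?; // resumes at offset 2, not at 0
    Ok((first, second))
}

fn main() -> io::Result<()> {
    let (first, second) = demo()?;
    assert_eq!(&first, b"ab");
    // If the dup'd fd had an independent offset, this would be "ab" again.
    assert_eq!(&second, b"cd");
    println!("shared offset confirmed");
    Ok(())
}
```

Because dup'd descriptors share one offset like this, two concurrent fill_from_fd runs on the same directory fd would interleave their position updates, which is why a read lock is not sufficient here.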

Collaborator

mtjhrc commented Feb 18, 2026

> @slp I specifically rely on the write lock to be locked for the full duration of the fill_from_fd function because it updates internal position of the original fd. All duped fd share the same position internally so running 2 instances of fill_from_fd on the same fd will lead to corrupted data.

Makes sense

> Second suggestion I can apply, but will need to move the safety comment too.

Looking at it again, the way you have written it is maybe safer (it is very clear what the lifetime of the slice is).

Feel free to ignore Gemini.

@slp slp merged commit 54da12a into containers:main on Feb 19, 2026
11 checks passed
