Usually going via kernel nfs client will use up more memory bandwidth. I would expect lower per-client numbers. From what I've read you go from 3 memcopies on userspace to 4 with kernel nfs.
I haven't yet instrumented memory bandwidth on my amd machines, but it feels like I'm at the limit.
I haven't yet instrumented memory bandwidth on my amd machines, but it feels like I'm at the limit.