Description
I am using Margo in my distributed cache program, which uses Linux FUSE to mount a shared filesystem directory on a local directory and accelerates file access with a local NVMe SSD. The program consists of clients (Margo clients) and servers (Margo servers) running on multiple nodes, but its overall performance is poor. To analyze the performance issue, I enabled Margo monitoring and got some *stats.json files. I tried to analyze the *stats.json files and diagnose the bottlenecks by following the instructions listed in mochi-hpc/mochi-performance-analysis, but I still don't know how to tune Margo to fix the performance issue. Could you help analyze the issue and give me some advice?
The following *stats.json snippets were generated from a dd read test: dd of=/dev/null if=/mountpoint/4gfile bs=128K count=32768. Here /mountpoint is the FUSE mountpoint of the cache program, and 4gfile is already in the local cache (NVMe SSD), so the client (Margo client) actually reads the file data from the server (Margo server) on the NVMe SSD.
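For readers who want to repeat this kind of analysis, here is a minimal sketch of how a stats file could be ranked by accumulated time. It makes no claim about the exact Margo monitoring schema: the only assumption is that each timing statistic is stored as a JSON object with a numeric "sum" field. The script walks the whole file and lists the largest sums together with the path of keys that leads to them. The function names collect_sums and rank_by_sum are hypothetical helpers, not part of Margo.

```python
import json
import sys


def collect_sums(node, path=()):
    """Yield (path, total) for every nested object that has a numeric "sum" entry.

    Assumption: timing statistics are stored as objects with a "sum" field.
    """
    if isinstance(node, dict):
        total = node.get("sum")
        if isinstance(total, (int, float)) and not isinstance(total, bool):
            yield "/".join(path), float(total)
        for key, child in node.items():
            yield from collect_sums(child, path + (str(key),))
    elif isinstance(node, list):
        for index, child in enumerate(node):
            yield from collect_sums(child, path + (str(index),))


def rank_by_sum(stats, top=10):
    """Return the `top` (path, total) pairs with the largest totals."""
    ranked = sorted(collect_sums(stats), key=lambda item: item[1], reverse=True)
    return ranked[:top]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python rank_stats.py <file>.stats.json
    with open(sys.argv[1]) as handle:
        for path, total in rank_by_sum(json.load(handle)):
            print(f"{total:14.6f}  {path}")
```

Running it on the client and server files separately should show which step dominates on each side, as long as the "sum" assumption holds for the file at hand.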
The client (origin) side shows that iforward_wait accounts for most of the latency:
The server (target) side shows that irespond:relative_timestamp_from_ult_start accounts for most of the time:
Thank you in advance!


