Linux NFS server reliability?

This is a discussion on Linux NFS server reliability? within the Linux Networking forums, part of the Linux Forums category; Hi,there, We are using two linux clusters. Both of them have a similiar NFS problem. The fileserver has a ...


Go Back   Usenet Forums > Linux Forums > Linux Networking

FAQ Members List Calendar Search Today's Posts Mark Forums Read
  #1 (permalink)  
Old 04-13-2004
Shi Jin
 
Posts: n/a
Default Linux NFS server reliability?

Hi,there,

We are using two linux clusters. Both of them have a similiar NFS
problem. The fileserver has a RAID attached to it and it is exporting
the /home as NFS to many linux client nodes. Whenever the clients are
trying to write too much data to the NFS /home or when the somebody is
doing a lot of IO on the fileserver itself, the writes on the clients
to the NFS /home will frequently leads to an error. We tried to tune
the parameters but still have the same problem. The current mounting
parameters are
rw,nosuid,soft,intr,rsize=8192,wsize=8192
And the NFS server exporting parameters are simple:
rw,sync,root_squash

Could anybody tell me whether this is our own problem or it is common
to all linux NFS servers. Is there a way to solve it?
Thank you very much.

Shi Jin
Reply With Quote
  #2 (permalink)  
Old 04-14-2004
Joshua Baker-LePain
 
Posts: n/a
Default Re: Linux NFS server reliability?

In article <e5cbf201.0404131247.28cd9e06@posting.google.com >, Shi Jin wrote:
>
> We are using two linux clusters. Both of them have a similiar NFS
> problem. The fileserver has a RAID attached to it and it is exporting
> the /home as NFS to many linux client nodes. Whenever the clients are
> trying to write too much data to the NFS /home or when the somebody is
> doing a lot of IO on the fileserver itself, the writes on the clients
> to the NFS /home will frequently leads to an error. We tried to tune


What is the error you're getting? What hardware do you have on
the server side? What is your network setup?

> the parameters but still have the same problem. The current mounting
> parameters are
> rw,nosuid,soft,intr,rsize=8192,wsize=8192


The 'hard' mount option is preferred over 'soft', but that's a bit of
a religous issue.

> And the NFS server exporting parameters are simple:
> rw,sync,root_squash
>
> Could anybody tell me whether this is our own problem or it is common
> to all linux NFS servers. Is there a way to solve it?


The Linux NFS HOWTO has a whole chapter on tuning:

http://nfs.sourceforge.net/nfs-howto/

Not knowing what exact error you're getting, I can can only guess. One
thing to check is to make sure you have enough server threads. By default,
e.g., Red Hat only launches 8 nfsd threads, which is definitely not enough
for a busy server. Check the output of 'cat /proc/net/rpc/nfsd'. Look
at the output of the 4th line, which should look something like this:

th 256 846 2422.580 993.510 436.910 195.680 123.210 54.990 36.530 27.830 19.010 32.410

The first number is the number of server threads (256 in this case). The
second number is the most important one. That's the number of times
requests have had to be queued because all the threads were busy. In
this case, I had been using 128 threads, saw that 846, and then upped the
threads to 256 (without rebooting and thus without resetting the statistics).
The number hasn't increased since then. The rest of the numbers represent
the amount of time a particular percentage of the threads have been busy
at the same time. I.E., 10% of the threads were busy for 2422.58 units (I'm
not sure of the units), 20% were busy for 993.51, etc. Having
the last few numbers be low indicates that you've got enough threads.

--
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University
Reply With Quote
  #3 (permalink)  
Old 04-15-2004
Shi Jin
 
Posts: n/a
Default Re: Linux NFS server reliability?

Thank you very much.
I think my server threads are not enough. But how can I increase this
number?
I am using Redhat, unfortunately.:)
Thanks.

Shi Jin

On Wed, 14 Apr 2004 14:07:25 +0000, Joshua Baker-LePain wrote:

> In article <e5cbf201.0404131247.28cd9e06@posting.google.com >, Shi Jin wrote:
>>
>> We are using two linux clusters. Both of them have a similiar NFS
>> problem. The fileserver has a RAID attached to it and it is exporting
>> the /home as NFS to many linux client nodes. Whenever the clients are
>> trying to write too much data to the NFS /home or when the somebody is
>> doing a lot of IO on the fileserver itself, the writes on the clients
>> to the NFS /home will frequently leads to an error. We tried to tune

>
> What is the error you're getting? What hardware do you have on
> the server side? What is your network setup?
>
>> the parameters but still have the same problem. The current mounting
>> parameters are
>> rw,nosuid,soft,intr,rsize=8192,wsize=8192

>
> The 'hard' mount option is preferred over 'soft', but that's a bit of
> a religous issue.
>
>> And the NFS server exporting parameters are simple:
>> rw,sync,root_squash
>>
>> Could anybody tell me whether this is our own problem or it is common
>> to all linux NFS servers. Is there a way to solve it?

>
> The Linux NFS HOWTO has a whole chapter on tuning:
>
> http://nfs.sourceforge.net/nfs-howto/
>
> Not knowing what exact error you're getting, I can can only guess. One
> thing to check is to make sure you have enough server threads. By default,
> e.g., Red Hat only launches 8 nfsd threads, which is definitely not enough
> for a busy server. Check the output of 'cat /proc/net/rpc/nfsd'. Look
> at the output of the 4th line, which should look something like this:
>
> th 256 846 2422.580 993.510 436.910 195.680 123.210 54.990 36.530 27.830 19.010 32.410
>
> The first number is the number of server threads (256 in this case). The
> second number is the most important one. That's the number of times
> requests have had to be queued because all the threads were busy. In
> this case, I had been using 128 threads, saw that 846, and then upped the
> threads to 256 (without rebooting and thus without resetting the statistics).
> The number hasn't increased since then. The rest of the numbers represent
> the amount of time a particular percentage of the threads have been busy
> at the same time. I.E., 10% of the threads were busy for 2422.58 units (I'm
> not sure of the units), 20% were busy for 993.51, etc. Having
> the last few numbers be low indicates that you've got enough threads.


Reply With Quote
  #4 (permalink)  
Old 04-15-2004
Joshua Baker-LePain
 
Posts: n/a
Default Re: Linux NFS server reliability?

In article <pan.2004.04.15.04.08.09.397266@hotmail.com>, Shi Jin wrote:
> Thank you very much.
> I think my server threads are not enough. But how can I increase this
> number?
> I am using Redhat, unfortunately.:)


That's all we use here (of course, Red Hat itself is about 15min away...).
With a recent version of RedHat, put the following line in
/etc/sysconfig/nfs:

RPCNFSDCOUNT=128

replacing 128 with however many threads you want to run. Then do
'/etc/init.d/nfs restart'.

--
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University
Reply With Quote
  #5 (permalink)  
Old 04-15-2004
Rex Dieter
 
Posts: n/a
Default Re: Linux NFS server reliability?

Shi Jin wrote:

> I think my server threads are not enough. But how can I increase this
> number?
> I am using Redhat


Create/edit
/etc/sysconfig/nfs
to contain
RPCNFSDCOUNT=8

(The default # threads is 8, so edit to suit your needs).

And restart the nfs service,
/sbin/service nfs restart

-- Rex
Reply With Quote
Reply
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are Off
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT +1. The time now is 01:46 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO 3.0.0