The CellServDB processing in auth/cellconfig and WINNT/afsd/cm_config
differ in one important way. cellconfig uses the IP addresses specified
in the CellServDB file and ignores the host name; whereas cm_config
calls gethostbyname() on the host name and falls back to the specified
address only if gethostbyname() fails.
This commit modifies cellconfig to use gethostbyname() calls on the
host names when the client CellServDB file is being used. The server
CellServDB file is left alone because ubik servers need to be given
the exact list of ip addresses specified by the administrator and
there is no method of indicating clones to other servers using DNS.
This change permits a greater resilency to cell server configuration
changes due to IP address renumbering as the DNS names in the CellServDB
file can be CNAME or A records.
A host name can be resolved to multiple IP addresses and all of the
addresses will be added to the hostAddr list for the cell provided
that they do not exceed the maximum number of hosts. If the same
host name is listed multiple times the duplicate IP addresses will
be ignored.
The behavior of all tools that use CellServDB files on Windows will now
be consistent.
When processing RX_CALL_TIMEOUT, if there is no fid specified its a vldb
rpc and we should retry. If there is a fid and the volume is a readonly
volume then we should also retry. Only fail if the request has no alternate
server to look at.
When processing RX_CALL_DEAD, log the fact that the call is dead.
When processing any other RX error between -2 and -64, do not force
a new connection to the same server. Mark the server down and retry
with a new server if possible.
When inserting a new vldb server, if the rank is 0 (unknown), do not
override the default rank assigned by cm_NewServer(). Doing so erases
the randomized ranking.
Turn SRXAFSCB_InitCallBackState3() into the primary function for
processing InitCallBackStateX RPCs. InitCallBackState3 accepts an
optional server uuid which can be used instead of the IP address
to locate a matching server object. If the RPC comes from a server
that we know of, we can restrict the calback state initialization
to cm_scache_t objects on that file server. If the request comes
from a server that we know nothing about and a uuid was specified,
we can ignore it.
During callback revocation, make use of the cell associated with
the server to restrict the revocation to cm_scache_t objects in
the one cell. This is especially important for readonly volume
callbacks since volume ids are shared across multiple cells.
Add a missing case where the cm_volume_t cbExpiresRO time should
be set to the cm_scache_t cbExpires time.
Add cm_FindServerByUuid(). This function is similar to cm_FindServerByIP()
but uses the server's known uuid instead which is constant across multi-
homed systems.
cm_cellLock protects the cm_cell_t fields allNextp, nameNextp, idNextp,
and freeNextp. Therefore, a write lock must be obtained whenever those
items may change. This patch makes that consistent.
This patch also fixes an out of order lock acquisition and removes
cm_cell_t objects from the id and name hash tables before freeing them.
If there are dirty buffers when the file server becomes unresponsive,
the cache manager will attempt to write them repeatedly resulting in
use of 100% of the cpu of the machine until the file server becomes
responsive.
This patch reduces the cpu utilization by ensuring that only the first
in a list of buffers on the same file needs to fail with timed out,
all down, all busy, all offline, or clock skew. The other dirty buffers
will just be skipped. A small delay is enforced in the buf_IncrSync
thread between invocation.
This patch is not a complete fix for the problem. The buf_IncrSync
thread needs to become more intelligent in order to avoid attempts to
write to volumes that are known to be inaccessible.
curpag needs to know about kernel constructs (getpagvalue on AIX, onegroup
versus two group on linux) and on aix 5.1 simply can't work. add a new pioctl
and use it to simply ask the kernel what the current pag is
Updates to chapter one of the Admin Guide. Remove references to the
Authentication Server, add references to a Kerberos server, revise ntpd
parts to reflect the fact that OpenAFS doesn't ship ntpd, and removed
the distinction between the US and non-US versions of the Update Server.
rxi_Findcbi, rxi_FIndIfnet, rxi_FindIfMTU "failure" end up returning
the RX_REMOTE_PACKET_SIZE as the mtu to use unless we allow our override
to apply, so we do that. then, add an afsd switch to allow setting it.
afsd man page update required and will follow.
====================
This delta was composed from multiple commits as part of the CVS->Git migration.
The checkin message with each commit was inconsistent.
The following are the additional commit messages.
====================
LICENSE IPL10
FIXES 124880
only valid addr/port pairs are registered in the hash table.
add then remove when changing addresses.
make host restoral properly hash hosts.
remove should remove the address we asked for and not simply the
primary address.
ForceDNS is implied non-zero if there are no [servername]
keys under the [cellname] key. Otherwise, ForceDNS is zero.
If [servername] keys are specified and none of them evaluate
to a valid server configuration, the return code is success.
This prevents failover to the CellServDB file or DNS.
Registry distributed info takes precedence over the CellServDB file.
Registry support has been added to both the Windows specific cm_config
interface and the auth/cellconfig interface utilized by aklog, the
services, and the vast majority of support commands.
Enhance the DNS lookup for Cell vlserver info to support ranking info
which is used with _vlserver._udp SRV record lookups when AFSDB records
are not present. Priorities become ranks.
libafsconf.dll has been present since OpenAFS 1.0. However,
for some unknown reason the components that it consists of
(cm_dns.obj, cm_config.obj, cm_nls.obj) have been staticly
linked into exes and dlls all over the code base. This commit
removes all of the static references and replaces them with
libafsconf.lib.
libafsconf.dll is also moved from Client\Program to Common
because it is now linked to by server and utility components.
A flush operation on a directory will result in the btree being destroyed.
This can race with on-going operations. Make sure that the dirlock is held
if the btree is destroyed. Otherwise, just invalidate the btree version
number.
====================
This delta was composed from multiple commits as part of the CVS->Git migration.
The checkin message with each commit was inconsistent.
The following are the additional commit messages.
====================
LICENSE MIT
FIXES 124787
avoid either reopening closed vnodes and leaving cached descriptors around,
or discarding a reference we're not holding; instead, sync changes when the
fd is closed, and note such has been done; otherwise, no changes from older
code.
Replace version info in the DocBook files with a new ENTITY "version"
associated with a local "version.xml" file which contain a <revision>
tag for the current release.
The version.xml file should be autogenerated by the Makefile system.
Standardize the UNIX Makefiles for all of the DocBook guides. Remove the
rest of the generated files and switch to xsltproc and dblatex for the
document generation in all cases. Fix a few DocBook errors by removing
the contents of the <index> tag and removing the unknown <pubsnumber> tag
in the <revision> field.
Use dblatex to build PDF documentation instead of docbook2html and xsltproc
to build HTML instead of docbook2html. Remove all the index generation
logic, since dblatex and xsltproc handle that automatically. Remove the
contents of the <index> tag in the source, since neither program requires
there be anything in there.
Remove the style sheets and configuration that were used for docbook2*.