Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 24 Feb 1999 23:37:55 -0500 (EST)
From:      tom@tomqnx.com (Tom Torrance at home)
To:        brian@Awfulhak.org (Brian Somers)
Cc:        tom@tomqnx.com, hackers@FreeBSD.ORG, current@FreeBSD.ORG
Subject:   Re: Missing files/directories
Message-ID:  <m10FsYd-000I5dC@TomQNX.tomqnx.com>
In-Reply-To: <199902230822.IAA00707@keep.lan.Awfulhak.org> from Brian Somers at "Feb 23, 1999  8:22:47 am"

next in thread | previous in thread | raw e-mail | index | archive | help

--ELM919917475-1022-0_
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

Hi Brian,

It was a good thought, but we can't put the blame on bad hardware.

These tests were done on the RELENG_3 system cvsup'd 
as of Feb 22 @ 20:00 EST. All tests were run internal to the
same machine. So that I don't remain the only guy in the world
to see these test results, Control files are included so you 
can test locally:-)

"ppp0 -direct" on localhost is started by port 6671.

I know (now) that setting up the test this way the ppp's were
communicating via localhost rather than the tunnel, but this way
was much cleaner as far as verifying exactly how close the results
were to what I saw running the server under 2.2-stable. There were
differences, but the main issues are demonstrated.

You will recall our discussion about the server hanging around
under 2.2-stable after the client is terminated? Required by the
RFCs you said? Under RELENG_3 the server meekly goes away, which
makes sense to me.

Two tests were done. The first involved "kill -KILL clientpid".
The second was "kill -TERM clientpid".
In the first test, the server illegally removed the default route.
In the second test, the server did the same - neither ppp actioned
the second command in the linkdown scripts.

I was surprised that the first test ended immediately - I thought
the LQR packets would cause the server to terminate after 1 minute.

Files:
test1.netstat0	shows routing after boot
test1.netstat1	shows routing after "ppp -background testloop"
test1.psaxl	show ps results for the executing processes.
test1.netstat2	shows routing after killing the client.
test1.tun0	ifconfig while active.
test1.tun1	ifconfig while active.
test2.netstat	routing tables after terminating the client.
Logs are supplied for both tests.

I hope that this is very helpful to you. I really appreciate
your efforts!!

Cheers,
Tom

> Hi,
> 
> I don't claim to know a great deal about cache code etc, but I'm 
> pretty sure that it's extremely unlikely that the file name has any 
> chance of affecting the buffer cache.  While NFS has its fair 
> share of problems (with which Matt is dealing with admirably), I 
> would think that the code that does the work there is equally unlikely 
> to know anything about file names.
> 
> Having said all that in as vague a way as possible, the reason I'm 
> posting this is that you seem to be experiencing difficulties with 
> ppp that are of a similar nature - that is, completely inexplicable 
> and unseen by anyone else - disappearing default routes, ppp.linkdown 
> not being processed,
> 
> I'm beginning to suspect a hardware problem - perhaps with your disk 
> controller or something.  This wouldn't easily explain the default 
> route problem, but may explain the failure to process ppp.linkdown....
> 
> Maybe you could try treating the other machine (your son's machine?) 
> as the gateway, and see if things become more stable.  If they do, 
> the finger might be pointed more firmly at hardware.
> 
> > On the weekend I reported to hackers about problems experienced with
> > 2.2-stable and RELENG-3 systems where I experienced files that
> > disappeared from cache and Mail directories that disappeared.
> > The RELENG-3 system had files affected with softupdates enabled.
> > The 2.2-stable system had sub-directories missing from the
> > same directories that I was writing to via nfsv2.
> > 
> > By coincidence, I had cvsup'd and compiled new kernels and naturally
> > made the assumption that there was causality there. Subsequently
> > I have come to believe that the problem may have more to do with what 
> > I was doing, not changes to the code.
> > 
> > For about 3-4 hours prior to noticing the problems, I had been 
> > repetitively editing dot files, then writing a kludge of dot files
> > to the local system hard drive and to the nfs exported FS of the
> > other computer, while occasionally checking mail on that computer. 
> > 
> > All files and directories missing were being updated for
> > one reason or another by myself or by mail processes while
> > I was doing this.
> > 
> > It is speculation, but there is a good chance that there is a bug
> > in the cache-handling code that causes problems with other files
> > or directories being dropped from cache because of bad processing
> > common to BOTH or ALL releases, when large numbers of dot files are
> > being written. The dot files themselves did not disappear - other
> > items to be written disappeared before their writes actually 
> > occurred. 
> > 
> > I know that this is a frustrating kind of message to receive, but
> > I am not a developer & not qualified to go into the code myself.
> > Also no logs or hard output are available - files/directories
> > simply disappeared without any error messages.
> > 
> > I just did a scan of the entire /usr/src/sys tree for \"\\.\"
> > and \'\\.\' to see what code sections might be affected - mostly
> > cache-handling. In quantity, not bad, really.
> > 
> > Others have apparently reported missing files to do with nfs
> > I believe. THis might or might not be a related problem.
> > 
> > I guess that I am asking someone who is qualified, and concerned
> > about missing files or directories, if they would be willing
> > to do what I cannot - check the code for bad interactions when
> > dot files are being written- bearing in mind that it is OTHER
> > files/directories that are disappearing from cache before being
> > written.
> > 
> > Is anyone out there sufficiently intrigued by the possibility
> > to invest some valuable time?
> > 
> > I am a QA tester, not a developer, and therefore much more
> > comfortable with discussion of symptoms and speculative
> > causality than most developers I have known. I hope that
> > someone thinks enough of the possibility to invest some 
> > time, which I know is in very short supply. I cannot deny
> > that this is (informed) speculation - there are no guarantees.
> > 
> > Regards and best wishes,
> > Tom
> 
> -- 
> Brian <brian@Awfulhak.org> <brian@FreeBSD.org> <brian@OpenBSD.org>
>       <http://www.Awfulhak.org>;
> Don't _EVER_ lose your sense of humour !
> 
> 
> 


--ELM919917475-1022-0_
Content-Type: application/x-gtar
Content-Disposition: attachment; filename=testppp.tgz
Content-Description: Test1 and test2 results
Content-Transfer-Encoding: base64

H4sIAMfM1DYAA+1dbXPaSBLO1/Ar5nJ1VcmVQ/SCJKAue+XCSdZXjs2ZcPshlQ+yELbOILFIjp37
9TejdwmNpkcSsN5MV23AQD+al+6eme6e3sD2g/5ms+lbnrt8sR+SJUkfDNALhGRDk8grpuQVfyjL
A4R0XdfkgS6rOvlEV7QXSNpTewr04AfmFqEXgbeu/d3jnW2vDtGgw9LCXpoPq2Dce+nbAQqcte09
BEiK/lz9vt3YW8dbIDn5xLtF0zvTt9HkzgzQxHNd2wrQxWSKzqf4n4m3Xpvuovdy4fjmzcpG/hbj
2X7vpe2Gf2PI3kvTsuxNCN/rrTxvEz99YX93LButTWf11Mfz8bv7hMVyPdZ1Q45/4ZirtCGOG701
H4I711zbaOt5QfbRvf0DTf97PzT+9noUfeoszcVii/vSl/pyX07eKEjRtH7ynxQ16a3jjtNGW3fm
Brd6tfIe0dpb2LgdW9ztKlQlg4++9u3td3uL3n03t++C9ebdxgxu/AV69QpJsmH0eqRL3A+TkCJp
fVmT+gO1L1c/iABjxc6e5MdDvfLMBSLvC6O+8ixzdef5QTTcPTxpARdDr3dsWRbET0Fi/1eOe7/w
Ht09PINh/xE2+5n9N4j9VzRZ2P+DUGKtsJov7JUd2Jll6UvvlEH6cfLDXi+xdBmLrI36ymCEP5be
yXqZRRGW4Y9LBf1/2OzlGUz9N4xM/zWD6D/+odD/Q1BO//HOoqT8mQaHX2YbJ8Xoh+8LxiD8ScES
ZEYjz6/k+I/d/Z+eiP7LfdcOsBoEe1I4lv7rUqL/Kj7/Dcj5z5Alof+HoGt8PHPcWxSQs4ff6527
gb3F4jDunWHJcFwzcDw3mahPZmA/mj9Qjj6uzFs/fHdtL6M3c3w6JHRpB84SfXja4PNLLz5n5llz
Rxgl/mj+aWalXydvpOjF3ki91HBkKBUfzX/NPSR+HcYoK0/qZQ9Of0ZWv78qKE/zSfZeqmhLRfOl
sS6Nh9Z4YY9vlmNbDtty8VvCrMWvowyFvOjDwfHsYEH/5f08g+X/wat+sv4rkq6F+q8K/T8IPTP9
T3YgGaWbipzm5vQ//WmEEjy4Mnr/Swr0Tk0ZqwxJrjllE0AMScWzK1pY3xwp1xylZXO4raMe/0rP
o0RbuFwHKjtFbwsZ48w67qAoYJTjW2pVTlHIizGU/mw71oL9V/bzDNb+z5C1dP9n6Epo/xVZ2P9D
0KHsf0cGl9vCxYZD1pUM5XnbJiVpsZKihB8bRhPbFOl/6ADybjuXroiI/ksSTf/xic9I9n+qIUX6
r+lC/w9CH+0bpAyQoowleayM0MLc3ocjgkXiqypr38ZRvG+M1ZrYCYfYh6Vp4Q/CzUsPDoC3gERB
xmiytbEhWWAsZK08H78ji48NwooDjCFaGLhEpUhjQ5QsPNkQIItp8gGkcTaUhtn4ACJmVvy0MaZj
rpryhjHapsyFqG4bEBIH/ntMTXFgkeOmE08NqjIB9Uy7ptMpmuFviGK9vjGt+1usD+4iDCG/6fMg
3WC2FX794JNNgePf8TCnSh5r9ttfkLexXWI6mqDECQbhDsVrMTplQHvxl0bsSV9wt0CaQTBIpkQO
4ozwmUFgr7G5kJG3RM16Ej4ftwOmZVUIESeBsNg2hwBcTKZj9HH2OVkMXsVQr5DpIxMFW9P1N96W
ra4pWNqWWbgCWHeme2ujc9cJwt7htk0iOWqJGKMQwFngbTY7iKrERrwwf9jbUMcaMM9sd4GFb+nc
Xtu/v5bfxGvee/72oNPJx8nV5+lX5RsH0/T66ssVP9vpZPL5dPpV/4akJykmDvbP1/Ovg28I73C5
uE4/nU8u55+jx6o3Q1Vd3pgcAP+en16EHf46/IY2Ww/bDktStJNo+/Idi5ZMOrIu7xZAE5mXq2Ty
iGDheX2LZ7mJcFzb1vdK4eDH/OmkQ1Itc2kfWTrQ6fzLrxGmRlplKYqKXk9+PZ2SkdHetDIYp9a9
kAkhE2Xbk8oBMT5YRtobn5KgtcAsNDTFIQ29wjuo5qvvvLxRqeYs7WRP8TEAP9+xdk+Ztfzps+8c
H48HmboTtHZcMjiu53Jh4V3gBm9sNg94L4iBLi4+XH76gF4vt94awUQhD3T1EIRI1x9m06vLGQYi
xyR+nLhBs/lk8mE2azQ2ZPMY7/KbzM2lHTx623sQK0kxbrIBrUNruAMFQWYbxvIBDNYi2paRu0X4
5fTs7JpYx1x+MpydrAKRJQsRdPSffyF/5QU+enSCu/Atsrz1Zmv7PvEXtx3+3Hada7UrYXazraIO
ofJHG8JOdgzPp7vdLIZUmWmzGtY1telyWGVc5hsey7L+UXRj4WUt/OA93xSnLqw0eRDV5A62gdzN
OGSi5RzOU3OLhzlyjUX+3QVstAr8eKUPnKWDJ8lbIv/BsrDEYphjO+47ojT+I+0vAMSI/2jqQMri
P0aY/6cPRPznIFTnVpOwOhhYHaz4lle4V019sLkIQ61zLkGpCSMxHKAlhHZxpASsZSCJDgONJNER
oKGkHYTk4hrKXSVrClF/ta05JuU+XBqhIXfVYPi/mVvii8crIRFJfKJwyMNOcYuxhUbmCkvH4gcR
jgcfKBMpYtZQfNwwHbxwoAGXjBaiMdEFPlAkpgTDGYqhKQpnLIaqb8DYCY0/FzxhRhwSjG5CDgW0
bmIOAEjeoEM1JPUQw48J9gMWueCOwPLTeD2BRX6wK7DE1iB6UEToyBlIEZFWgSAxnyDf7v7ns513
lyUapbOnEI3nqeq0UCHMl8BaDxo5KAAt5XZQVGOCHPaUDQ/cY0/bbEQue+KlT132oXLygBVc7ZnT
nhsi9rJnzvrwJAXw2Ne1B+Szp40O2GlPmR+Q1z7h7chtX4Trxm9PwQQ77mltauO5L2LyO2JL/J16
YiFzwO28p4C29N4zh5HhSzzGMLZz4D/HHrf04TNFp9Ua2akXv87UMN34ReaiH1/Z9eMDJ7rsdVdi
r3v1Jf52mDt3/wtw6liF+RCIUwe9lt6M0Scsfv+ztx66+VH21lHRKjYoZ96j24S5IAzx/Ecnfbxq
lF0s4O6FC3KFf4SnRx8dt+wpatKntCOkV8m6CgKtknAyzOPqBYyKUyFKlQVnSmgDsGCW6tTAcOqM
AlsOYKC5eezzACT12EgdN9xRrKK+bfljJCHPCmxsuR33JPuDVHrjAEeBF2ARkCKFe4eRT9DGNu/z
nyC8DvyGR6CMKo9Go/bDCxBJCmpp+/rF3q7J3TabizuXwu1b1b5QMER5puR4pkZGYa5GI517tuJH
xdMlazvzJatqlzNGOVtYu1td6PScESPPw0jc7emc4vXB9bZrc/XmzxOm3RvF8V/ffNpfcJN1/1fJ
4r+6pqsk/qsYAxH/PQQl11FVOaoFIMd/K+GLbKg6uaA5wNZpRczVLLzi+89/RvdUJbxwakQd0dv4
xlEvwzMIGqnmkcfTB0OMp0h0vFGo3uhtXPXy2OPzZ6dI/8lVzv09g1n/TUvr/0rSINR/SVGF/h+C
yMyP0ZLc4n8/lDT5H/PpyfTq/PLLVfjvyfX88vL88tPJ5/nFl/PJ6ezLL2gdPISO+t5Lx7WD7HhI
NmXpVhp/szZ9vDd8Wi6TcMCx+ypol1L931PtJ0Jc+q+H9b8lVeR/HYTIzHei/0pe/2Wh/8+EiP4r
Sf2fPT0DXv9noA3UcP3XVbH+H4SeWf2fI9VJGyQLVVUVoePU/0HKMEUhLyNVb1r/Rzlq/R9Fknb1
X9OE/h+EMiebOhrjM3vhNoQyzN2GYNb/YQHw5G1TsbjStjlQqrO2OQCqk7bZAIz6P2wA/vo/XJgl
FzsP705lEh5mav0fXhBa/R8eHP76PzwTD6r/UwlIyzivrf/DQqpNOmcxw3LOwSjQ+j+8gKW4DZi9
pv4PFYOv/g+4KdT6P2AEWv0fKkCTZPx6sCZZXE0Q61PxCeKQjVhd/wfIzJP2XQ9JSe1lMNEye5nP
qk/sZbBT8npZXIW0Xt0aSkNTUjkAGmX1QieSp/4PEJMr1U1IR1468AFKGY0WN8eVDp7bANwGA5AM
KGTi55AJrnRJbuMDSJZs0lBGqiTP6jvf2ahUcgJvE7D4eer/sLDA9X9AQID6PzwNqrhLAB4bylUC
6NxU3CSgsja6SMBAa7oDhUBSrhGAWwS/RcCAZOWCs9hbpII3Gn7GFQIoZjfbKtg9jD/CEHayY3g+
3e1mMWxzdaBRU5suh4CLAwxevvo/VLDm9X94Ievr/1Si7dTvodf/YS1RtfV/CjDaqAZm5ty6YSLu
CQqSHFUYADORvjwc1TAVg1v5/3ptCcbVpgrTlSbwEmOtZIpXqSIwVEpuPlleSh7JgTSWlJYNpcBq
QKuTwhLD0w0s7R4Bg7+YnD4weG4RMKCbXSKIQMsp6dBR2NlaJFcIdvdRVEjIBQIqM+DaEZCXT0+g
oHxqUjc+rItMjacsblLFpQ9gJ7vSMNCFqyYD31AwYU5vnnED33YBI8RHRNKqB5g41F+XGQxji6Qr
ct4maYbGa5WKl2Xwj3cvyxgdWqYdx0I0JJXXZaC2qHxbhsUnLssISinN/zle/UfdyPJ/0/w/TRH5
P4egurCqhO3G8Btf/Uc6CrT+IxOhXR5RAtYykYgOA80koiNAU4l2EFj1H3kgoPUfeTF56z/S8ZvW
fwQgsuo/MmUUVv+RCcOZikNTFM5cHKq+AXNnaPz0+o90jG5STgpo3eScACB5k06qIXnqPzIwwXHg
Ihc8EFx+Gm8kuMgPDgWX2BpkjxQROgoGU0SkVSLQTz+fsNj+/uezXXSfJRrs+o9CNJ6BqvPUf+Rf
DxoFqAAt5Q5QVWOCEjYoGx54xgZtswGo/8gEY9d/hEEA6z/ytQeUs0EbHXDSBmV+QFkbCW9HaRtF
uG7yNiiY4MQNWpvaZG4UMfkD8SX+TiPxkDngTt6ggLbM3mAOIyOWfIxhbJfA8Rx73DKHgyk6rdbI
TrM46kwNM42jyMxX/5GO1rz+Iz9mbf3H3YB+zcS2yROomwR6iUI6TpMShbvxmR3/05m9QqEXEi1D
79MYSeGgSXjl9dy39pPjByUppqOmbUwxauoe0mHqEiFK8VRIJBqiY4Vyinj3zIzRNpCaVsB1g8AH
XBOsbhDmr9mQsxMP6GfTLuYbWkWVZ/AaJx/QACBVZht1Me0XrUQo11xCEg9Ajaovq9lMeVt0tXEW
UxGg2zSm0u5or3lMsPFtOmvNcplousKRHsIILJSzO3LpHGSqcske4Mk6VHoH45wNSe6gzA47u6Mi
9CTSOwQJEiRIkCBBggQJEiRIkCBBggQJEiRIkCBBggQJEiRIkKCfif4P0wx0aQDIAAA=

--ELM919917475-1022-0_--


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-current" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?m10FsYd-000I5dC>