Date:      Tue, 10 Oct 2000 11:50:51 +0800 (CST)
From:      Tai-hwa Liang <avatar@mmlab.cse.yzu.edu.tw>
To:        freebsd-stable@freebsd.org
Subject:   md hangs in 4.1.1R
Message-ID:  <Pine.BSF.4.21.0010101116530.323-100000@brahms.mmlab.cse.yzu.edu.tw>

-----BEGIN PGP SIGNED MESSAGE-----

Hi,

	I followed the man page to create and mount an md device, then compiled
postfix (snapshot-20001005) on it; however, the build hung after some
(heavy?) I/O.

	According to the "ps axl" output that init advised, the build was
blocked in a process showing "MD sec" in the WCHAN field. It's completely
reproducible on my laptop (tested on 4.1-RELEASE and 4.1.1-RELEASE). Sometimes
"MD sec" appeared on 'syncer', and sometimes 'bufdaemon' was stuck in it.
In the following snapshot, the 'cp' process is the one that is stuck.

  UID   PID  PPID CPU PRI NI   VSZ  RSS WCHAN  STAT  TT       TIME COMMAND
    0     0     0   0 -18  0     0    0 sched  DLs   ??    0:00.01  (swapper)
    0     1     0   0  10  0   456  288 wait   ILs   ??    0:00.03 /sbin/init --
    0     2     0   0 -18  0     0    0 psleep DL    ??    0:00.01  (pagedaemon)
    0     3     0   0  18  0     0    0 psleep DL    ??    0:00.00  (vmdaemon)
    0     4     0   0 -18  0     0    0 psleep DL    ??    0:00.17  (bufdaemon)
    0     5     0   0  -2  0     0    0 getblk DL    ??    0:00.57  (syncer)
    0    33     1   6  18  0   208   92 pause  Is    ??    0:00.00 adjkerntz -i
    0    96     1   0   2  0   908  612 select Ss    ??    0:00.08 syslogd -s
    0   122     1   0   2  0  1512 1068 select Is    ??    0:00.87 /usr/local/sbin/sshd
 1000  6113     1   0   2  0  1152  812 select Ss    ??    0:00.00 /usr/local/bin/ssh-agent
    0   252     1   0  18  0  1376 1032 pause  Is    v0    0:00.22 -tcsh (tcsh)
 1000  5536   252   0  18  0  1376 1032 pause  I     v0    0:00.16 -su (tcsh)
 1000  5559  5536   8  10  0  1008  676 wait   I+    v0    0:00.02 make PATH=.:/bin:/usr/bin:/sbin:/usr/sbin:/usr/local/bin:/usr/local/sbin:/usr/X11R6/bin
 1000  5978  5559   8  10  0   628  432 wait   I+    v0    0:00.00 /bin/sh -ec make MAKELEVEL= update
 1000  5979  5978   8  10  0  1012  688 wait   I+    v0    0:00.02 make MAKELEVEL= update
 1000  5980  5979 168  10  0   628  432 wait   I+    v0    0:00.01 /bin/sh -ec set -e; for i in src/util src/global src/dns src/master src/postfix src/smtpstone  src/sendmail src/error src/pickup src/cleanup src/smtpd src/local  src/lmtp src/trivial-rewrite src/qmgr src/smtp src/bounce src/pipe  src/showq src/postalias src/postcat src/postconf src/postdrop  src/postkick src/postlock src/postlog src/postmap src/postsuper  src/nqmgr src/spawn src/flush; do  (set -e; echo "[$i]"; cd $i; make "CC=gcc -Wmissing-prototypes -Wformat" update MAKELEVEL=) || exit 1;  done
 1000  7442  5980 171  10  0  1080  764 wait   I+    v0    0:00.05 make CC=gcc -Wmissing-prototypes -Wformat update MAKELEVEL=
 1000  7477  7442 171 -20  0   688  108 MD sec D+    v0    0:00.00 cp lmtp ../../libexec
 1000   188     1   3  18  0  1376 1028 pause  Ss    v1    0:00.22 -tcsh (tcsh)
 1000  7501   188   3  28  0   416  256 -      R+    v1    0:00.00 ps axlwww
    0   189     1   0   3  0   920  624 ttyin  Is+   v2    0:00.01 /usr/libexec/getty Pc ttyv2
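
The check described above (finding the processes whose WCHAN field reads
"MD sec") can be sketched as a small script. This is only an illustration and
not part of the original report: the name procs_waiting_on is hypothetical, and
the code assumes the fixed-width "ps axl" layout shown above, taking the column
positions from the header line. Unusually wide values can shift the columns, in
which case this simple slicing would misread the row.

```python
def procs_waiting_on(ps_output, wchan):
    """Return (pid, command) pairs whose WCHAN column equals `wchan`."""
    lines = [ln for ln in ps_output.splitlines() if ln.strip()]
    header = lines[0]
    # Column boundaries come from the header; WCHAN may contain a space
    # ("MD sec"), so splitting each row on whitespace would not work.
    pid_end = header.index("PID") + len("PID")
    wchan_start = header.index("WCHAN")
    stat_start = header.index("STAT")
    cmd_start = header.index("COMMAND")
    hits = []
    for ln in lines[1:]:
        if ln[wchan_start:stat_start].strip() == wchan:
            pid = int(ln[:pid_end].split()[-1])  # PID is right-aligned
            hits.append((pid, ln[cmd_start:].strip()))
    return hits


# Three lines copied from the snapshot above.
SAMPLE = """\
  UID   PID  PPID CPU PRI NI   VSZ  RSS WCHAN  STAT  TT       TIME COMMAND
    0     5     0   0  -2  0     0    0 getblk DL    ??    0:00.57  (syncer)
 1000  7477  7442 171 -20  0   688  108 MD sec D+    v0    0:00.00 cp lmtp ../../libexec
"""

for pid, command in procs_waiting_on(SAMPLE, "MD sec"):
    print(pid, command)
```

Run against the full snapshot, this would report only the 'cp' process.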


	Furthermore, once the build has hung, there is no way to interrupt
it with ^C. If I issue 'shutdown -h now' at that point, the system
tries to flush the buffers and fails with "syncer...timed out."

Shutting down daemon processes:.
Oct 10 11:06:27 brahms syslogd: exiting on signal 15
Oct 10 11:06:27 init: some processes would not die; ps axl advised
Waiting (max 60 seconds) for system process `bufdaemon' to stop...stopped
Waiting (max 60 seconds) for system process `syncer' to stop...timed out

syncing disks... 192 192 192 192 192 192 192 192 192 192 192 192 192 192 192
192 192 192 192 192

1: dev:#md/2, flags:21021024, blkno:65712, lblkno:65712
2: dev:#md/2, flags:21021024, blkno:20112, lblkno:-12
3: dev:#md/2, flags:21021024, blkno:81328, lblkno:12
4: dev:#md/2, flags:21021024, blkno:65648, lblkno:65648
5: dev:#md/2, flags:21021024, blkno:80912, lblkno:1
6: dev:#md/2, flags:21021024, blkno:81008, lblkno:56
.
.
.
72: dev:#md/2, flags:21021024, blkno:81376, lblkno:5
73: dev:#md/2, flags:21021024, blkno:80806, lblkno:3
giving up on 73 buffers
Uptime: 2h9m22s

The operating system has halted.
Please press any key to reboot.

	BTW, I tried MFS instead of MD last night. The build worked fine;
however, if I reboot the system without unmounting the MFS first,
the "Waiting (max 60 seconds) for system process.....timed out" message
pops up on the console again. I never encountered this problem with MFS
in 2-stable or 3-stable. Can anyone verify this mysterious hang for
me? TIA.

dmesg output:

Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 4.1.1-RELEASE #0: Sun Oct  8 15:37:39 CST 2000
    root@brahms.mmlab.cse.yzu.edu.tw:/usr/src/sys/compile/brahms
Timecounter "i8254"  frequency 1193108 Hz
CPU: Pentium II/Pentium II Xeon/Celeron (265.25-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x652  Stepping = 2
  Features=0x183f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
real memory  = 134217728 (131072K bytes)
avail memory = 127586304 (124596K bytes)
Preloaded elf kernel "kernel" at 0xc031b000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc031b09c.
Preloaded elf module "splash_bmp.ko" at 0xc031b0ec.
Preloaded splash_image_data "/boot/chuck1.bmp" at 0xc031b190.
VESA: v2.0, 2048k memory, flags:0x0, mode table:0xc00c876a (c000876a)
VESA: Copyright 1994 TRIDENT MICROSYSTEMS INC.

Pentium Pro MTRR support enabled
md0: Malloc disk
apm0: <APM BIOS> on motherboard
apm: found APM BIOS v1.2, connected at v1.2
npx0: <math processor> on motherboard
npx0: INT 16 interface
pcib0: <Intel 82443BX host to PCI bridge (AGP disabled)> on motherboard
pci0: <PCI bus> on pcib0
pci0: <Trident model 9388 VGA-compatible display device> at 2.0 irq 9
isab0: <Intel 82371AB PCI to ISA bridge> at device 3.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel PIIX4 ATA33 controller> port 0xfcd0-0xfcdf at device 3.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
pci0: <Intel 82371AB/EB (PIIX4) USB controller> at 3.2 irq 9
chip1: <Intel 82371AB Power management controller> port 0xff80-0xff8f at device 3.3 on pci0
pcic-pci0: <TI PCI-1221 PCI-CardBus Bridge> at device 10.0 on pci0
pcic-pci0: TI12XX PCI Config Reg: [speaker enable][FUNC pci int + CSC serial isa irq]
pcic-pci1: <TI PCI-1221 PCI-CardBus Bridge> at device 10.1 on pci0
pcic-pci1: TI12XX PCI Config Reg: [speaker enable][FUNC pci int + CSC serial isa irq]
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> on isa0
sc0: VGA <16 virtual consoles, flags=0x200>
fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 flags 0x10 on isa0
sio1: type 16550A
sbc0: <ESS 688> at port 0x220-0x22f irq 5 drq 3 flags 0x15 on isa0
pcm0: <ESS 18xx DSP> on sbc0
pcic0: <VLSI 82C146> at port 0x3e0-0x3e1 irq 10 on isa0
pcic0: management irq 10
pccard0: <PC Card bus -- kludge version> on pcic0
pccard1: <PC Card bus -- kludge version> on pcic0
ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppi0: <Parallel I/O> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
plip0: <PLIP network interface> on ppbus0
ad0: 3102MB <HITACHI_DK237A-32> [6304/16/63] at ata0-master using UDMA33
acd0: CDROM <UJDA150> at ata0-slave using WDMA2
Mounting root from ufs:/dev/ad0s2a
WARNING: / was not properly dismounted
pccard: card inserted, slot 1


kernel configuration:

machine		i386
ident		brahms
maxusers	10
makeoptions	CONF_CFLAGS=-fno-builtin  #Don't allow use of memcmp, etc.

options 	MAXDSIZ="(256*1024*1024)"
options 	DFLDSIZ="(256*1024*1024)"
options 	BLKDEV_IOSIZE=8192
options		PQ_CACHESIZE=512	# color for 512k/16k cache
options 	ROOTDEVNAME=\"ufs:ad0s3a\"

cpu		I686_CPU
options 	NO_F00F_HACK
options 	COMPAT_43
options 	SYSVSHM
options 	SYSVSEM
options 	SYSVMSG
options 	DDB
options 	KTRACE			#kernel tracing
options 	USERCONFIG		#boot -c editor
options 	VISUAL_USERCONFIG	#visual boot -c editor
options 	INET			#Internet communications protocols

pseudo-device	ether			#Generic Ethernet
pseudo-device	loop			#Network loopback device
pseudo-device	bpf			#Berkeley packet filter

options                ACCEPT_FILTER_DATA
options                ACCEPT_FILTER_HTTP
options 	TCP_DROP_SYNFIN		#drop TCP packets with SYN+FIN
options 	TCP_RESTRICT_RST	#restrict emission of TCP RST
options 	ICMP_BANDLIM

options 	FFS			#Fast filesystem
options 	FFS_ROOT		#FFS usable as root device
options 	SOFTUPDATES

options 	MD_NSECT=131072
options 	EXT2FS
options 	VFS_AIO

options 	P1003_1B
options 	_KPOSIX_PRIORITY_SCHEDULING
options 	_KPOSIX_VERSION=199309L

pseudo-device	pty		#Pseudo ttys
pseudo-device	gzip		#Exec gzipped a.out's
pseudo-device	vn		#Vnode driver (turns a file into a device)
pseudo-device	md		#Memory/malloc disk

options 	MSGBUF_SIZE=40960

device		isa
options 	AUTO_EOI_1
options 	MAXMEM="(128*1024)"

device		atkbdc0	at isa? port IO_KBD
device		atkbd0	at atkbdc? irq 1

options 	KBD_DISABLE_KEYMAP_LOAD	# refuse to load a keymap

device		psm0	at atkbdc? irq 12
options 	PSM_HOOKRESUME		#hook the system resume event, useful
options 	PSM_RESETAFTERSUSPEND	#reset the device at the resume event

device		vga0	at isa?
options 	VESA
pseudo-device	splash

device		sc0	at isa?
options 	MAXCONS=16		# number of virtual consoles

device		npx0	at nexus? port IO_NPX flags 0x0 irq 13

device		ata
device		atadisk		# ATA disk drives
device		atapicd		# ATAPI CDROM drives

options 	ATA_ENABLE_ATAPI_DMA

device		fdc0	at isa? port IO_FD1 irq 6 drq 2
device		fd0	at fdc0 drive 0

device		sio0	at isa? port IO_COM1 flags 0x10 irq 4
device		sio1	at isa? port IO_COM2 flags 0x10 irq 3

device		pcm
device		sbc0	at isa? port 0x220 irq 5 drq 3 flags 0x15

device		apm0
device		pci

device		pcic0 at isa? irq 10
device		card
device		ed

options 	PCIC_RESUME_RESET	# reset after resume

device		ppc0	at isa? irq 7
device		ppbus
device		lpt
device		plip
device		ppi

options 	NSFBUFS=1024

options 	INIT_PATH="/sbin/init:/stand/sysinstall"

options 	CLK_USE_I8254_CALIBRATION
options 	CLK_USE_TSC_CALIBRATION
options 	SHOW_BUSYBUFS	# List buffers that prevent root unmount

-----BEGIN PGP SIGNATURE-----
Version: 2.6.3i
Charset: noconv

iQCVAwUBOeKSIGi3mv/0ATqxAQGfBwP9G/2nuudTVy5RhBWDixELRBXtRTKE0lN0
Ko3VtoBPFyhDsTuVZaa+IzpiVQfnL567Ke/KKZDm0g+05tiFwxr59+Ov0PfCAC7o
bDJO3alpTD9DiGyEdb5M967OBxAAYLSBNNdEEozj06AD3BabJdYphiC3TzvItuoA
vd7aHI4owms=
=G9UY
-----END PGP SIGNATURE-----






