From: Jaime
To: freebsd-stable@freebsd.org
Date: Sat, 13 Dec 2003 23:40:56 -0500
Subject: Page faults every few days

I have a server that has been experiencing kernel panics every few
weeks for months. Lately, it happens anywhere from once every two
weeks to three times in the same week. It is always a page fault.

I've replaced the drive with the swap partition on it, the RAM, the
motherboard, the IDE cable, the IDE controller (on that motherboard),
the CPU, and a few other parts in the last 12 months.

I *think* that these page faults have been happening since before the
most recent cvsup/make-world process, but I can't be 100% certain that
more than one happened before then.

I've removed the vinum RAID-5 system that used to manage /home, but
that didn't help the stability at all. (It did make rebooting a lot
quicker and less prone to failure, though.)
I'm running out of ideas. What information I do have is listed below.
Any help would be appreciated. I don't even know how to do any useful
diagnostics in order to isolate the problem.

Thanks in advance,
Jaime

zeus# uname -a
FreeBSD zeus.cairodurham.org 4.9-PRERELEASE FreeBSD 4.9-PRERELEASE #9: Tue Aug 26 14:01:09 EDT 2003 jkikpole@zeus.cairodurham.org:/usr/obj/usr/src/sys/ZEUS i386
zeus# pwd
/usr/local/crash
zeus# whoami
root
zeus# ls -l
total 1585106
-rw-r--r--  1 root  wheel          2 Dec  9 09:11 bounds
-rw-r--r--  1 root  wheel    2324353 Nov 12 08:01 kernel.0
-rw-r--r--  1 root  wheel    2324353 Nov 12 15:05 kernel.1
-rw-r--r--  1 root  wheel    2324353 Nov 18 11:54 kernel.2
-rw-r--r--  1 root  wheel    2324353 Dec  2 10:22 kernel.3
-rw-r--r--  1 root  wheel    2324353 Dec  5 07:35 kernel.4
-rw-r--r--  1 root  wheel    2324353 Dec  9 09:11 kernel.5
-rw-------  1 root  wheel  268369920 Nov 12 08:01 vmcore.0
-rw-------  1 root  wheel  268369920 Nov 12 15:05 vmcore.1
-rw-------  1 root  wheel  268369920 Nov 18 11:54 vmcore.2
-rw-------  1 root  wheel  268369920 Dec  2 10:22 vmcore.3
-rw-------  1 root  wheel  268369920 Dec  5 07:35 vmcore.4
-rw-------  1 root  wheel  268369920 Dec  9 09:11 vmcore.5
zeus# gdb -k kernel.0 vmcore.0
GNU gdb 4.18 (FreeBSD)
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
(no debugging symbols found)...
IdlePTD at phsyical address 0x0036f000
initial pcb at physical address 0x002dc240
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x13
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc022cb68
stack pointer           = 0x10:0xcca0fe48
frame pointer           = 0x10:0xcca0fe50
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 91607 (perl5.00503)
interrupt mask          = none
trap number             = 12
panic: page fault

syncing disks... 54 1 done
Uptime: 2d20h1m1s

dumping to dev #ad/0x20001, offset 1572992
dump ata0: resetting devices .. done
255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
---
#0  0xc016b1ba in dumpsys ()
(kgdb) quit
zeus# gdb -k kernel.1 vmcore.1
GNU gdb 4.18 (FreeBSD)
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
(no debugging symbols found)...
IdlePTD at phsyical address 0x0036f000
initial pcb at physical address 0x002dc240
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x13
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc022cb68
stack pointer           = 0x10:0xcca37e48
frame pointer           = 0x10:0xcca37e50
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 16463 (perl5.00503)
interrupt mask          = none
trap number             = 12
panic: page fault

syncing disks... 52 1 done
Uptime: 7h1m56s

dumping to dev #ad/0x20001, offset 1572992
dump ata0: resetting devices .. done
255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
---
#0  0xc016b1ba in dumpsys ()
(kgdb) quit
zeus# gdb -k kernel.2 vmcore.2
GNU gdb 4.18 (FreeBSD)
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
(no debugging symbols found)...
IdlePTD at phsyical address 0x0036f000
initial pcb at physical address 0x002dc240
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x13
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc022cb68
stack pointer           = 0x10:0xccef1e48
frame pointer           = 0x10:0xccef1e50
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 67351 (perl5.00503)
interrupt mask          = none
trap number             = 12
panic: page fault

syncing disks... 41 done
Uptime: 5d20h45m52s

dumping to dev #ad/0x20001, offset 1572992
dump ata0: resetting devices ..
done
255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
---
#0  0xc016b1ba in dumpsys ()
(kgdb) quit
zeus# gdb -k kernel.3 vmcore.3
GNU gdb 4.18 (FreeBSD)
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
(no debugging symbols found)...
IdlePTD at phsyical address 0x0036f000
initial pcb at physical address 0x002dc240
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0xb00036
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc022cb68
stack pointer           = 0x10:0xcc7dee48
frame pointer           = 0x10:0xcc7dee50
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 93374 (perl5.00503)
interrupt mask          = none
trap number             = 12
panic: page fault

syncing disks... 65 done
Uptime: 13d22h25m47s

dumping to dev #ad/0x20001, offset 1572992
dump ata0: resetting devices .. done
255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
---
#0  0xc016b1ba in dumpsys ()
(kgdb) quit
zeus# gdb -k kernel.4 vmcore.4
GNU gdb 4.18 (FreeBSD)
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
(no debugging symbols found)...
IdlePTD at phsyical address 0x0036f000
initial pcb at physical address 0x002dc240
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0xe4000113
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc022cb68
stack pointer           = 0x10:0xcc9d4e48
frame pointer           = 0x10:0xcc9d4e50
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 19861 (perl5.00503)
interrupt mask          = none
trap number             = 12
panic: page fault

syncing disks... 35 4 1 done
Uptime: 2d21h11m34s

dumping to dev #ad/0x20001, offset 1572992
dump ata0: resetting devices .. done
255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
---
#0  0xc016b1ba in dumpsys ()
(kgdb) quit
zeus# gdb -k kernel.5 vmcore.5
GNU gdb 4.18 (FreeBSD)
Copyright 1998 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-unknown-freebsd"...
(no debugging symbols found)...
IdlePTD at phsyical address 0x0036f000
initial pcb at physical address 0x002dc240
panicstr: page fault
panic messages:
---
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x13
fault code              = supervisor read, page not present
instruction pointer     = 0x8:0xc022cb68
stack pointer           = 0x10:0xccc10e48
frame pointer           = 0x10:0xccc10e50
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, def32 1, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 26642 (perl5.00503)
interrupt mask          = none
trap number             = 12
panic: page fault

syncing disks... 55 done
Uptime: 4d1h33m35s

dumping to dev #ad/0x20001, offset 1572992
dump ata0: resetting devices ..
done
255 254 253 252 251 250 249 248 247 246 245 244 243 242 241 240 239 238 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
---
#0  0xc016b1ba in dumpsys ()
(kgdb) quit