Can't Offline a disk via web interface

Status
Not open for further replies.

RobbieG1977

Dabbler
Joined
Jul 20, 2018
Messages
19
I'm sure I'm doing something wrong, but my problem is that I have a bad disk and I want to replace it. I go to the pool's disk status view, select the disk, and choose Offline. I get the following error. Please help!

Code:
Request Method: POST
Request URL: http://192.168.1.3/storage/zpool-RobzData/disk/offline/gptid/8061d888-af1d-11e8-8305-002170456035/
Software Version: FreeNAS-11.1-U6 (caffd76fa)
Exception Type: MiddlewareError
Exception Value:
[MiddlewareError: Disk offline failed: "cannot offline gptid/8061d888-af1d-11e8-8305-002170456035: no valid replicas, "]
Exception Location: ./freenasUI/middleware/notifier.py in zfs_offline_disk, line 1065
Server time: Tue, 2 Oct 2018 18:45:56 -0700
Traceback

Code:
Environment:

Software Version: FreeNAS-11.1-U6 (caffd76fa)
Request Method: POST
Request URL: http://192.168.1.3/storage/zpool-RobzData/disk/offline/gptid/8061d888-af1d-11e8-8305-002170456035/


Traceback:
File "/usr/local/lib/python3.6/site-packages/django/core/handlers/exception.py" in inner
  42.			 response = get_response(request)
File "/usr/local/lib/python3.6/site-packages/django/core/handlers/base.py" in _legacy_get_response
  249.			 response = self._get_response(request)
File "/usr/local/lib/python3.6/site-packages/django/core/handlers/base.py" in _get_response
  178.			 response = middleware_method(request, callback, callback_args, callback_kwargs)
File "./freenasUI/freeadmin/middleware.py" in process_view
  162.		 return login_required(view_func)(request, *view_args, **view_kwargs)
File "/usr/local/lib/python3.6/site-packages/django/contrib/auth/decorators.py" in _wrapped_view
  23.				 return view_func(request, *args, **kwargs)
File "./freenasUI/storage/views.py" in disk_offline
  643.		 notifier().zfs_offline_disk(volume, label)
File "./freenasUI/middleware/notifier.py" in zfs_offline_disk
  1065.			 raise MiddlewareError('Disk offline failed: "%s"' % error)

Exception Type: MiddlewareError at /storage/zpool-RobzData/disk/offline/gptid/8061d888-af1d-11e8-8305-002170456035/
Exception Value: [MiddlewareError: Disk offline failed: "cannot offline gptid/8061d888-af1d-11e8-8305-002170456035: no valid replicas, "]
Request information
GET: No GET data
POST: __form_id = 'form_str'
FILES: No FILES data
COOKIES: fntreeSaveStateCookie = 'root%2Croot%2F62%2Croot%2F62%2F63%2F64%2Croot%2F62%2F63', csrftoken = '********', sessionid = 'zt0qxdrpa4cq5roaepremw29qp1s5j1b'
META: Variable / Value (table cut off here)
 

HoneyBadger

actually does care
Administrator
Moderator
iXsystems
Joined
Feb 6, 2014
Messages
5,112
MiddlewareError: Disk offline failed: "cannot offline gptid/8061d888-af1d-11e8-8305-002170456035: no valid replicas"

That would seem to imply you're trying to offline a disk that you can't, because it would kill the pool; but I assume that you would have mentioned if that was the case.

Can you please post the output of zpool status in CODE tags?
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504

RobbieG1977

Dabbler
Joined
Jul 20, 2018
Messages
19
Your assumption would be correct. Previously I had a bad disk. I just yanked it out and put a new one in, and I had no issues. This time around I did the same thing and it screamed at me. I researched what I did wrong, and apparently I was supposed to go into the volume and take the disk offline first. That's what I attempted, and that's how I got the above error. So now to your second point: how do I get the output for the zpool code?

Thank you
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
So now to your second point: how do I get the output for the zpool code?
Go to the shell (or, better, log in via SSH), type zpool status, and post the output.
 

RobbieG1977

Dabbler
Joined
Jul 20, 2018
Messages
19
Here is the output I got when I typed zpool status.

Code:
login as: root
root@192.168.1.3's password:
FreeBSD 11.1-STABLE (FreeNAS.amd64) #0 r321665+9902d126c39(freenas/11.1-stable): Tue Aug 21 12:24:37 EDT 2018

		FreeNAS (c) 2009-2017, The FreeNAS Development Team
		All rights reserved.
		FreeNAS is released under the modified BSD license.

		For more information, documentation, help or support, go here:
		http://freenas.org
Welcome to FreeNAS

Warning: settings changed through the CLI are not written to
the configuration database and will be reset on reboot.

root@SRVRGROSS02:~ # zpool status
  pool: JAILS
 state: ONLINE
  scan: none requested
config:

        NAME                                          STATE     READ WRITE CKSUM
        JAILS                                         ONLINE       0     0     0
          gptid/bd4bb2ab-af20-11e8-8749-002170456035  ONLINE       0     0     0

errors: No known data errors

  pool: RobzData
 state: ONLINE
  scan: none requested
config:

        NAME                                          STATE     READ WRITE CKSUM
        RobzData                                      ONLINE       0     0     0
          gptid/7e4e06f3-af1d-11e8-8305-002170456035  ONLINE       0     0     0
          gptid/7f55a73a-af1d-11e8-8305-002170456035  ONLINE       0     0     0
          gptid/8061d888-af1d-11e8-8305-002170456035  ONLINE       0     0     0
          gptid/81b8796e-af1d-11e8-8305-002170456035  ONLINE       0     0     0
          gptid/82fb0551-af1d-11e8-8305-002170456035  ONLINE       0     0     0
          gptid/83eb23d4-af1d-11e8-8305-002170456035  ONLINE       0     0     0

errors: No known data errors

  pool: freenas-boot
 state: ONLINE
  scan: scrub repaired 0 in 0 days 00:00:14 with 0 errors on Wed Sep 26 03:45:14 2018
config:

        NAME          STATE     READ WRITE CKSUM
        freenas-boot  ONLINE       0     0     0
          ada3p2      ONLINE       0     0     0

errors: No known data errors
root@SRVRGROSS02:~ #

 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
Yep, there's your problem. RobzData is a six-disk striped pool with no redundancy, so you can't offline any of the disks. When (not if) a disk fails, you will lose all your data on that pool.
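The way to read that from the zpool status output: every disk in RobzData sits directly under the pool name, with no mirror-N or raidzN-N line grouping them. A rough Python sketch of the same check, assuming the usual zpool status layout (top-level vdevs indented two columns under the pool row) and ignoring logs, cache and spares sections; the function name and the abridged sample text are made up for illustration:

```python
import re

# Top-level vdev names that indicate redundancy, e.g. "mirror-0", "raidz2-1".
REDUNDANT = re.compile(r"^(mirror|raidz[123]?)-\d+$")

def bare_disk_vdevs(status_text, pool):
    """Return the top-level vdevs of `pool` that are plain disks."""
    vdevs, base, in_pool = [], None, False
    for raw in status_text.splitlines():
        line = raw.expandtabs(8)
        text = line.strip()
        if text.startswith("pool:"):
            # A new pool section starts; track only the one we were asked about.
            in_pool = text.split(":", 1)[1].strip() == pool
            base = None
            continue
        if not in_pool or not text:
            continue
        fields = text.split()
        indent = len(line) - len(line.lstrip())
        if base is None:
            # Skip state/scan/config/header lines until the pool's own row.
            if fields[0] == pool and len(fields) >= 2:
                base = indent
            continue
        if indent <= base:
            # "errors:" line or a logs/cache/spares section: stop here.
            in_pool = False
            continue
        if indent == base + 2 and not REDUNDANT.match(fields[0]):
            vdevs.append(fields[0])
    return vdevs

# Abridged from the output posted above (two of the six disks).
SAMPLE = """\
  pool: RobzData
 state: ONLINE
config:

\tNAME                                          STATE     READ WRITE CKSUM
\tRobzData                                      ONLINE       0     0     0
\t  gptid/7e4e06f3-af1d-11e8-8305-002170456035  ONLINE       0     0     0
\t  gptid/8061d888-af1d-11e8-8305-002170456035  ONLINE       0     0     0

errors: No known data errors
"""

print(bare_disk_vdevs(SAMPLE, "RobzData"))
```

Any name this returns is a disk the pool cannot lose, which is why the offline request is refused with "no valid replicas".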

If you have a spare SATA port, you can replace the questionable disk without taking it offline first--install the replacement disk, go to Volume Status, select the appropriate disk, click Replace, and pick the new disk in the pop-up window. Once the replacement is complete, the old disk will drop offline and you can remove it. But in the long term, back up, destroy the pool, recreate it with redundancy, and restore.
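For reference, at the ZFS level the Replace action above corresponds to a zpool replace of the old device by the new one. The sketch below only builds that command line and does not run it. On FreeNAS the GUI remains the route to use, since (as far as I understand) it also partitions and labels the new disk first. The device name ada6 is a made-up placeholder for the new disk, and the helper name is hypothetical:

```python
import shlex

def zpool_replace_argv(pool, old_device, new_device):
    """Build (but do not run) a `zpool replace` command line."""
    return ["zpool", "replace", pool, old_device, new_device]

# "ada6" is a hypothetical name for the newly installed disk.
argv = zpool_replace_argv(
    "RobzData",
    "gptid/8061d888-af1d-11e8-8305-002170456035",
    "ada6",
)
print(" ".join(shlex.quote(arg) for arg in argv))
```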
 

RobbieG1977

Dabbler
Joined
Jul 20, 2018
Messages
19
Good morning. I followed your advice and got as far as replacing the drive. After roughly 3 hours, I got an error, as seen in the attached screenshot. I don't know how to retrieve more details about that error, so maybe you can help me with that.

Thanks,
Rob
 

Attachments

  • diskreplacementerror.PNG (44.2 KB)

HoneyBadger

actually does care
Administrator
Moderator
iXsystems
Joined
Feb 6, 2014
Messages
5,112
Back up any critical data off the array to another disk as soon as possible.

Device ada0 is failing, as can be seen by the 8 uncorrectable sectors. It also tripped an alert for temperature (which may well have caused the failing sectors), and when that drive goes it will likely take much or all of your pool with it.

Once you have gotten your critical data off, go to a shell/SSH and run smartctl -a /dev/ada0 and check your drive's reported temperature.
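If you'd rather not read the whole smartctl report by eye, the lines of interest are in the vendor attribute table. A small Python sketch that pulls the raw values out of that table, assuming the common ten-column ATA layout (ID#, ATTRIBUTE_NAME, ..., RAW_VALUE) and integer raw values. The function name and the sample text are illustrative: the 8 uncorrectable sectors match the alert above, but the temperature figure is invented, not taken from this drive.

```python
WANTED = ("Temperature_Celsius", "Current_Pending_Sector", "Offline_Uncorrectable")

def smart_raw_values(smartctl_text, wanted=WANTED):
    """Map attribute name -> raw value for the attributes in `wanted`."""
    values = {}
    for line in smartctl_text.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns;
        # the raw value is the 10th (extra text such as "(Min/Max ...)" follows it).
        if len(fields) >= 10 and fields[0].isdigit() and fields[1] in wanted:
            values[fields[1]] = int(fields[9])
    return values

# Hypothetical excerpt in the shape of `smartctl -a /dev/ada0` output.
SMART_SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
194 Temperature_Celsius     0x0022   048   040   000    Old_age   Always       -       52 (Min/Max 20/60)
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       8
"""

print(smart_raw_values(SMART_SAMPLE))
```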
 

RobbieG1977

Dabbler
Joined
Jul 20, 2018
Messages
19
I am not concerned with the temp right now, because I had to use a SATA power connector to power up the new disk that will replace ada0. Once I get this disk replaced, those errors will go away. I am just having trouble replacing the bad disk. Once I get the old disk out and the new disk in, I will figure out a redundancy plan for my data and plug the fans back in.
 

HoneyBadger

actually does care
Administrator
Moderator
iXsystems
Joined
Feb 6, 2014
Messages
5,112
High drive temperature causes failures, and /dev/ada0 (the failing drive) is the one throwing a temperature alert.

Once I get this disk replaced, those errors will go away.

Any errors on your pool or the files within are permanent, as you have no redundancy to rebuild them from.

Again - your priority should be getting any data you cannot afford to lose off that array right now.
 

RobbieG1977

Dabbler
Joined
Jul 20, 2018
Messages
19
Yes, I am aware of that. All the data I need is backed up elsewhere; I have it saved in another location. So right now my concern is replacing this drive. Once I do that, I can reconstruct the pool with redundancy and fix the temp issues.
 