[nmglug] gsmartcontrol Weirdness

Alucard alucard at swcp.com
Wed Apr 24 15:52:24 PDT 2019


Hi Brian,
That 1 TB WD Blue SSD isn't reporting SMART data properly. Or at least 
they're not following the SMART standard(s). Using the Western Digital 
SSD Dashboard ( Windows only :-( ) you might get better health metrics. 
Going off the data you sent I wouldn't be able to say for sure that the 
drive is okay. But it appears to be.
187 Reported_Uncorrect      -O--CK   100   100   ---    -    0
^^^^
That's a very good sign. Anything more than 0 is bad.

232 Available_Reservd_Space PO--CK   100   100   004    -    100
^^^^
That's also a good sign. Similar to the one below.

233 Media_Wearout_Indicator -O--CK   100   100   ---    -    3598
^^^^
I normally use that to tell how much life is left. However, I can't tell 
if that is B, KiB, or MiB. Sometimes you see that given as a percent value.

At this point I would rule out the drive as the failure. However, I 
would not recommend WD SSDs. Samsung and Crucial make some pretty 
decently priced SSDs, and I have battle tested those brands.

Maybe you were hacked. Very unlikely though.

Though I still think you should nuke and start over. Were you able to 
look at journalctl to see if you can pin point the failure? Or can you 
not get to a shell prompt from that OS?

Jared


On 4/23/19 5:34 PM, Brian O'Keefe wrote:
>
> Sorry AGAIN for the stream of emails. Output of gsmartcontro extended 
> test:
>
> smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.15.0-47-generic] (local 
> build)
> Copyright (C) 2002-16, Bruce Allen, Christian Franke, 
> www.smartmontools.org
>
> === START OF INFORMATION SECTION ===
> Device Model:     WDC WDS100T2B0A-00SM50
> Serial Number:    181228800969
> LU WWN Device Id: 5 001b44 8b6aebe6a
> Firmware Version: X61130WD
> User Capacity:    1,000,204,886,016 bytes [1.00 TB]
> Sector Size:      512 bytes logical/physical
> Rotation Rate:    Solid State Device
> Form Factor:      2.5 inches
> Device is:        Not in smartctl database [for details use: -P showall]
> ATA Version is:   Unknown(0x0ff0), ACS-4 T13/BSR INCITS 529 revision 5
> SATA Version is:  SATA >3.2 (0x1ff), 6.0 Gb/s (current: 3.0 Gb/s)
> Local Time is:    Tue Apr 23 16:13:00 2019 MDT
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> AAM feature is:   Unavailable
> APM level is:     254 (maximum performance)
> Rd look-ahead is: Enabled
> Write cache is:   Enabled
> ATA Security is:  Disabled, NOT FROZEN [SEC1]
>
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
>
> General SMART Values:
> Offline data collection status:  (0x00)    Offline data collection 
> activity
>                     was never started.
>                     Auto Offline Data Collection: Disabled.
> Self-test execution status:      (   0)    The previous self-test 
> routine completed
>                     without error or no self-test has ever
>                     been run.
> Total time to complete Offline
> data collection:         (    0) seconds.
> Offline data collection
> capabilities:              (0x11) SMART execute Offline immediate.
>                     No Auto Offline data collection support.
>                     Suspend Offline collection upon new
>                     command.
>                     No Offline surface scan supported.
>                     Self-test supported.
>                     No Conveyance Self-test supported.
>                     No Selective Self-test supported.
> SMART capabilities:            (0x0003)    Saves SMART data before 
> entering
>                     power-saving mode.
>                     Supports SMART auto save timer.
> Error logging capability:        (0x01)    Error logging supported.
>                     General Purpose Logging supported.
> Short self-test routine
> recommended polling time:      (   2) minutes.
> Extended self-test routine
> recommended polling time:      (  10) minutes.
>
> SMART Attributes Data Structure revision number: 4
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
>   5 Reallocated_Sector_Ct   -O--CK   100   100   ---    -    0
>   9 Power_On_Hours          -O--CK   100   100   ---    - 2694
>  12 Power_Cycle_Count       -O--CK   100   100   ---    - 2430
> 165 Unknown_Attribute       -O--CK   100   100   ---    - 34380513503
> 166 Unknown_Attribute       -O--CK   100   100   ---    -    1
> 167 Unknown_Attribute       -O--CK   100   100   ---    -    33
> 168 Unknown_Attribute       -O--CK   100   100   ---    -    21
> 169 Unknown_Attribute       -O--CK   100   100   ---    -    564
> 170 Unknown_Attribute       -O--CK   100   100   ---    -    0
> 171 Unknown_Attribute       -O--CK   100   100   ---    -    0
> 172 Unknown_Attribute       -O--CK   100   100   ---    -    0
> 173 Unknown_Attribute       -O--CK   100   100   ---    -    3
> 174 Unknown_Attribute       -O--CK   100   100   ---    - 1698
> 184 End-to-End_Error        -O--CK   100   100   ---    -    0
> 187 Reported_Uncorrect      -O--CK   100   100   ---    -    0
> 188 Command_Timeout         -O--CK   100   100   ---    -    14
> 194 Temperature_Celsius     -O---K   064   048   ---    -    36 
> (Min/Max 9/48)
> 199 UDMA_CRC_Error_Count    -O--CK   100   100   ---    -    0
> 230 Unknown_SSD_Attribute   -O--CK   100   100   ---    - 188980527148
> 232 Available_Reservd_Space PO--CK   100   100   004    -    100
> 233 Media_Wearout_Indicator -O--CK   100   100   ---    - 3598
> 234 Unknown_Attribute       -O--CK   100   100   ---    - 5422
> 241 Total_LBAs_Written      ----CK   253   253   ---    - 4262
> 242 Total_LBAs_Read         ----CK   253   253   ---    - 7820
> 244 Unknown_Attribute       -O--CK   000   100   ---    -    0
>                             ||||||_ K auto-keep
>                             |||||__ C event count
>                             ||||___ R error rate
>                             |||____ S speed/performance
>                             ||_____ O updated online
>                             |______ P prefailure warning
>
> General Purpose Log Directory Version 1
> SMART           Log Directory Version 1 [multi-sector log support]
> Address    Access  R/W   Size  Description
> 0x00       GPL,SL  R/O      1  Log Directory
> 0x01           SL  R/O      1  Summary SMART error log
> 0x02           SL  R/O      2  Comprehensive SMART error log
> 0x03       GPL     R/O      1  Ext. Comprehensive SMART error log
> 0x04       GPL,SL  R/O      8  Device Statistics log
> 0x06           SL  R/O      1  SMART self-test log
> 0x07       GPL     R/O      1  Extended self-test log
> 0x10       GPL     R/O      1  SATA NCQ Queued Error log
> 0x11       GPL     R/O      1  SATA Phy Event Counters log
> 0x30       GPL,SL  R/O      9  IDENTIFY DEVICE data log
> 0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
> 0xde       GPL     VS       8  Device vendor specific log
>
> SMART Extended Comprehensive Error Log Version: 1 (1 sectors)
> No Errors Logged
>
> SMART Extended Self-test Log Version: 1 (1 sectors)
> Num  Test_Description    Status                  Remaining 
> LifeTime(hours)  LBA_of_first_error
> # 1  Extended offline    Completed without error       00% 2694         -
> # 2  Short offline       Completed without error       00% 2693         -
>
> Selective Self-tests/Logging not supported
>
> SCT Commands not supported
>
> Device Statistics (GP Log 0x04)
> Page  Offset Size        Value Flags Description
> 0x01  =====  =               =  ===  == General Statistics (rev 1) ==
> 0x01  0x008  4            2430  ---  Lifetime Power-On Resets
> 0x01  0x010  4               0  ---  Power-on Hours
> 0x01  0x018  6      8939654830  ---  Logical Sectors Written
> 0x01  0x020  6       440746242  ---  Number of Write Commands
> 0x01  0x028  6     16400872050  ---  Logical Sectors Read
> 0x01  0x030  6      1053092276  ---  Number of Read Commands
> 0x07  =====  =               =  ===  == Solid State Device Statistics 
> (rev 1) ==
> 0x07  0x008  1               0  N--  Percentage Used Endurance Indicator
>                                 |||_ C monitored condition met
>                                 ||__ D supports DSN
>                                 |___ N normalized value
>
> SATA Phy Event Counters (GP Log 0x11)
> ID      Size     Value  Description
> 0x0001  4            0  Command failed due to ICRC error
> 0x0002  4            0  R_ERR response for data FIS
> 0x0005  4            0  R_ERR response for non-data FIS
> 0x000a  4            3  Device-to-host register FISes sent due to a 
> COMRESET
>
> On 4/23/19 10:15 AM, Alucard wrote:
>> Hi Brian,
>>
>> My 2¢.
>>
>> If my OS was that hosed, I would just start over. There is a point 
>> where the rabbit hole gets too deep to climb out of.
>>
>> Have you looked at the hard drive to see if that HDD/SSD is failing?
>>
>> sudo apt install smartmontools
>> sudo smartctl --all /dev/sda
>>
>> If you have never looked at SMART data, then you will want to 
>> probably send us/me the output. You will probably be better off doing 
>> this from a live USB/CD. If you want a GUI for smartmontools, look at 
>> GSmartControl.
>>
>> If the drive is failing, then fixing the OS is a moot point.
>>
>> Regards,
>>
>> Jared
>>
>> On 4/22/19 9:45 PM, Brian O'Keefe wrote:
>>>
>>> Thanks Harold. Responses inserted for ease and clarity. Many thanks 
>>> again
>>>
>>> On 4/22/19 9:23 PM, Harold Furbiter wrote:
>>>> Here is a receipe for boot from an older kernel, and how to set it 
>>>> to a default.
>>>>
>>>> If you have a few Kernels in your system you can set manually what 
>>>> Kernel version will start:
>>>>
>>>> 1.
>>>>
>>>>     Reboot your PC with pressed Shift button for display GRUB after
>>>>     BIOS will start. You will see something like: GRUB start page
>>>>     <https://i.stack.imgur.com/sSCzp.png>
>>>>
>>> /I have booted into older kernels or safe mode this way in the pas. 
>>> Now I cannot reboot as all that comes up is the them color screen, 
>>> no login, nothing. So I have to do a hard shutdown. I do not get a 
>>> Grub menu holding down shift key upon starting. I get a flat theme 
>>> color screen. Nothing more/
>>>>
>>>> 1.
>>>>
>>>>     Select "Advanced options for Ubuntu" and memorize index of this
>>>>     menu line(count starts from 0) On the picture index is 1
>>>>
>>> /Since I can't access Grub menu I can't do any of the following 
>>> except edit grub setup file, which I have not done because of above 
>>> issue/
>>>>
>>>>       2.      Select concrete Kernel 
>>>> <https://i.stack.imgur.com/yYhnM.png>
>>>>
>>>> 3.
>>>>
>>>>     Select concrete kernel for boot and also memorize index of this
>>>>     menu line(count starts from 0) On the picture index of chosen
>>>>     Kernel is 2
>>>>
>>>> 4.
>>>>
>>>>     Start system. This action is for one boot on concrete kernel.
>>>>     If you want to start from concrete Kernel all time you should
>>>>     do next steps:
>>>>
>>>> 4.1. Open and edit GRUB setup file:
>>>>
>>>> |sudo nano /etc/default/grub |
>>>>
>>>> 4.2. Find line GRUB_DEFAULT=...(by default GRUB_DEFAULT=0) and sets 
>>>> in quotes menu path to concrete Kernel(Remember menu indexes from 
>>>> steps 2 and 3). In my system first index was 1 and second was 2. I 
>>>> set in to GRUB_DEFAULT
>>>>
>>>> |GRUB_DEFAULT="1>2" |
>>>>
>>>> Save file.
>>>>
>>>> 4.3. Update GRUB information for apply changes:
>>>>
>>>> |sudo update-grub |
>>>>
>>>> 4.4. After reboot you automatically boot on Kernel by chosen menu 
>>>> path. An example on my machine 1 -> 2
>>>>
>>>> 4.5. Check Kernel version after reboot:
>>>>
>>>> uname -r
>>>>
>>>> *Sent:* Thursday, April 18, 2019 at 10:54 PM
>>>> *From:* "Brian O'Keefe" <okeefe at cybermesa.com>
>>>> *To:* nmglug at lists.nmglug.org
>>>> *Subject:* Re: [nmglug] Weirdness
>>>>
>>>> Thanks again,
>>>>
>>>> Tried them all to no avail
>>>>
>>>> On 4/15/19 6:33 PM, Harold Furbiter wrote:
>>>>
>>>>     You could try:
>>>>     sudo /sbin/init 6   (see if it reboots)
>>>>
>>>> reboots to a blank, colored screen, no log in
>>>>
>>>>     sudo /sbin/init 1   (see if it give you a prompt)
>>>>
>>>> Gives me a rescue mode prompt that I cannot use as I can't enter 
>>>> any of the options
>>>>
>>>>     maybe su to root and try init
>>>>     sudo su - root  (you need '- root' to insure your path is root's)
>>>>     you might try booting from a older or oldest kernel:
>>>>
>>>> Can't access the grub menu for older kernels. Using shift key 
>>>> during boot just gives me the blank colored screen
>>>>
>>>>     dpkg -l | grep linux-image | awk '{print$2}'
>>>>     Gives you a list of bootable kernels available on your system.
>>>>     Try booting from the oldest version.
>>>>
>>>> Is there a way to reboot from a terminal with an older kernel as I 
>>>> cannot access the grub menu. this is a recent issue, within the 
>>>> last couple of weeks.
>>>>
>>>>     Cheers,
>>>>     *Sent:* Saturday, April 13, 2019 at 8:32 AM
>>>>     *From:* "Harold Furbiter" <wwcorigan at mail.com>
>>>>     *To:* nmglug at lists.nmglug.org
>>>>     *Subject:* Re: [nmglug] Weirdness
>>>>     Then you have a corrupt kernel. On boot try booting to an older
>>>>     kernel.
>>>>     https://askubuntu.com/questions/82140/how-can-i-boot-with-an-older-kernel-version
>>>>     *Sent:* Friday, April 12, 2019 at 6:15 PM
>>>>     *From:* "Brian O'Keefe" <okeefe at cybermesa.com>
>>>>     *To:* nmglug at lists.nmglug.org
>>>>     *Subject:* Re: [nmglug] Weirdness
>>>>
>>>>     Thanks for the input Harold,
>>>>
>>>>     Same result, just hangs on the splash screen. Can't logout,
>>>>     esc. key does nothing, can't switch to a console. Just have to
>>>>     do a hard shutdown.
>>>>
>>>>     On 4/11/19 8:59 PM, Harold Furbiter wrote:
>>>>
>>>>         Out of curiousity have you tried init 0 ?
>>>>         *Sent:* Wednesday, April 10, 2019 at 11:26 AM
>>>>         *From:* "Brian O'Keefe" <okeefe at cybermesa.com>
>>>>         *To:* "NMGLUG.org mailing list" <nmglug at nmglug.org>
>>>>         *Subject:* [nmglug] Weirdness
>>>>
>>>>         Hi All,
>>>>
>>>>         Since I may not make meeting (niece visiting and Thurs. is
>>>>         her last day) I'm putting my issue out for comment and
>>>>         hopefully answers. As many of you know I used to updgrade
>>>>         instead of clean installs and did that since Ubuntu 6.04. I
>>>>         had also added many apps from third parties and also
>>>>         modified many, many conf files to keep things working. I
>>>>         had a meltdown and lost much of my data but following a
>>>>         partial recovery, thanks to a certain group member, I
>>>>         installed a clean version of 18.04 onto a new 1TB SSD. I
>>>>         have ot tinkered at all with 3rd party software nor
>>>>         modified any conf files or been a bad boy in any way!
>>>>
>>>>         My issue ids that I cannot shut down my box in anyway other
>>>>         than a hard shutdown. I also cannot restart it. I have
>>>>         tried the GUI option as well as switching to text mode and
>>>>         using "sudo shutdown now" or "sudo restart now". In those
>>>>         cases I get the splash screen with the "traveling lights"
>>>>         and Unutu but it hangs there. The traveling dots hang on
>>>>         the first dot of the splash screen and nothing happens. I
>>>>         had hoped that text mode would give give me an indication
>>>>         of the issues but I can't stay in that mode for some reason.
>>>>
>>>>         Thanks for any help.
>>>>
>>>>         Brian
>>>>
>>>>         --
>>>>         _______________________________________________ nmglug
>>>>         mailing list nmglug at lists.nmglug.org
>>>>         http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>>>
>>>>         _______________________________________________
>>>>         nmglug mailing list
>>>>         nmglug at lists.nmglug.org
>>>>         http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>>>
>>>>     --
>>>>     _______________________________________________ nmglug mailing
>>>>     list nmglug at lists.nmglug.org
>>>>     http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>>>     _______________________________________________ nmglug mailing
>>>>     list nmglug at lists.nmglug.org
>>>>     http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>>>
>>>>     _______________________________________________
>>>>     nmglug mailing list
>>>>     nmglug at lists.nmglug.org
>>>>     http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>>>
>>>> --
>>>> _______________________________________________ nmglug mailing list 
>>>> nmglug at lists.nmglug.org 
>>>> http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>>>
>>>> _______________________________________________
>>>> nmglug mailing list
>>>> nmglug at lists.nmglug.org
>>>> http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>> -- 
>>>
>>> _______________________________________________
>>> nmglug mailing list
>>> nmglug at lists.nmglug.org
>>> http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
>>
>>
>> _______________________________________________
>> nmglug mailing list
>> nmglug at lists.nmglug.org
>> http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org
> -- 
>
> _______________________________________________
> nmglug mailing list
> nmglug at lists.nmglug.org
> http://lists.nmglug.org/listinfo.cgi/nmglug-nmglug.org

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.nmglug.org/pipermail/nmglug-nmglug.org/attachments/20190424/e385b719/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: jnliejnbgoejhpcm.png
Type: image/png
Size: 3148 bytes
Desc: not available
URL: <http://lists.nmglug.org/pipermail/nmglug-nmglug.org/attachments/20190424/e385b719/attachment-0002.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.png
Type: image/png
Size: 3148 bytes
Desc: not available
URL: <http://lists.nmglug.org/pipermail/nmglug-nmglug.org/attachments/20190424/e385b719/attachment-0003.png>


More information about the nmglug mailing list