While I still disagree, I won't say it is impossible for a 192 error to cause a 199 error, stranger things have happened and I definitely do not know it all, and it is okay to disagree. This wouldn't be a learning forum if we didn't challenge each other from time to time.
As for it being a hardware problem, I completely agree for the UDMA CRC Errors, but what hardware specifically? We do not know that just yet and the log file does not provide that data, nether does the current SMART data. So I will request the extended SMART data
smartctl -x /dev/ada3
be provided.
The most likely suspect is the SATA cable, either the connection at the motherboard, the connection at the drive, or possibly the cable has just failed (yes, it actually does happen out of the blue). Any one of the three are very likely the issue.
Those 199 UDMA CRC Errors could have occured at any time, maybe the extended data will show when they occured. If the data does show these errors happening and if it's not been fairly recent, then I'd say the problem is gone for now. You might be able to link it to when the system was built, these things do happen especially when someone's hands are in the case bumping cables or just moving the case around. I prefer the better quality locking SATA data cables, and as short as possible for the distance needed.
An easier way to grab data off the NAS if using the GUI, or SSH as well, send an email to yourself. This is very handy so long as your email is setup in TrueNAS already and this can be used for many things.
Here is how:
smartctl -x /dev/ada3 | mail -s "ada3 SMART" your_email@address.com
"ada3 SMART" = Subject Line Text
your_email@address.com = The email address to send the test data to.
Another example:
zpool status -v | mail -s "Zpool Status" your_email@address.com
As for IDs 174 and 192 "Unsafe_Shutdown_Count" that is typically caused by an improper shutdown of the system. The power is removed to the SSD before the system tells the drive to shutdown. An easy to see way to create this issue is to hold down the power button on the computer and force it to power off. The UPS unexpectedly powers off. Maybe TrueNAS software is not flagging the drive properly. In this last case I would monitor these values and see if they continue to increase and if you can tie them to an event, even better.
For comparison please provide the same extended data for the other drives as they are the same Model drive, comparing this data would definitely shed more light on the 174 and 192 issue and if it is a single drive issue or a whole system issue.
You may find out that a Jira ticket needs to be submitted.