SQLite Data Corruption testing

limetech · October 31, 2019

1 hour ago, Scorpionhl said:

Could you elaborate on the long term fix? should I be adding this md command to my go file?

No 6.8.0-rc5 will have permanent fix.

Scorpionhl · October 31, 2019

2 minutes ago, limetech said:

No 6.8.0-rc5 will have permanent fix.

Great! thanks for help, info, and fix

WizADSL · November 1, 2019

Based on what was happening with the Sqlite databases, could this have caused corruption in other files on the array?

bbolinger · November 2, 2019

A little over a month ago I was having daily SQL errors with my Plex / Sonarr setup. I had to completely stop adding any new media and have been following this and many other threads looking for a fix. Last night I updated to 6.8.0-rc5 and have been really stressing the database today with TV and Movie updates over the past few months. So far no errors, thank you!!!

Edited November 2, 2019 by bbolinger

TXLZONE · November 5, 2019

Just installed 6.8.0-rc5 and I have to say all of the major show-stopper issues I had with 6.7 have been resolved (minus a few small bugs). Performance is really good and no corruption yet. Thank you guys for all the hard work you put into figuring this out!

11rcombs · November 6, 2019

On 10/31/2019 at 8:48 AM, limetech said:

To not ever fail read aheads.

So in the end, what was the actual bug here, and how did it manifest? I'm mostly wondering if there's anything libsqlite's doing that relies on particular implementation-defined kernel behavior that isn't actually guaranteed, in which case I'd want to report that to the sqlite devs with a description of a repro case.

limetech · November 6, 2019

16 hours ago, 11rcombs said:

So in the end, what was the actual bug here, and how did it manifest? I'm mostly wondering if there's anything libsqlite's doing that relies on particular implementation-defined kernel behavior that isn't actually guaranteed, in which case I'd want to report that to the sqlite devs with a description of a repro case.

The corruption occurred as a result of failing a read-ahead I/O operation with "BLK_STS_IOERR" status.

In the Linux block layer each READ or WRITE can have various modifier bits set. In the case of a read-ahead you get READ|REQ_RAHEAD which tells I/O driver this is a read-ahead. In this case, if there are insufficient resources at the time this request is received, the driver is permitted to terminate the operation with BLK_STS_IOERR status. Here is an example in Linux md/raid5 driver.

In case of Unraid it can definitely happen under heavy load that a read-ahead comes along and there are no 'stripe buffers' immediately available. In this case, instead of making calling process wait, it terminated the I/O. This has worked this way for years.

When this problem first happened there were conflicting reports of the config in which it happened. My first thought was an issue in user share file system. Eventually ruled that out and next thought was cache vs. array. Some reports seemed to indicate it happened with all databases on cache - but I think those reports were mistaken for various reasons. Ultimately decided issue had to be with md/unraid driver. Our big problem was that we could not reproduce the issue but others seemed to be able to reproduce with ease.

Honestly, thinking failing read-aheads could be the issue was a "hunch" - it was either that or some logic in scheduler that merged I/O's incorrectly (there were kernel bugs related to this with some pretty extensive patches and I thought maybe developer missed a corner case - this is why I added config setting for which scheduler to use). This resulted in release with those 'md_restrict' flags to determine if one of those was the culprit, and what-do-you-know, not failing read-aheads makes the issue go away.

What I suspect is that this is a bug in SQLite - I think SQLite is using direct-I/O (bypassing page cache) and issuing it's own read-aheads and their logic to handle failing read-ahead is broken. But I did not follow that rabbit hole - too many other problems to work on

Duniac · November 7, 2019

I have just updated to 6.8 RC5 and am in the process of reinstalling the Plex docker. I've read that RC5 includes the fix, but are there any specific config items I need to set?

Duniac · November 7, 2019

Many have written that they have backed up their Plex database, can someone please point me to the location of this?

Rich Minear · November 7, 2019

1 hour ago, Duniac said:

Many have written that they have backed up their Plex database, can someone please point me to the location of this?

Use the terminal tool built into Unraid. Once running in a new window, you will need to know the appdata location for your system. Mine is /mnt/disk1/appdata.

So I can cd to /mnt/disk1/appdata/PlexMediaServer/Library/Application Support/Plex Media Server/Plug-in Support/Databases.

com.plexapp.plugins.library.db is the main database. You can make a copy of this by just doing a cp of this file to another file name. Plex will make backup copies also...I believe every 3 days. They should have the date appended to the name.

If you need to fall back to one of these, you have to stop the Plex docker, and then copy the file with date appended back to the name of the main database.

TheBuz · November 7, 2019

2 minutes ago, Rich Minear said:

Use the terminal tool built into Unraid. Once running in a new window, you will need to know the appdata location for your system. Mine is /mnt/disk1/appdata.

So I can cd to /mnt/disk1/appdata/PlexMediaServer/Library/Application Support/Plex Media Server/Plug-in Support/Databases.

com.plexapp.plugins.library.db is the main database. You can make a copy of this by just doing a cp of this file to another file name. Plex will make backup copies also...I believe every 3 days. They should have the date appended to the name.

If you need to fall back to one of these, you have to stop the Plex docker, and then copy the file with date appended back to the name of the main database.

I use CA Backup / Restore Appdata, it can keep older versions aswell

Duniac · November 7, 2019

For previous Plex dockers I have spread it across all disks, should I limit it to only one disk?

Rich Minear · November 7, 2019

5 hours ago, Duniac said:

For previous Plex dockers I have spread it across all disks, should I limit it to only one disk?

So that is a good question. When I was 6.6.7, I had it spread across disks also. I was asked to move it one disk as part of the testing. I've had a couple of people tell me why, but I never really understood the reasoning.

My guess would be that if rc5 is stable like 6.6.7, it would not make any difference. But someone else may have to chime in to school us on this. 🙂

jbartlett · November 7, 2019

Sounds like it was to eliminate disk spin up delays. If they're backup copies and not your live version, it doesn't matter how you store it or where.

Rich Minear · November 7, 2019

1 minute ago, jbartlett said:

Sounds like it was to eliminate disk spin up delays. If they're backup copies and not your live version, it doesn't matter how you store it or where.

This is the appdata area. All of the databases are there. As for disk spin up delays...you have that if you are on a single disk also and the disk is quiet. Unless it is an SSD

Rich Minear · November 9, 2019

Just an FYI for this forum: It has been 16 days since I have had any corruption with Plex. That only happened with 6.6.7. With anything newer, it would corrupt in less than a day. 6.8.0-rc4 and rc5 have been rock stable with the changes that were made. I'm glad that I stuck with the testing, and was able to work so close with the Unraid team. 🙂

Scythe · November 14, 2019

Thanks so much @Rich Minear for sticking it out and doing all that testing for the rest of us. So glad we've got a resolution and I can confidently start thinking about moving off 6.6.7

NeoMatrixJR · November 16, 2019

So...for someone late to the party...I've seen sqlite errors in my plex logs somewhere but it's been a while and I can't remember how to look this up. Where do I check, and...if I'm getting them...should I go rc6? Is it stable enough. Once there, how can I correct my possibly corrupt DBs? Does anything else use sqlite that I should be aware of that's had this issue? Sorry if this is all covered, but I'm still trying to get through all of this post and this info might make a good sticky of the tl;dr type.

ZooMass · November 20, 2019

Super happy to see that we have resolved the issue! Thank you @Rich Minear @limetech and everyone else! I look forward to finally confidently upgrading from 6.6.7!

mrtech213 · December 10, 2019

Hey,

I upgraded to 6.8.0-rc9 from 6.7.2 cause I started seeing the SQLite exceptions starting today. So I made the jump but for some reason, I'm still seeing these exceptions and seeing lots of " database disk image is malformed " in Sonarr

Also my plex is not letting me see anything. Just see this in my dashboard: 624ab6102bfd17843696e3fb4803ec42

Please help not sure what I'm doing wrong!!!!

wakanda-diagnostics-20191210-1637.zip

BRiT · December 10, 2019

You need to start with a known valid database without corruption, so if you dont have a backup you will have to start fresh and let plex build the database from scratch.

mrtech213 · December 10, 2019

Ok I'll do that right now and let you know the results

mrtech213 · December 11, 2019

I did a fresh install of plex and used my backups to restore some settings but the server is not detecting newly added movies. I'm able to play videos but its not seeing new movies

also within sonarr, I'm still seeing lots of SQLite exceptions------------>https://gyazo.com/9d2fd05d4c8f109510e66fa3325d6902

mrtech213 · December 11, 2019

I'm assume that I'll need to reinstall mainly all the dockers???

Rick Gillyon · December 11, 2019

27 minutes ago, mrtech213 said:

I'm assume that I'll need to reinstall mainly all the dockers???

Reinstalling them won't do anything. You need to shut them down and delete the appdata.

Or restore a known good backup.

Edited December 11, 2019 by Rick Gillyon

SQLite Data Corruption testing

User Feedback

Recommended Comments

limetech 3328

Link to comment

Scorpionhl 3

Link to comment

WizADSL 3

Link to comment

bbolinger 1

Link to comment

TXLZONE 4

Link to comment

11rcombs 2

Link to comment

limetech 3328

Link to comment

Duniac 0

Link to comment

Duniac 0

Link to comment

Rich Minear 33

Link to comment

TheBuz 4

Link to comment

Duniac 0

Link to comment

Rich Minear 33

Link to comment

jbartlett 275

Link to comment

Rich Minear 33

Link to comment

Rich Minear 33

Link to comment

Scythe 1

Link to comment

NeoMatrixJR 0

Link to comment

ZooMass 3

Link to comment

mrtech213 4

Link to comment

BRiT 408

Link to comment

mrtech213 4

Link to comment

mrtech213 4

Link to comment

mrtech213 4

Link to comment

Rick Gillyon 12

Link to comment

Join the conversation