Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple "Possible SleuthkitServer X crash reading..." non problematic files #1187

Closed
lfcnassif opened this issue Jun 23, 2022 · 9 comments · Fixed by #1189
Closed

Multiple "Possible SleuthkitServer X crash reading..." non problematic files #1187

lfcnassif opened this issue Jun 23, 2022 · 9 comments · Fixed by #1189
Assignees
Labels

Comments

@lfcnassif
Copy link
Member

lfcnassif commented Jun 23, 2022

This is related to robustImageReading option. I got above error printed in console several times yesterday for 2 different processings of the same evidence, but it usually doesn't happen for this same evidence. Looking at the log, before the first error there is one "Timeout waiting SleuthkitServer X response! Restarting...". Analyzing the code and log, I found a race condition in timeout control in inter process communication.

I'll push the fix for the race condition to don't trigger the timeout wrongly, but I'm still analyzing why the error in the title happened indefinitely in a loop...

@lfcnassif
Copy link
Member Author

This is important to fix since robustImageReading is enabled by default in 4.0

@lfcnassif
Copy link
Member Author

@tc-wleite since you have used this option for a long time, have you observed similar errors in the console?

@wladimirleite
Copy link
Member

wladimirleite commented Jun 23, 2022

When I saw your first message about this issue, I remembered seeing that error message a couple of times. But I didn't pay much attention as I was testing/working in something else and it disappeared later.
I can't remember any details, but it was definitely rare.

@lfcnassif
Copy link
Member Author

Thanks. Maybe it is less rare when indexTempOnSSD = false, I'm using it to compare to other cases processed the same way, so items would be read multiple times from the external process instead of from the temp folder...

@lfcnassif
Copy link
Member Author

The "Possible SleuthkitServer X crash" message loop in console can be easily reproduced just killing one of them in task manager, it used to be robust against this in the past... Yesterday I tried to refactor some good portion of this code, but it is very sensitive, leading to other kinds of concurrency issues when reading files.., I'll delay this refactoring to after 4.0 and I'll just try to fix the loop.

There is also another area for improvement I remembered: each external TSK process read many files at the same time, switching between them. But if it crashes because of some FS corruption or TSK bug reading some file, or forcibly, that affects all files being read, even those without problems. I think it is possible to resume reading other files from the point they were with some improvements, but not sure if I'll implement that for 4.0...

@lfcnassif
Copy link
Member Author

lfcnassif commented Jun 24, 2022

This is not happening on 3.18.15, removing its tag.

@lfcnassif
Copy link
Member Author

lfcnassif commented Jun 24, 2022

This is not working on 3.18.15, removing its tag.

sorry, not happening!

@lfcnassif lfcnassif removed the 3.18 label Jun 24, 2022
@lfcnassif
Copy link
Member Author

Well, tricky, it takes some minutes to start the problematic loop after the first crash, so this happens with 3.18.15 too.

@lfcnassif lfcnassif added the 3.18 label Jun 24, 2022
@lfcnassif
Copy link
Member Author

Well, tricky, it takes some minutes to start the problematic loop after the first crash, so this happens with 3.18.15 too.

And, after the crash on 3.18.15, the external process is not restarted. Seems the bad loop begins when other external processes are restarted to clean possible resource leaks.

Anyway, 4.0.0-beta seems to be working fine now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants