This happens intermittently on systems which repeatedly enter and leave the hive shell. For example several dozen hive shell jobs in serial. For some of these jobs hive will hang and the process will remain frozen.
The thread dump is
cluster.close() is blocked
No reproduction with patched Jar after 45 minutes. Previously I could repo this in about 5 minutes. I will continue to run the test.
Still no repo at 1 hour 20 minutes
Still working well at 5 hours in
I'm +1 on this fix, but i'll let this run overnight just in case
Ran for about 24 hours no problems!
@Russell @Olivier Michallat can you attach the path to this ticket, I will include it in my custom build.