
On Thu, Nov 10, 2016 at 3:20 PM, Derek Atkins <derek@ihtfp.com> wrote:
Piotr,
On Thu, November 10, 2016 7:01 am, Piotr Kliczewski wrote:
Derek,
I see 2 issues in the engine logs:
2016-11-02 09:53:13,767 WARN [org.ovirt.engine.core.bll.storage.disk.image.GetUnregisteredDiskQuery] (org.ovirt.thread.pool-8-thread-47) [1e5f014c] Exception while parsing JSON for disk. Exception: '{}': org.codehaus.jackson.JsonParseException: Unexpected character ('H' (code 72)): expected a valid value (number, String, array, object, 'true', 'false' or 'null')
 at [Source: java.io.StringReader@482fe2d8; line: 1, column: 2]
    at org.codehaus.jackson.JsonParser._constructError(JsonParser.java:1433) [jackson-core-asl-1.9.13.jar:1.9.13]
    at org.codehaus.jackson.impl.JsonParserMinimalBase._reportError(JsonParserMinimalBase.java:521) [jackson-core-asl-1.9.13.jar:1.9.13]
    at org.codehaus.jackson.impl.JsonParserMinimalBase._reportUnexpectedChar(JsonParserMinimalBase.java:442) [jackson-core-asl-1.9.13.jar:1.9.13]
    at org.codehaus.jackson.impl.ReaderBasedParser._handleUnexpectedValue(ReaderBasedParser.java:1198) [jackson-core-asl-1.9.13.jar:1.9.13]
    at org.codehaus.jackson.impl.ReaderBasedParser.nextToken(ReaderBasedParser.java:485) [jackson-core-asl-1.9.13.jar:1.9.13]
    at org.codehaus.jackson.map.ObjectMapper._initForReading(ObjectMapper.java:2770) [jackson-mapper-asl-1.9.13.jar:1.9.13]
    at org.codehaus.jackson.map.ObjectMapper._readMapAndClose(ObjectMapper.java:2718) [jackson-mapper-asl-1.9.13.jar:1.9.13]
    at org.codehaus.jackson.map.ObjectMapper.readValue(ObjectMapper.java:1877) [jackson-mapper-asl-1.9.13.jar:1.9.13]
    at org.ovirt.engine.core.utils.JsonHelper.jsonToMap(JsonHelper.java:41) [utils.jar:]
    at org.ovirt.engine.core.bll.storage.disk.image.MetadataDiskDescriptionHandler.enrichDiskByJsonDescription(MetadataDiskDescriptionHandler.java:247) [bll.jar:]
Could you explain the above error? I'm not sure I understand what it means. I do, however, know what the following error is about. I attempted to import an OVA file and the file was mode 600 root:root and therefore was not readable by VDSM. Hence the "Errno 13: Permission Denied":
I added the stack traces to the email because I want someone from storage and virt to take a look at those failures. Maybe both were fixed already.
org.ovirt.engine.core.vdsbroker.vdsbroker.VDSErrorException: VDSGenericException: VDSErrorException: Failed to GetOvaInfoVDS, error = [Errno 13] Permission denied: u'/ovirt/import/openafs-fc23-64.ihtfp.org.ova', code = -32603 (Failed with error unexpected and code 16) at
[snip]
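For the "Permission denied" failure, a quick way to confirm the diagnosis is to look at the file's owner and mode before retrying the import. The following is a minimal sketch, not part of VDSM or the engine; the function name `describe_access` is hypothetical. The `readable_by_me` field is only meaningful when the script runs as the account VDSM uses (commonly `vdsm`, which is an assumption here).

```python
import os
import stat


def describe_access(path):
    """Summarize ownership and read permissions of a file (hypothetical helper)."""
    st = os.stat(path)
    return {
        "uid": st.st_uid,
        "gid": st.st_gid,
        # Permission bits only, e.g. '0o600' for a file readable by its owner alone.
        "mode": oct(stat.S_IMODE(st.st_mode)),
        "group_readable": bool(st.st_mode & stat.S_IRGRP),
        "other_readable": bool(st.st_mode & stat.S_IROTH),
        # True if the *current* process may read the file; root can always read it.
        "readable_by_me": os.access(path, os.R_OK),
    }


if __name__ == "__main__":
    import sys

    # Example: python check_ova.py /ovirt/import/openafs-fc23-64.ihtfp.org.ova
    for ova_path in sys.argv[1:]:
        print(ova_path, describe_access(ova_path))
```

A file that is mode 600 and owned by root:root reports `group_readable` and `other_readable` as false, which matches the Errno 13 above for any non-root reader.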
Neither is related to reconnects.
Agreed.
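On the first warning (the JsonParseException): the message says the parser met the character 'H' near the start of the string where a JSON value had to begin, which is consistent with a disk description stored as free text rather than JSON. The sketch below reproduces that class of failure with Python's `json` module instead of Jackson. It is an illustration only, not the engine's code; the function name and the sample strings are hypothetical.

```python
import json


def parse_description(raw):
    """Return the description as a dict, or None when it is not a JSON object."""
    try:
        value = json.loads(raw)
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return None
    return value if isinstance(value, dict) else None


if __name__ == "__main__":
    # Hypothetical inputs: one free-text description, one JSON description.
    print(parse_description("Hand-written disk note"))
    print(parse_description('{"alias": "example_disk"}'))
```

A caller that falls back to treating the text as a plain description when `None` comes back would log a warning like the one above and carry on, which would explain why the entry is a WARN rather than an ERROR. That last point is a guess from the log level, not something confirmed in this thread.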
I see that from time to time connections are reset by peer:
2016-11-04 10:58:43,442 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.GetStatsVDSCommand] (DefaultQuartzScheduler6) [77387d45] Command 'GetStatsVDSCommand(HostName = ovirt-0, VdsIdAndVdsVDSCommandParametersBase:{runAsync='true', hostId='62b75bb9-fbd9-405f-b479-b6ad8cffd5b1', vds='Host[ovirt-0,62b75bb9-fbd9-405f-b479-b6ad8cffd5b1]'})' execution failed: VDSGenericException: VDSNetworkException: Connection reset by peer
which means that vdsm or the host was stopped. The vdsm log you provided does not cover this time, so I am not able to say what caused it.
Yeah, Nov 4 was approximately the time I was installing the systems, so yes, it's not surprising to see some up and down times around then.
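To check whether the resets really cluster around the installation dates, one option is to tally them per day from the engine log. This is a minimal sketch that assumes each log entry starts with a timestamp in the format shown in the excerpt above; the function name `resets_per_day` and the file name `engine.log` are placeholders.

```python
import re
from collections import Counter

# Leading timestamp as seen in the excerpt, e.g. "2016-11-04 10:58:43,442 ".
TIMESTAMP = re.compile(r"^(\d{4}-\d{2}-\d{2}) \d{2}:\d{2}:\d{2},\d{3} ")


def resets_per_day(lines, needle="Connection reset by peer"):
    """Count log entries containing `needle`, grouped by calendar date."""
    counts = Counter()
    for line in lines:
        match = TIMESTAMP.match(line)
        # Only count lines that begin a log entry, not stack-trace continuations.
        if match and needle in line:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    # "engine.log" is a placeholder path; point it at the real engine log.
    with open("engine.log", encoding="utf-8", errors="replace") as log_file:
        for day, count in sorted(resets_per_day(log_file).items()):
            print(day, count)
```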
There are more 'unexpected eof' entries in the logs, but they seem not to be triggered by the engine. It looks like those connections are triggered from the local host. This seems to be related to https://bugzilla.redhat.com/1349829
Is there something I can do to test/verify this?
It needs to be fixed first. I added it as reference for you.
Thanks, Piotr
-derek
On Tue, Nov 8, 2016 at 4:29 PM, Derek Atkins <derek@ihtfp.com> wrote:
Hi,
I'm not sure if you want my logs here or on the bug report (you didn't specify). But here you go. I had to gzip them to get them down to size -- the VDSM log is 18MB just from today (man is it chatty!)
-derek
Piotr Kliczewski <piotr.kliczewski@gmail.com> writes:
Derek,
This error may occur from time to time due to [1], but in your listing it is much too often. Please share engine and vdsm logs so we can see why.
Thanks, Piotr
-- Derek Atkins 617-623-3745 derek@ihtfp.com www.ihtfp.com Computer and Internet Security Consultant