[ovirt-users] Replica2 stripe2 hang on write to VM disk
paf1 at email.cz
paf1 at email.cz
Mon May 25 16:02:53 UTC 2015
Hello,
can anybody help me with hanging replica2 stripe2 datastore on 4 nodes
cluster ??
oVirt - ovirt-engine-lib-3.5.2.1-1.el7.centos.noarch
gluster - glusterfs-server-3.7.0-2.el7.x86_64
VM - Centos 7.1
If I use any bigger write to VM disk ( eg 2-5GB ) storage hosted virtual
disk will hang = I/O error
created by :
gluster volume create 12KVM12SC4 replica 2 stripe 2
16.0.0.161:/STORAGES/SlowClass/p4/GFS1
16.0.0.162:/STORAGES/SlowClass/p4/GFS1
16.0.0.163:/STORAGES/SlowClass/p4/GFS1
16.0.0.164:/STORAGES/SlowClass/p4/GFS1
rhev-data-center-mnt-glusterSD-localhost:_12KVM12SC4.log
-----------------------------------------------------------------------------------
[2015-05-25 14:47:24.205609] I [rpc-clnt.c:1807:rpc_clnt_reconfig]
0-12KVM12SC4-client-3: changing port to 49158 (from 0)
[2015-05-25 14:47:24.210824] I
[client-handshake.c:1405:select_server_supported_programs]
0-12KVM12SC4-client-3: Using Program GlusterFS 3.3, Num (1298437),
Version (330)
[2015-05-25 14:47:24.211204] I
[client-handshake.c:1193:client_setvolume_cbk] 0-12KVM12SC4-client-3:
Connected to 12KVM12SC4-client-3, attached to remote volume
'/STORAGES/SlowClass/p4/GFS1'.
[2015-05-25 14:47:24.211225] I
[client-handshake.c:1203:client_setvolume_cbk] 0-12KVM12SC4-client-3:
Server and Client lk-version numbers are not same, reopening the fds
[2015-05-25 14:47:24.211275] I [MSGID: 108005]
[afr-common.c:3880:afr_notify] 0-12KVM12SC4-replicate-1: Subvolume
'12KVM12SC4-client-3' came back up; going online.
[2015-05-25 14:47:24.216465] I [fuse-bridge.c:5077:fuse_graph_setup]
0-fuse: switched to graph 0
[2015-05-25 14:47:24.216556] I
[client-handshake.c:187:client_set_lk_version_cbk]
0-12KVM12SC4-client-3: Server lk version = 1
[2015-05-25 14:47:24.216643] I [fuse-bridge.c:4007:fuse_init]
0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.22
kernel 7.22
[2015-05-25 14:47:24.217998] I
[afr-common.c:1673:afr_local_discovery_cbk] 0-12KVM12SC4-replicate-0:
selecting local read_child 12KVM12SC4-client-0
[2015-05-25 14:47:29.737732] W [fuse-bridge.c:1080:fuse_setattr_cbk]
0-glusterfs-fuse: 40: SETATTR() /__DIRECT_IO_TEST__ => -1 (Read-only
file system)
[2015-05-25 14:49:18.266212] E
[client-handshake.c:1488:client_query_portmap_cbk]
0-12KVM12SC4-client-2: failed to get the port number for remote
subvolume. Please run 'gluster volume status' on server to see if brick
process is running.
[2015-05-25 14:49:18.266274] I [client.c:2086:client_rpc_notify]
0-12KVM12SC4-client-2: disconnected from 12KVM12SC4-client-2. Client
process will keep trying to connect to glusterd until brick's port is
available
[2015-05-25 14:49:19.346555] I [rpc-clnt.c:1807:rpc_clnt_reconfig]
0-12KVM12SC4-client-2: changing port to 49158 (from 0)
[2015-05-25 14:49:19.351812] I
[client-handshake.c:1405:select_server_supported_programs]
0-12KVM12SC4-client-2: Using Program GlusterFS 3.3, Num (1298437),
Version (330)
[2015-05-25 14:49:19.352169] I
[client-handshake.c:1193:client_setvolume_cbk] 0-12KVM12SC4-client-2:
Connected to 12KVM12SC4-client-2, attached to remote volume
'/STORAGES/SlowClass/p4/GFS1'.
[2015-05-25 14:49:19.352191] I
[client-handshake.c:1203:client_setvolume_cbk] 0-12KVM12SC4-client-2:
Server and Client lk-version numbers are not same, reopening the fds
[2015-05-25 14:49:19.352242] I [MSGID: 108002]
[afr-common.c:3959:afr_notify] 0-12KVM12SC4-replicate-1: Client-quorum
is met
[2015-05-25 14:49:19.352353] I
[client-handshake.c:187:client_set_lk_version_cbk]
0-12KVM12SC4-client-2: Server lk version = 1
[2015-05-25 14:49:27.843616] W [fuse-bridge.c:1263:fuse_err_cbk]
0-glusterfs-fuse: 151: REMOVEXATTR() /__DIRECT_IO_TEST__ => -1 (No data
available)
[2015-05-25 14:49:58.356900] W [fuse-bridge.c:1263:fuse_err_cbk]
0-glusterfs-fuse: 327: REMOVEXATTR() /__DIRECT_IO_TEST__ => -1 (No data
available)
# gluster volume status
Status of volume: 12KVM12SC4
Gluster process TCP Port RDMA Port Online Pid
------------------------------------------------------------------------------
Brick 16.0.0.161:/STORAGES/SlowClass/p4/GFS
1 49173 0 Y 17678
Brick 16.0.0.162:/STORAGES/SlowClass/p4/GFS
1 49158 0 Y 19184
Brick 16.0.0.163:/STORAGES/SlowClass/p4/GFS
1 49158 0 Y 9784
Brick 16.0.0.164:/STORAGES/SlowClass/p4/GFS
1 49158 0 Y 9327
NFS Server on localhost 2049 0 Y 17697
Self-heal Daemon on localhost N/A N/A Y 17708
NFS Server on 16.0.0.162 2049 0 Y 19205
Self-heal Daemon on 16.0.0.162 N/A N/A Y 19215
NFS Server on 16.0.0.163 2049 0 Y 9806
Self-heal Daemon on 16.0.0.163 N/A N/A Y 9813
NFS Server on 16.0.0.164 2049 0 Y 9347
Self-heal Daemon on 16.0.0.164 N/A N/A Y 9359
Task Status of Volume 12KVM12SC4
------------------------------------------------------------------------------
There are no active volume tasks
any idea ??
regs.
Pavel
More information about the Users
mailing list