[ovirt-users] Ovirt host activation and lvm looping with high CPU load trying to mount iSCSI storage

Gianluca Cecchi gianluca.cecchi at gmail.com
Thu Jan 12 23:46:29 UTC 2017


On Fri, Jan 13, 2017 at 12:10 AM, Nir Soffer <nsoffer at redhat.com> wrote:

> On Thu, Jan 12, 2017 at 6:01 PM, Nicolas Ecarnot <nicolas at ecarnot.net>
> wrote:
> > Hi,
> >
> > As we are using a very similar hardware and usage as Mark (Dell poweredge
> > hosts, Dell Equallogic SAN, iSCSI, and tons of LUNs for all those VMs),
> > I'm jumping into this thread.
>
> Can you share your multipath.conf that works with Dell Equallogic SAN?
>
>
I'm jumping in to share my current config with an EQL SAN on RH EL /
CentOS (but not oVirt).
The examples below are for a system connected to a PS6510ES.
Please treat this as input for discussion, to be merged with the oVirt
specific requirements (e.g. no friendly names).
This is what I'm using on RH EL 6.8 clusters configured with RHCS. I have
not yet tested any RH EL / CentOS 7.x system with EQL iSCSI.

 - /etc/multipath.conf

defaults {
        user_friendly_names yes
}

blacklist {
       wwid my_internal_disk_wwid

       device {
               vendor  "iDRAC"
               product "*"
       }
}

devices {
        device {
                vendor                  "EQLOGIC"
                product                 "100E-00"
                path_grouping_policy    multibus
                features "1 queue_if_no_path"
                path_checker directio
                failback immediate
                path_selector "round-robin 0"
                rr_min_io 512
                rr_weight priorities
        }
}


multipaths {
        multipath {
                wwid one_of_my_luns_wwid
                alias mympfriendlyname
        }

        ... other multipath sections for other luns

}


Other important configurations:

- /etc/iscsi/iscsid.conf
Changes other than the CHAP config parameters:

diff iscsid.conf iscsid.conf.orig
< #node.session.timeo.replacement_timeout = 120
< node.session.timeo.replacement_timeout = 15
---
> node.session.timeo.replacement_timeout = 120
130,131c125
< #node.session.err_timeo.lu_reset_timeout = 30
< node.session.err_timeo.lu_reset_timeout = 20
---
> node.session.err_timeo.lu_reset_timeout = 30
168,169c162
< # node.session.initial_login_retry_max = 8
< node.session.initial_login_retry_max = 12
---
> node.session.initial_login_retry_max = 8
178,179c171
< #node.session.cmds_max = 128
< node.session.cmds_max = 1024
---
> node.session.cmds_max = 128
183,184c175
< #node.session.queue_depth = 32
< node.session.queue_depth = 128
---
> node.session.queue_depth = 32
310,311c301
< #node.session.iscsi.FastAbort = Yes
< node.session.iscsi.FastAbort = No
---
> node.session.iscsi.FastAbort = Yes
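
To double-check which values are active after editing, one option is to
list only the uncommented node.session.* lines. This is a minimal sketch:
the function name active_session_settings is made up for illustration, and
the sample input just repeats a few lines from the diff above.

```shell
# Print the active (uncommented) node.session.* settings read from stdin,
# normalised to key=value.
active_session_settings() {
    grep -E '^[[:space:]]*node\.session\.' |
        sed 's/^[[:space:]]*//; s/[[:space:]]*=[[:space:]]*/=/'
}

# Sample input taken from the diff; on a real host, feed it
# /etc/iscsi/iscsid.conf instead.
active_session_settings <<'EOF'
#node.session.cmds_max = 128
node.session.cmds_max = 1024
#node.session.queue_depth = 32
node.session.queue_depth = 128
EOF
```

With the sample input this prints the two uncommented lines as
node.session.cmds_max=1024 and node.session.queue_depth=128.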


- config files for the network adapters dedicated to iSCSI
They are 10Gb/s interfaces; lspci gives:
05:00.0 Ethernet controller: Intel Corporation 82599 10 Gigabit Dual Port Backplane Connection (rev 01)

/etc/sysconfig/network-scripts/ifcfg-eth4
DEVICE=eth4
BOOTPROTO=static
HWADDR=XX:XX:XX:XX:XX:XX
ONBOOT=yes
IPADDR=10.10.100.227
NETMASK=255.255.255.0
TYPE=Ethernet
MTU=9000

Similar for eth5 (IP is 10.10.100.228)

ifup eth4
ifup eth5

- /etc/sysctl.conf
net.ipv4.conf.eth4.arp_announce=2
net.ipv4.conf.eth4.arp_ignore=1
net.ipv4.conf.eth4.arp_filter=2
#
net.ipv4.conf.eth5.arp_announce=2
net.ipv4.conf.eth5.arp_ignore=1
net.ipv4.conf.eth5.arp_filter=2

To apply the changes:
sysctl -p

Verify ping to the portal (10.10.100.7) from both interfaces
ping -I eth4 10.10.100.7
ping -I eth5 10.10.100.7

to verify jumbo frame connections (if configured, as in my case):
ping 10.10.100.7 -M do -s 8972 -I eth4
ping 10.10.100.7 -M do -s 8972 -I eth5
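
For reference, the largest payload that "ping -M do" can send without
fragmentation is the MTU minus 28 bytes (20 for an IPv4 header without
options, 8 for the ICMP header). A small sketch of that arithmetic; the
function name icmp_payload_for_mtu is only illustrative:

```shell
# Largest ICMP echo payload that fits in one unfragmented IPv4 packet:
# MTU minus 20 bytes (IPv4 header, no options) minus 8 bytes (ICMP header).
icmp_payload_for_mtu() {
    echo $(( $1 - 20 - 8 ))
}

icmp_payload_for_mtu 9000   # prints 8972
```

Any "-s" value up to that result should pass on a path that really carries
9000 byte frames end to end; a larger value should fail with "-M do".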


- configuration of the iscsi interfaces
iscsiadm -m iface -I ieth4 --op=new
iscsiadm -m iface -I ieth5 --op=new
iscsiadm -m iface -I ieth4 --op=update -n iface.hwaddress -v XX:XX:XX:XX:XX:XX
iscsiadm -m iface -I ieth5 --op=update -n iface.hwaddress -v YY:YY:YY:YY:YY:YY
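
With more than two interfaces, the same pair of commands can be generated
per interface. The sketch below is a dry run that only prints the commands
for review and does not call iscsiadm; the helper name print_iface_cmds is
hypothetical and the MAC addresses are the same placeholders as above.

```shell
# Dry run: print (do not execute) the iscsiadm commands that create an
# iSCSI iface record and bind it to a NIC by MAC address.
# Arguments: <iface-name> <mac-address>
print_iface_cmds() {
    printf 'iscsiadm -m iface -I %s --op=new\n' "$1"
    printf 'iscsiadm -m iface -I %s --op=update -n iface.hwaddress -v %s\n' "$1" "$2"
}

print_iface_cmds ieth4 XX:XX:XX:XX:XX:XX
print_iface_cmds ieth5 YY:YY:YY:YY:YY:YY
```

Once the printed commands look right, they can be pasted into a root shell
or piped to sh.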


Output of some commands with this config:

# iscsiadm -m session | grep mylun
tcp: [3] 10.10.100.7:3260,1 iqn.2001-05.com.equallogic:0-xxxxxxxx-mylun (non-flash)
tcp: [4] 10.10.100.7:3260,1 iqn.2001-05.com.equallogic:0-xxxxxxxx-mylun (non-flash)

With the "-P 1" option:

Target: iqn.2001-05.com.equallogic:0-xxxxxxxx-mylun (non-flash)
        Current Portal: 10.10.100.38:3260,1
        Persistent Portal: 10.10.100.7:3260,1
                **********
                Interface:
                **********
                Iface Name: ieth5
                Iface Transport: tcp
                Iface Initiatorname: iqn.1994-05.com.redhat:aea9b71a9aaf
                Iface IPaddress: 10.10.100.228
                Iface HWaddress: <empty>
                Iface Netdev: eth5
                SID: 3
                iSCSI Connection State: LOGGED IN
                iSCSI Session State: LOGGED_IN
                Internal iscsid Session State: NO CHANGE
        Current Portal: 10.10.100.37:3260,1
        Persistent Portal: 10.10.100.7:3260,1
                **********
                Interface:
                **********
                Iface Name: ieth4
                Iface Transport: tcp
                Iface Initiatorname: iqn.1994-05.com.redhat:aea9b71a9aaf
                Iface IPaddress: 10.10.100.227
                Iface HWaddress: <empty>
                Iface Netdev: eth4
                SID: 4
                iSCSI Connection State: LOGGED IN
                iSCSI Session State: LOGGED_IN
                Internal iscsid Session State: NO CHANGE
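
With many LUNs this output gets long, so it can help to reduce it to one
line per session. This is a rough sketch that assumes the "-P 1" layout
shown above; the function name session_states is made up.

```shell
# Reduce "iscsiadm -m session -P 1" output (read from stdin) to one
# "<iface> <session state>" line per session.
session_states() {
    awk '
        /Iface Name:/          { iface = $3 }
        /iSCSI Session State:/ { print iface, $4 }
    '
}

# On a host with iSCSI sessions:
#   iscsiadm -m session -P 1 | session_states
```

Each iface should appear once per target with the state LOGGED_IN.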

# multipath -l mympfriendlyname
mympfriendlyname (one_of_my_luns_wwid) dm-6 EQLOGIC,100E-00
size=30G features='1 queue_if_no_path' hwhandler='0' wp=rw
`-+- policy='round-robin 0' prio=0 status=active
  |- 12:0:0:0 sdf 8:80  active undef running
  `- 13:0:0:0 sdg 8:96  active undef running
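
A quick way to check that every expected path is present is to count the
path lines whose dm state is "active". This is a sketch that assumes the
output layout shown above (H:C:T:L, device, major:minor, dm state, ...);
the function name count_active_paths is only illustrative.

```shell
# Count path lines in "multipath -l" output (read from stdin) whose
# dm state column is "active". A path line contains an H:C:T:L field
# followed by the device name, major:minor and then the dm state.
count_active_paths() {
    awk '{
        for (i = 1; i <= NF; i++)
            if ($i ~ /^[0-9]+:[0-9]+:[0-9]+:[0-9]+$/ && $(i + 3) == "active") {
                n++
                break
            }
    } END { print n + 0 }'
}

# On a host with the multipath device configured:
#   multipath -l mympfriendlyname | count_active_paths
```

For the device above the count should be 2, one path per iSCSI interface.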

Hope this helps the discussion,
Gianluca