• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar

UnixArena

  • Home
  • kubernetes
  • DevOps
    • Terraform
    • Jenkins
    • Docker
    • Openshift
      • OKD
    • Ansible engine
    • Ansible Tower
      • AWX
    • Puppet
  • Cloud
    • Azure
    • AWS
    • Openstack
    • Docker
  • VMware
    • vCloud Director
    • VMware-Guests
    • Vcenter Appliance 5.5
    • vC OPS
    • VMware SDDC
    • VMware vSphere 5.x
      • vSphere Network
      • vSphere DS
      • vShield Suite
    • VMware vSphere 6.0
    • VSAN
    • VMware Free Tools
  • Backup
    • Vembu BDR
    • Veeam
    • Nakivo
    • Azure Backup
    • Altaro VMBackup
    • Spinbackup
  • Tutorials
    • Openstack Tutorial
    • Openstack Beginner’s Guide
    • VXVM-Training
    • ZFS-Tutorials
    • NetApp cDot
    • LVM
    • Cisco UCS
    • LDOM
    • Oracle VM for x86
  • Linux
    • How to Articles
    • Q&A
    • Networking
    • RHEL7
  • DevOps Instructor-led Training
  • Contact

How do you troubleshoot if VCS cluster is not starting ?

July 11, 2014 By Cloud_Devops 14 Comments

How do you start VCS cluster if its not started automatically after the  server reboot?  Have you ever faced such  issues ? If not just see how we can fix these kind of issues on veritas cluster. I have been asking this questions on the Solaris interviews but most of them are fail to impress me by saying some unrelated things with VCS stuffs. If you know the basic of veritas cluster, it will be so easy for to troubleshoot in real time and easy to explain on interviews too.

VCS troubleshooting
VCS troubleshooting

Scenario: 

Two nodes are clustered with veritas cluster and you have rebooted one of the server. Rebooted node has come up but VCS cluster was not started (HAD daemon). You are trying to start the cluster using “hastart” command , but its not working.How do you troubleshoot ?

Here we go.

1.Check the cluster status after the server reboot using “hastatus” command.

# hastatus -sum |head
Cannot connect to VCS engine

2.Trying to start the cluster using hastart . No Luck. ? Still getting same message like above ? Proceed with Step 3.

3.Check the llt and GAB service. If its in disable state, just enable it .

[root@UA~]# svcs -a |egrep "llt|gab"
online         Jun_27   svc:/system/llt:default
online         Jun_27   svc:/system/gab:default
[root@UA~]#

4.Check the llt(heartbeat) status. Here LLT links looks good.

[root@UA ~]# lltstat -nvv |head
LLT node information:
    Node                 State    Link  Status  Address
     0 UA2         OPEN
                                  HB1   UP      00:91:28:99:74:89
                                  HB2   UP      00:91:28:99:74:BF
   * 1 UA          OPEN
                                  HB1   UP      00:71:28:9C:2E:OF
                                  HB2   UP      00:71:28:9C:2F:9F
[root@UA ~]#

5.If the LLT is down ,then try to configure using “lltconfig -c” command to configure the private links. Still if you have any issue with LLT links, then need to check with network team to fix the heartbeat links.

6.check the GAB status using “gabconfig -a” command.

[root@UA ~]# gabconfig -a
GAB Port Memberships
===============================================================
[root@UA ~]#

7.As per the above command output, memberships are not seeded. We have to seed the membership manually using gabconfig command.

[root@UA ~]# gabconfig -cx
[root@UA ~]#

8. Check the GAB status now.

[root@UA ~]# gabconfig -a
GAB Port Memberships
===============================================================
Port a gen   6d0607 membership 01
[root@UA ~]#

Above  output Indicates that GAB(Port a) is online on both the nodes. (0 , 1). To know which node is “0” and which node “1” , refer /etc/llthosts file.

9.Try to start the cluster using hastart command.It should work now.

10.Check the Membership status using gabconfig.

[root@UA ~]# gabconfig -a
GAB Port Memberships
===============================================================
Port a gen   6d0607 membership 01
Port h gen   6d060b membership 01
[root@UA ~]#

Above  output Indicates that HAD(Port h) is online on both the nodes. (0 , 1).

11.Check the cluster status using hastatus command. System should be back to business.

[root@UA ~]# hastatus -sum |head
-- SYSTEM STATE
-- System               State                Frozen
A  UA2                 RUNNING              0
A  UA                  RUNNING              0

-- GROUP STATE
-- Group           System       Probed     AutoDisabled    State
B  ClusterService  UA             Y          N             ONLINE
B  ClusterService  UA2            Y          N             OFFLINE

[root@UA ~]#

This is very small thing but many of the VCS beginners failed to fix this start-up issues. In interviews too ,they are not able say that ,” If the HAD is not starting using “hastart” command , I will check the LLT & GAB services and will fix any issues with that.Then i will start the cluster using hastart” As an interviewers , everybody will expect this answers.

Hope this article is informative to you .

Share it ! Comment it !! Be Sociable !!!

Filed Under: VCS, Veritas

Reader Interactions

Comments

  1. Donovan says

    April 11, 2021 at 2:23 pm

    Thanks, this article was a big help.

    Reply
  2. noh says

    October 20, 2020 at 6:44 pm

    thanks a lot really you help me to fix my issue

    Reply
  3. Durgaprasad says

    April 7, 2017 at 12:14 am

    I got the split brain problem in VCS , how to rectify this ?

    Reply
  4. Durgaprasad says

    April 7, 2017 at 12:08 am

    I got the split brain problem, how to rectify this ?

    Reply
  5. adinarayana says

    March 10, 2017 at 6:23 pm

    good job. it is very useful

    Reply
  6. Krishna says

    November 6, 2016 at 7:24 pm

    Very nice explanation. Getting so much knowledge from these posts.

    Please add some scenario based questions on VxVM, VCS, Zones, Ldoms.

    Reply
  7. Partha Dutta says

    September 18, 2016 at 9:48 pm

    great post buddy

    Reply
  8. Nick says

    February 18, 2016 at 10:36 am

    It is great. Thanks.

    Reply
  9. radha says

    January 5, 2016 at 3:05 pm

    Unable to switch between two servers in cluster.’

    I am trying to switch from one server (1st server) to another server(2 nd server).

    By using “hagrp –switch group –to system”

    Initially when we use this command, all resources goes offline on 1 st server as expected.
    But after that on second server jboss goes to fault mode and resources except jboss goes to online in 2 nd server
    after sometime again 1 st server comes to online….

    Actually in cluster 2 nd server needs to come online.

    pls provide me solution

    Reply
  10. Raja says

    November 13, 2014 at 1:05 pm

    Great post, You are helping alot for new beginners ..I am also referring to friends…

    Thanks again!!!..

    Reply
  11. senthil says

    September 11, 2014 at 9:42 am

    trying to mount a filesystem which i created in vcs, main resources getting offline

    Reply
  12. senthil says

    September 11, 2014 at 9:42 am

    trying to mount a filesystem which i created in vcs main resources getting offline

    Reply
  13. Anil boda says

    July 16, 2014 at 10:31 pm

    thanks for the quick response.
    The link mentioned does not VCS scenario based questions.
    Any how its useful…

    thanks

    Reply
  14. Lingeswaran R says

    July 15, 2014 at 12:23 am

    Will do ….You also check the below one …

    https://www.unixarena.com/2013/06/solaris-scenario-based-interview.html

    Regards

    Lingeswaran

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Primary Sidebar

Follow UnixArena

  • Facebook
  • LinkedIn
  • Twitter

Copyright © 2025 · UnixArena ·

Go to mobile version