High availability for MySQL on Amazon EC2 – Part 4 – The instance restart script

This post is the fourth of a series that started here.

In the previous post of this series, we configured the resources, but instead of starting MySQL, Pacemaker invokes a script to start (or restart) the EC2 instance running MySQL. This blog post describes the instance restart script. Remember, I am more of a DBA than a script writer, so it might not be written in the most optimal way.

First, let’s recap what the script has to perform (the full script is given below).

Kill the MySQL EC2 instance if running
Make sure the MySQL EC2 instance is stopped
Prepare the user-data script for the new MySQL EC2 instance
Launch the new MySQL instance
Make sure it is running
Reconfigure local heartbeat
Broadcast the new MySQL instance IP to the application servers

Kill the MySQL EC2 instance

In order to kill the existing MySQL EC2 instance, we first have to identify it. This is done by:

OLD_INSTANCE_ID=`ec2-describe-instances -K $PK -C $CERT | /usr/local/bin/filtre_instances.pl | grep $AMI_HA_MYSQL | egrep "running|pending" | tail -n 1 | cut -d'|' -f3`

The command above filters on the AMI of the instance. Since an instance can also be listed in the “stopped” state, it is mandatory to filter for the states “running” or “pending”. Then the instance is terminated with:

ec2-terminate-instances -K $PK -C $CERT $OLD_INSTANCE_ID > /dev/null
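
To make the filtering step easier to follow, here is a minimal sketch that runs the same filter chain against canned text instead of live ec2-describe-instances output. The layout of the sample lines is an assumption: the post only establishes that fields are separated by '|', that field 2 is the IP and field 3 the instance ID. The instance IDs, the IPs and the second AMI ID are made up, and grep -E is used in place of egrep.

```shell
#!/bin/bash
# Hypothetical sample of what filtre_instances.pl might print: one instance
# per line, '|'-separated. Only fields 2 (IP) and 3 (instance ID) are known
# from the post; the position of the other fields is assumed.
AMI_HA_MYSQL=ami-84a74fed
SAMPLE='ami-84a74fed|10.0.0.11|i-aaaa1111|terminated
ami-84a74fed|10.0.0.12|i-bbbb2222|running
ami-00000000|10.0.0.13|i-cccc3333|running'

# Same filter chain as the restart script, fed from the sample text.
OLD_INSTANCE_ID=$(printf '%s\n' "$SAMPLE" \
    | grep "$AMI_HA_MYSQL" \
    | grep -E "running|pending" \
    | tail -n 1 \
    | cut -d'|' -f3)

echo "$OLD_INSTANCE_ID"
```

With this sample, the terminated instance and the instance built from another AMI are both discarded, so only i-bbbb2222 is left to terminate.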

Make sure the MySQL EC2 instance is stopped

Terminating an EC2 instance is not instantaneous. We can confirm that an instance is really stopped by monitoring its status and waiting until it is actually “terminated”. The code below is how the script performs this task.

#wait until the old instance is terminated, it takes a few seconds to stop
done="false"
while [ "$done" == "false" ]
do
	status=`ec2-describe-instances -K $PK -C $CERT $OLD_INSTANCE_ID | /usr/local/bin/filtre_instances.pl | grep -c terminated`
	if [ "$status" -eq "1" ]; then
		done="true"
	else
		ec2-terminate-instances -K $PK -C $CERT $OLD_INSTANCE_ID > /dev/null
		sleep 5
	fi
done
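
The loop above waits forever if the instance never reaches the expected state. As a possible variation, and not what the script in this post does, here is a sketch of a polling helper that gives up after a fixed number of attempts. The names wait_for_state and fake_describe are hypothetical; fake_describe stands in for the ec2-describe-instances pipeline so that the example runs without any EC2 tools.

```shell
#!/bin/bash
# wait_for_state CMD STATE MAX_TRIES DELAY
# Runs CMD repeatedly until its output contains STATE, giving up after
# MAX_TRIES attempts. Returns 0 on success, 1 on timeout.
wait_for_state() {
    local cmd=$1 state=$2 max_tries=$3 delay=$4 try
    for (( try = 1; try <= max_tries; try++ )); do
        if "$cmd" | grep -q "$state"; then
            return 0
        fi
        sleep "$delay"
    done
    return 1
}

# Stand-in for "ec2-describe-instances ... | filtre_instances.pl":
# reports "shutting-down" twice, then "terminated". The call count is kept
# in a file because the function runs in a pipeline subshell.
CALLS_FILE=$(mktemp)
fake_describe() {
    echo x >> "$CALLS_FILE"
    if [ "$(grep -c x "$CALLS_FILE")" -ge 3 ]; then
        echo "i-bbbb2222|terminated"
    else
        echo "i-bbbb2222|shutting-down"
    fi
}

if wait_for_state fake_describe terminated 10 0; then
    RESULT="terminated"
else
    RESULT="timeout"
fi
echo "$RESULT"
rm -f "$CALLS_FILE"
```

In a real script the delay would be a few seconds rather than 0, and a timeout would be the place to log an error and let Pacemaker decide what to do next.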

Prepare the user-data script for the new MySQL EC2 instance

The new MySQL instance will be running heartbeat. Since we can use neither Ethernet broadcast nor multicast, we need to configure the new instance so that it communicates through unicast with its partner node in the cluster, the node on which the restart script is run. This configuration is achieved by providing a user-data script (see the hamysql.user-data below) which completes the heartbeat configuration of the new instance. The hamysql.user-data script only performs a search and replace operation on the /etc/ha.d/ha.cf file and then restarts the heartbeat service. For this to work properly, we just have to put the IP of the current instance in the script, like here:

OUR_IP=`/sbin/ifconfig eth0 | grep 'inet addr' | cut -d':' -f2 | cut -d' ' -f1`
#Now, modify the user-data script, we need to put our IP address in
if [ "$OUR_IP" == "" ]
then
	echo "Error getting Our IP"
else	
	perl -pi -e "s/ucast eth0 (\d+)(\.\d+){3}/ucast eth0 $OUR_IP/g" $USER_DATA_SCRIPT
fi
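
To see what this substitution does to the user-data script, here is a small sketch that applies it to a throwaway copy. It uses sed with an equivalent extended regular expression instead of perl, and a made-up address (10.1.2.3) instead of the ifconfig lookup; both are assumptions made for the sake of a self-contained example.

```shell
#!/bin/bash
# Work on a temporary file so nothing under /usr/local/bin is touched.
USER_DATA_SCRIPT=$(mktemp)
cat > "$USER_DATA_SCRIPT" <<'EOF'
#!/bin/bash
sudo perl -pi -e "s/ucast eth0 (\d+)(\.\d+){3}/ucast eth0 10.220.230.18/g" /etc/ha.d/ha.cf
EOF

OUR_IP=10.1.2.3   # made-up address standing in for the ifconfig lookup

# Replace "ucast eth0 <IPv4>" with our own address. Only the replacement
# half of the perl expression matches, because the pattern half has "(\d+)"
# where an address is expected.
sed -E "s/ucast eth0 [0-9]+(\.[0-9]+){3}/ucast eth0 $OUR_IP/g" \
    "$USER_DATA_SCRIPT" > "$USER_DATA_SCRIPT.new"
mv "$USER_DATA_SCRIPT.new" "$USER_DATA_SCRIPT"

# Count the lines that now carry the new address, and check that the
# search pattern itself was left alone.
REPLACED=$(grep -cF "ucast eth0 $OUR_IP/g" "$USER_DATA_SCRIPT")
PATTERN_KEPT=$(grep -cF '(\d+)(\.\d+){3}' "$USER_DATA_SCRIPT")
echo "replaced=$REPLACED pattern_kept=$PATTERN_KEPT"
rm -f "$USER_DATA_SCRIPT"
```

The point of the example is that the same expression can be run again on every restart: it always finds whatever address was written the previous time.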

Launch the new MySQL instance

Once things are ready, a new MySQL instance can be launched with:

#Now we are ready to start a new one
INSTANCE_INFO=`ec2-run-instances -K $PK -C $CERT $AMI_HA_MYSQL -n 1 -g $HA_SECURITY_GROUP -f $USER_DATA_SCRIPT -t $INSTANCE_TYPE -z $INSTANCE_ZONE -k $INSTANCE_KEY | /usr/local/bin/filtre_instances.pl`

#extract the ID of the new instance, it is needed to wait until the instance is running
NEW_INSTANCE_ID=`echo $INSTANCE_INFO | cut -d'|' -f3`

From this operation, we retrieve the “instance_id” of the new instance.
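
Since everything that follows depends on this ID, it can be worth checking it before going further. Here is a minimal sketch, assuming the same '|'-separated layout with the instance ID in field 3; the helper name extract_instance_id and the sample line are hypothetical.

```shell
#!/bin/bash
# extract_instance_id LINE
# Prints field 3 of a '|'-separated line and fails if it is empty or does
# not look like an EC2 instance ID ("i-" followed by hex digits).
extract_instance_id() {
    local id
    id=$(printf '%s\n' "$1" | cut -d'|' -f3)
    if [[ $id =~ ^i-[0-9a-f]+$ ]]; then
        printf '%s\n' "$id"
    else
        return 1
    fi
}

# Hypothetical filter output; the layout beyond fields 2 and 3 is assumed.
INSTANCE_INFO='ami-84a74fed|10.0.0.14|i-dddd4444|pending'
NEW_INSTANCE_ID=$(extract_instance_id "$INSTANCE_INFO") \
    || echo "Error creating the new instance"
echo "$NEW_INSTANCE_ID"

# An empty answer, as when ec2-run-instances fails, is rejected.
EMPTY_STATUS=0
extract_instance_id "" > /dev/null || EMPTY_STATUS=$?
```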

Make sure it is running

Since we know the “instance_id” of the new instance, checking if it is running is easy:

done="false"
while [ $done == "false" ]
do
	INSTANCE_INFO=`ec2-describe-instances -K $PK -C $CERT  $NEW_INSTANCE_ID | /usr/local/bin/filtre_instances.pl`
	status=`echo $INSTANCE_INFO | grep -c running`
	if [ "$status" -eq "1" ]; then
		done="true"
	else
		sleep 5
	fi
done

Reconfigure local heartbeat

Now Heartbeat, on the monitoring host, must be informed of the IP address of its new partner. To achieve this, a search and replace operation in the local ha.cf file followed by a reload of Heartbeat is sufficient.

#Set the IP in /etc/ha.d/ha.cf and ask heartbeat to reload its config
MYSQL_IP=`ec2-describe-instances -K $PK -C $CERT  $NEW_INSTANCE_ID | /usr/local/bin/filtre_instances.pl | cut -d'|' -f2`
perl -pi -e "s/ucast eth0 (\d+)(\.\d+){3}/ucast eth0 $MYSQL_IP/g" /etc/ha.d/ha.cf
/etc/init.d/heartbeat reload
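
If the lookup returns nothing, the substitution would leave "ucast eth0 " with no address in ha.cf, and the next run would no longer find anything to replace. A possible safeguard, which is not part of the script in this post, is to validate the address first. The sketch below uses a hypothetical is_ipv4 helper, sed instead of perl, a temporary file in place of /etc/ha.d/ha.cf and a made-up address.

```shell
#!/bin/bash
# is_ipv4 STRING: succeeds only for four dot-separated numbers in 0..255.
is_ipv4() {
    local ip=$1 octet
    [[ $ip =~ ^[0-9]{1,3}(\.[0-9]{1,3}){3}$ ]] || return 1
    local IFS=.
    for octet in $ip; do
        # 10# forces base 10 so that "08" is not read as octal
        (( 10#$octet <= 255 )) || return 1
    done
    return 0
}

# Temporary stand-in for /etc/ha.d/ha.cf
HA_CF=$(mktemp)
echo "ucast eth0 10.220.230.18" > "$HA_CF"

MYSQL_IP=10.0.0.14   # made-up address standing in for the EC2 lookup
if is_ipv4 "$MYSQL_IP"; then
    sed -E "s/ucast eth0 [0-9]+(\.[0-9]+){3}/ucast eth0 $MYSQL_IP/g" \
        "$HA_CF" > "$HA_CF.new" && mv "$HA_CF.new" "$HA_CF"
else
    echo "Error getting the IP of the new instance"
fi

NEW_LINE=$(cat "$HA_CF")
echo "$NEW_LINE"
rm -f "$HA_CF"
```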

Broadcast the new MySQL instance IP to the application servers

The final phase is to inform the application servers that the IP of the MySQL server has changed. The best way to list those application servers is through a security group and, provided the appropriate ssh keys have been exchanged, this code will push the IP update.

TMPFILE=`mktemp`
ec2-describe-instances -K $PK -C $CERT | /usr/local/bin/filtre_instances.pl | grep $CLIENT_SECURITY_GROUP > $TMPFILE
	
while read line
do
	IP=`echo $line | cut -d'|' -f2`
	#-n keeps ssh from reading the remaining lines of $TMPFILE through stdin
	ssh -n -i /usr/local/bin/update_mysql ubuntu@$IP sudo ./updated_xinetd.sh $MYSQL_IP
done < $TMPFILE

rm $TMPFILE
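
One thing to watch for in a loop of this shape: ssh reads its standard input, and inside a "while read" loop that input is the list of servers. Unless ssh is run with -n, or with stdin redirected from /dev/null, the first call can swallow the rest of the list and only one application server gets updated. The sketch below shows the effect with a stand-in function (fake_ssh, hypothetical) and three made-up addresses, so it runs without any remote host.

```shell
#!/bin/bash
# Three made-up client IPs, one per line, as the temporary file would hold.
CLIENTS=$(mktemp)
printf '%s\n' 10.0.1.1 10.0.1.2 10.0.1.3 > "$CLIENTS"

# Stand-in for ssh: like ssh, it reads whatever is left on standard input.
fake_ssh() { cat > /dev/null; }

# Without protection, the first call consumes the remaining lines.
UNPROTECTED=0
while read -r ip; do
    fake_ssh "$ip"
    UNPROTECTED=$((UNPROTECTED + 1))
done < "$CLIENTS"

# Redirecting stdin, which is what "ssh -n" does, keeps the loop intact.
PROTECTED=0
while read -r ip; do
    fake_ssh "$ip" < /dev/null
    PROTECTED=$((PROTECTED + 1))
done < "$CLIENTS"

rm -f "$CLIENTS"
echo "unprotected=$UNPROTECTED protected=$PROTECTED"
```

The first loop runs once and the second runs three times, which is the difference between updating one application server and updating all of them.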

The full script:

#!/bin/bash
HA_SECURITY_GROUP=testyves
CLIENT_SECURITY_GROUP=hamysql-client
CLIENT_SCRIPT=/usr/local/bin/update_client.sh
AMI_HA_MYSQL=ami-84a74fed
EBS_DATA_VOL=vol-aefawf
USER_DATA_SCRIPT=/usr/local/bin/hamysql.user-data
PK=/usr/local/bin/pk-FNMBRRABFRKVICBDZ4IOOSF7YROYZRZW.pem
CERT=/usr/local/bin/cert-FNMBRRABFRKVICBDZ4IOOSF7YROYZRZW.pem
INSTANCE_TYPE=m1.small
INSTANCE_ZONE=us-east-1c
INSTANCE_KEY=yves-key

OUR_IP=`/sbin/ifconfig eth0 | grep 'inet addr' | cut -d':' -f2 | cut -d' ' -f1`
#Now, modify the user-data script, we need to put our IP address in
if [ "$OUR_IP" == "" ]
then
	echo "Error getting Our IP"
else	
	perl -pi -e "s/ucast eth0 (\d+)(\.\d+){3}/ucast eth0 $OUR_IP/g" $USER_DATA_SCRIPT
fi

while [ 1 ]; do

	#First thing to do, terminate the other instance ID
	OLD_INSTANCE_ID=`ec2-describe-instances -K $PK -C $CERT | /usr/local/bin/filtre_instances.pl | grep $AMI_HA_MYSQL | egrep "running|pending" | tail -n 1 | cut -d'|' -f3`

	if [ "$OLD_INSTANCE_ID" == "" ]
	then
		#no running instance
		:
	else
		ec2-terminate-instances -K $PK -C $CERT $OLD_INSTANCE_ID > /dev/null

		#wait until the old instance is terminated  it takes a few seconds to stop 
		done="false"
		while [ $done == "false" ]
		do
			status=`ec2-describe-instances -K $PK -C $CERT $OLD_INSTANCE_ID | /usr/local/bin/filtre_instances.pl |  grep -c terminated`
	   		if [ "$status" -eq "1" ]; then
      				done="true"
   			else
				ec2-terminate-instances -K $PK -C $CERT $OLD_INSTANCE_ID > /dev/null
				sleep 5
   			fi
		done	
	fi

	#Now we are ready to start a new one
	INSTANCE_INFO=`ec2-run-instances -K $PK -C $CERT $AMI_HA_MYSQL -n 1 -g $HA_SECURITY_GROUP -f $USER_DATA_SCRIPT -t $INSTANCE_TYPE -z $INSTANCE_ZONE -k $INSTANCE_KEY | /usr/local/bin/filtre_instances.pl`

	#extract the ID of the new instance, it is needed to wait until the instance is running
	NEW_INSTANCE_ID=`echo $INSTANCE_INFO | cut -d'|' -f3` 

	if [ "$NEW_INSTANCE_ID" == "" ]
	then
		echo "Error creating the new instance"
	else
	
		done="false"
		while [ $done == "false" ]
		do
   			INSTANCE_INFO=`ec2-describe-instances -K $PK -C $CERT  $NEW_INSTANCE_ID | /usr/local/bin/filtre_instances.pl`
	   		status=`echo $INSTANCE_INFO | grep -c running`
   			if [ "$status" -eq "1" ]; then
      				done="true"
	   		else
				sleep 5
   			fi
		done

		#Set the IP in /etc/ha.d/ha.cf and ask heartbeat to reload its config
		MYSQL_IP=`ec2-describe-instances -K $PK -C $CERT  $NEW_INSTANCE_ID | /usr/local/bin/filtre_instances.pl | cut -d'|' -f2`
		perl -pi -e "s/ucast eth0 (\d+)(\.\d+){3}/ucast eth0 $MYSQL_IP/g" /etc/ha.d/ha.cf

		TMPFILE=`mktemp`
		ec2-describe-instances -K $PK -C $CERT | /usr/local/bin/filtre_instances.pl | grep $CLIENT_SECURITY_GROUP > $TMPFILE
	
		while read line
		do
			IP=`echo $line | cut -d'|' -f2`
			#-n keeps ssh from reading the remaining lines of $TMPFILE through stdin
			ssh -n -i /usr/local/bin/update_mysql ubuntu@$IP sudo ./updated_xinetd.sh $MYSQL_IP
		done < $TMPFILE

		rm $TMPFILE
		
		/etc/init.d/heartbeat reload
	fi


        sleep 300 # 5 min before attempting again. Normally heartbeat should kill the script before
done

The hamysql.user-data script:

The script sets the IP of the monitor host in the heartbeat ha.cf configuration file and then finishes up some missing configuration settings of the AMI.

#!/bin/bash
sudo hostname hamysql
sudo perl -pi -e "s/ucast eth0 (\d+)(\.\d+){3}/ucast eth0 10.220.230.18/g" /etc/ha.d/ha.cf

# to eventually be added to the ebs image
sudo perl -pi -e 's/bind-address/#bind-address/g' /etc/mysql/my.cnf
sudo service mysql restart
sleep 5
/usr/bin/mysql -u root -proot -e "grant all on *.* to root@'%' identified by 'root'"
sudo /etc/init.d/heartbeat start

6 Comments

Colin

15 years ago

Any idea when you’ll put up Part 4?

Yves Trudeau

Author

15 years ago

Hi Colin,
I am guessing you are talking about part 5. The draft is partly written, I have been very busy recently but I’ll find some time to complete the series.

Colin

15 years ago

Yeah, as soon as I hit “Submit Comment” I realized I hit the wrong number! :-/

Thanks, these are really interesting and I’m really curious about you’re instance monitoring script. 🙂

Rickm

15 years ago

I found these series of notes very interesting and waiting to see your next part hopefully soon.
Thanks a lot.

Yves Trudeau

Author

15 years ago

If you look in the Pacemaker configuration here:
http://www.mysqlperformanceblog.com/2010/07/12/high-availability-for-mysql-on-amazon-ec2-%E2%80%93-part-3-%E2%80%93-configuring-the-ha-resources/

You’ll see that you can define the path. The current config points to /usr/local/bin/mysql.

Luke

15 years ago

where do the scripts go?
