Buy Percona ServicesBuy Now!

Can I get pt-online-schema-change to abort if it can't connect to slave DBs?

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • eliot.chan
    replied
    Hi Carlos,

    The issue was that the root user didn't have the necessary permissions to connect to the slaves - they were discoverable, just not able to be connected to. That being said, I'm still looking for a way to run the command in a way that it would abort itself if the found slaves could not be connected to.

    Thanks

    Leave a comment:


  • carlos.salguero
    replied
    Hello,

    Maybe your slaves are not using the standar ports.
    Try using the DSN table: https://www.percona.com/doc/percona-...cursion-method

    regards

    Leave a comment:


  • Can I get pt-online-schema-change to abort if it can't connect to slave DBs?

    Just ran into a problem in our production environment - we ran pt-online-schema-change to add an index to a table. It wasn't able to connect to the slave DBs, and as such, the slave DB's replication lag went crazy as the tool was not monitoring the lag.

    Here are the logs (I've edited out the sensitive information):

    [eliot@<machine> ~]$ sudo pt-online-schema-change --execute --alter "ADD INDEX colname (colname)" D=dbname,t=tablename
    1> Cannot connect to h=<SLAVE_IP_1>
    1> Cannot connect to h=<SLAVE_IP_2>
    No slaves found. See --recursion-method if host <machine> has slaves.
    Not checking slave lag because no slaves were found and --check-slave-lag was not specified.

    Clearly it knew there were slaves, but because it couldn't connect to them, and it looks like it assumed that they didn't exist (as opposed to knowing that they do exist).

    Is there a flag I'm missing that could get the tool to abort this change if it can't connect to its slaves? The --dry-run version also makes no mention of these slave connections, which is another reason why it slipped my mind/didn't ring any alarm bells.
Working...
X