Can MySQL temporary tables be made safe for statement-based replication?

May 27, 2008

Author

Baron Schwartz

A while ago I wrote about how to make MySQL replication reliable, part of which is to eliminate temporary tables. The idea is this: if a slave is stopped (or crashes) while a temporary table is open and is then restarted, the temporary table no longer exists, and the slave will have problems trying to replay any further statements that refer to it. Thus, I claimed, there’s no alternative but to eliminate temporary tables. This problem may not exist for row-based replication in MySQL 5.1 and later, but most installations I know of are using statement-based replication, even on MySQL 5.1.

This is a contentious topic. People love their temporary tables and will ask hopefully “are you sure this isn’t safe?” They’ll propose all sorts of ways to mitigate the danger, and I’ve heard many of them. But I recently heard an angle on this I had not heard before.

The argument is this: “you can create an InnoDB temporary table and use it only within one transaction, and then if the slave crashes and restarts, it’ll roll back the transaction to the beginning.” In other words, in theory if the temporary table exists only within that one transaction, and if your transaction accesses only InnoDB tables, it’s safe.

My first thought was, you can’t do that. CREATE TABLE commits the transaction, so there’s implicitly more than one transaction. However, as the person pointed out, that isn’t true with CREATE TEMPORARY TABLE. I tested this (sometimes the manual is wrong!) and found that indeed, you can open a transaction, make some changes, create a temporary table with ENGINE=InnoDB, and the InnoDB transaction ID does not change in SHOW INNODB STATUS. The statements are all within one transaction. (However, if you type ROLLBACK the temporary table doesn’t get dropped. It’s not really transactional — it just doesn’t auto-commit the transaction. The ROLLBACK will produce a warning that says “Some non-transactional changed tables couldn’t be rolled back”, which is interesting.)

But does that mean it’s safe for replication?

There is one good way to find out: test it. I fired up my master-and-two-slaves replication sandbox, flushed all the logs, and set out to get to the bottom of the matter.

First, I stopped the slave threads so I could choose which statements to replay on the slave and pick the “crash point” as I wished. (I didn’t shut down the slave; I just stopped the replication threads. This is safe to do even when temporary tables are open.) Then I created a temporary table on the master, inserted a row into it, and dropped it:

master > set autocommit=0;
master > begin;
master > create temporary table test.t(a int) engine=innodb;
master > insert into test.t(a) values(1);
master > drop temporary table test.t;
master > commit;

In theory, that’s all in one transaction. Since I flushed the logs before I did this, everything in the binary log so far comes from these statements. Let’s look at the binary logs:

master > show master status;
+------------------+----------+--------------+------------------+
| File             | Position | Binlog_Do_DB | Binlog_Ignore_DB |
+------------------+----------+--------------+------------------+
| mysql-bin.000007 |      474 |              |                  |
+------------------+----------+--------------+------------------+

master > show binlog events in 'mysql-bin.000007'\G
*************************** 1. row ***************************
   Log_name: mysql-bin.000007
        Pos: 4
 Event_type: Format_desc
  Server_id: 1
End_log_pos: 98
       Info: Server ver: 5.0.45-log, Binlog ver: 4
*************************** 2. row ***************************
   Log_name: mysql-bin.000007
        Pos: 98
 Event_type: Query
  Server_id: 1
End_log_pos: 207
       Info: create temporary table test.t(a int) engine=innodb
*************************** 3. row ***************************
   Log_name: mysql-bin.000007
        Pos: 207
 Event_type: Query
  Server_id: 1
End_log_pos: 271
       Info: BEGIN
*************************** 4. row ***************************
   Log_name: mysql-bin.000007
        Pos: 271
 Event_type: Query
  Server_id: 1
End_log_pos: 90
       Info: insert into test.t(a) values(1)
*************************** 5. row ***************************
   Log_name: mysql-bin.000007
        Pos: 361
 Event_type: Query
  Server_id: 1
End_log_pos: 176
       Info: drop temporary table test.t
*************************** 6. row ***************************
   Log_name: mysql-bin.000007
        Pos: 447
 Event_type: Xid
  Server_id: 1
End_log_pos: 474
       Info: COMMIT /* xid=39 */

Very interesting! The statements are not in the same order in the binlog as I typed them into the console: the CREATE TEMPORARY TABLE is logged before the BEGIN. If you replay the binary log, you’ll get two transactions here.

This shows us something that isn’t considered in the “all inside one transaction” argument: transactions aren’t the only thing that matters. How the server logs events to the binary log is equally important. It appears that we can break replication on the slave by killing the slave after the event at position 98 executes and before the event at position 207 executes. But let’s not draw any conclusions yet. The only way to tell for sure is to really test it.
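To make the ordering problem concrete, here is a small Python sketch that scans a list of logged statements and reports any CREATE TEMPORARY TABLE that sits outside a BEGIN ... COMMIT pair. The `EVENTS` list is copied by hand from the binlog output above; the function name and the simple string matching are my own illustrative assumptions, not anything MySQL provides.

```python
# Events copied from the SHOW BINLOG EVENTS output above: (position, statement).
EVENTS = [
    (98, "create temporary table test.t(a int) engine=innodb"),
    (207, "BEGIN"),
    (271, "insert into test.t(a) values(1)"),
    (361, "drop temporary table test.t"),
    (447, "COMMIT /* xid=39 */"),
]


def temp_creates_outside_transaction(events):
    """Return the events that create a temporary table while no BEGIN is open.

    Hypothetical helper: it only recognizes the statement shapes used in
    this article and is not a general SQL parser.
    """
    in_transaction = False
    outside = []
    for pos, statement in events:
        text = statement.strip().lower()
        if text == "begin":
            in_transaction = True
        elif text.startswith("commit") or text.startswith("rollback"):
            in_transaction = False
        elif text.startswith("create temporary table") and not in_transaction:
            outside.append((pos, statement))
    return outside


print(temp_creates_outside_transaction(EVENTS))
```

For the log above this reports the event at position 98, which is exactly the statement that was typed after BEGIN on the master but logged before it.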

Since I’d stopped the slave, I could easily test my theory. Let’s let the slave replay events up to position 207, then stop it and restart it:

slave1 > show slave status\G
*************************** 1. row ***************************
            Master_Log_File: mysql-bin.000006
        Read_Master_Log_Pos: 98
             Relay_Log_File: mysql_sandbox20551-relay-bin.000028
              Relay_Log_Pos: 235
      Relay_Master_Log_File: mysql-bin.000006
           Slave_IO_Running: No
          Slave_SQL_Running: No
         ...... omitted .........

slave1 > start slave until master_log_file='mysql-bin.000007', master_log_pos=207;
slave1 > show slave status\G
*************************** 1. row ***************************
            Master_Log_File: mysql-bin.000007
         .... omitted ........
        Exec_Master_Log_Pos: 207

slave1 > show status like '%temp%';
+------------------------+-------+
| Variable_name          | Value |
+------------------------+-------+
| Slave_open_temp_tables | 1     |
+------------------------+-------+

The slave is now “vulnerable,” in theory. To test my theory, I’ll shut down and restart the slave gracefully, rather than simulating a crash with kill -9, and see what happens.

$ ./node1/stop
$ ./node1/start
$ ./s1
slave1 > show slave status\G
*************************** 1. row ***************************
                 Last_Errno: 1146
                 Last_Error: Error 'Table 'test.t' doesn't exist' on query. Default database: ''. Query: 'insert into test.t(a) values(1)'

That’s the error I thought I’d see. Even though it was used entirely within one transaction on the master, the temporary table was not safe for replication.
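The failure can be reproduced on paper with a toy model of the slave. The Python sketch below replays the logged statements in order and forgets every temporary table at a chosen restart point. It is only a model under loud assumptions: `first_failure` and its parsing are hypothetical, it treats every table named in an INSERT or DROP as temporary (true for this log, not in general), and it does not talk to a real server.

```python
# Events copied from the master's binlog shown earlier: (position, statement).
EVENTS = [
    (98, "create temporary table test.t(a int) engine=innodb"),
    (207, "BEGIN"),
    (271, "insert into test.t(a) values(1)"),
    (361, "drop temporary table test.t"),
    (447, "COMMIT /* xid=39 */"),
]


def first_failure(events, restart_before_pos=None):
    """Replay events; return the first (pos, statement) that fails, or None.

    A restart just before `restart_before_pos` discards all temporary
    tables, because they do not survive a server restart.
    """
    open_temp_tables = set()
    for pos, statement in events:
        if pos == restart_before_pos:
            open_temp_tables.clear()
        words = statement.lower().split()
        if words[:3] == ["create", "temporary", "table"]:
            # "test.t(a" -> "test.t"
            open_temp_tables.add(words[3].split("(")[0])
        elif words[:3] == ["drop", "temporary", "table"]:
            if words[3] not in open_temp_tables:
                return pos, statement
            open_temp_tables.remove(words[3])
        elif words[:2] == ["insert", "into"]:
            # Assumption: the target table is a temporary table.
            if words[2].split("(")[0] not in open_temp_tables:
                return pos, statement
    return None


print(first_failure(EVENTS))                          # no restart
print(first_failure(EVENTS, restart_before_pos=207))  # restart where the slave was stopped
```

Without a restart the replay finishes cleanly; with a restart before position 207 the model fails on the same INSERT that the real slave reported in Last_Error.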

I’m pretty sure this is a bug. The temporary table shouldn’t be logged out-of-order on the master like this (I suspect it’s logged out-of-order because CREATE TEMPORARY TABLE can’t be rolled back). But bug or no, it is what it is.

There’s one more angle to the email thread that inspired this article: what if the whole transaction is inside a stored procedure? Whether this works or not depends, again, on how the stored procedure call is logged to the binary log. Let’s create a stored procedure to hold the transaction, which this time will insert data from the temporary table into a non-temporary InnoDB table:

master > delimiter //
master > create procedure test_temp() begin
    -> start transaction;
    -> create temporary table test.t(a int) engine=innodb;
    -> insert into test.t(a) values(1);
    -> insert into test.ins(a) select * from test.t;
    -> drop temporary table test.t;
    -> commit;
    -> end//
master > delimiter ;

Now calling the stored procedure should put a row into the test.ins table. Let’s see:

master > call test_temp();
master > select * from test.ins;
+------+
| a    |
+------+
|    1 |
+------+

Good. Let’s see what’s in the binary log:

master > show binlog events in 'mysql-bin.000011'\G
*************************** 1. row ***************************
   Log_name: mysql-bin.000011
        Pos: 4
 Event_type: Format_desc
  Server_id: 1
End_log_pos: 98
       Info: Server ver: 5.0.45-log, Binlog ver: 4
*************************** 2. row ***************************
   Log_name: mysql-bin.000011
        Pos: 98
 Event_type: Query
  Server_id: 1
End_log_pos: 211
       Info: use `test`; create temporary table test.t(a int) engine=innodb
*************************** 3. row ***************************
   Log_name: mysql-bin.000011
        Pos: 211
 Event_type: Query
  Server_id: 1
End_log_pos: 279
       Info: use `test`; BEGIN
*************************** 4. row ***************************
   Log_name: mysql-bin.000011
        Pos: 279
 Event_type: Query
  Server_id: 1
End_log_pos: 94
       Info: use `test`; insert into test.t(a) values(1)
*************************** 5. row ***************************
   Log_name: mysql-bin.000011
        Pos: 373
 Event_type: Query
  Server_id: 1
End_log_pos: 198
       Info: use `test`; insert into test.ins select * from test.t
*************************** 6. row ***************************
   Log_name: mysql-bin.000011
        Pos: 477
 Event_type: Query
  Server_id: 1
End_log_pos: 288
       Info: use `test`; drop temporary table test.t
*************************** 7. row ***************************
   Log_name: mysql-bin.000011
        Pos: 567
 Event_type: Xid
  Server_id: 1
End_log_pos: 594
       Info: COMMIT /* xid=124 */
7 rows in set (0.00 sec)

What you see depends on your version of MySQL, because the logging of stored procedures has changed over time. If just the CALL statement had been logged, I think we might have been safe using a stored procedure. However, since all the statements went into the binlog individually, there’s clearly an opportunity to break replication here. It looks like this doesn’t avoid the problem either.
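To see how wide that opportunity is, the following Python sketch walks the stored procedure's logged statements and lists every position at which a temporary table would still be open on the slave, roughly what a nonzero Slave_open_temp_tables would indicate. The `PROC_EVENTS` list is copied from the output above with the "use `test`;" prefix dropped; the function name and the prefix matching are hypothetical and cover only the statement shapes in this post.

```python
# Statements logged for CALL test_temp(), as (position, statement).
PROC_EVENTS = [
    (98, "create temporary table test.t(a int) engine=innodb"),
    (211, "BEGIN"),
    (279, "insert into test.t(a) values(1)"),
    (373, "insert into test.ins select * from test.t"),
    (477, "drop temporary table test.t"),
    (567, "COMMIT /* xid=124 */"),
]


def positions_with_open_temp_tables(events):
    """Return positions where the slave holds an open temporary table
    just before the event at that position runs.

    Restarting the slave at any of these positions loses the table that
    later statements still refer to.
    """
    open_count = 0
    risky = []
    for pos, statement in events:
        if open_count > 0:
            risky.append(pos)
        text = statement.lower()
        if text.startswith("create temporary table"):
            open_count += 1
        elif text.startswith("drop temporary table"):
            open_count -= 1
    return risky


print(positions_with_open_temp_tables(PROC_EVENTS))
```

In this model every position from 211 through 477 is a point where stopping and restarting the slave would break replication; only the positions before the CREATE and after the DROP are safe.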

Interestingly, I also created a version of the stored procedure that doesn’t begin and commit a transaction. After I called it, the CREATE TEMPORARY TABLE statement was logged to the binlog; only when I then typed COMMIT did the rest of the statements go into the binlog. It appears to me that there’s no way to get the CREATE TEMPORARY TABLE statement to be logged inside the transaction. And when it comes to a replication slave, what’s logged, not what executes on the master, is what’s important.

In summary, I still don’t see any way to use temporary tables with MySQL statement-based replication without some risk of breaking slaves. At some point I may test how it works with row-based replication; I believe even the row-based logging format is going to have some problems, because the CREATE TABLE is logged in statement format. But that’s a topic for another post.