In this blog post, we will discuss MySQL performance on eXFlash DIMMs. Earlier we measured the IO performance of these storage devices with sysbench fileio.
The benchmarking environment was the same as the one we did sysbench fileio in.
CPU: 2x Intel Xeon E5-2690 (hyper threading enabled)
FusionIO driver version: 3.2.6 build 1212
Operating system: CentOS 6.5
Kernel version: 2.6.32-431.el6.x86_64
In this case, we used a separate machine for testing which had a 10G ethernet connection to this server. This server executed sysbench. The client was not the bottleneck in this case. The environment is described in greater detail at the end of the blog post.
Sysbench OLTP write workload
The graph shows throughput for sysbench OLTP, we will examine properties only for the dark areas of this graph: which is the read/write case for high concurrency.
Each table in the following sections has the following columns
|storage||The device that was used for the measurement.|
|threads||The number of sysbench client threads were used in the benchmark.|
|ro_rw||Read-only or read-write. In the whitepaper you can find detailed information about read-only data as well.|
|sd||The standard deviation of the metric in question.|
|mean||The mean of the metric in question.|
|95thpct||The 95th percentile of the metric in question (the maximum without the highest 5 percent of the samples).|
|max||The maximum of the metric in question.|
Sysbench OLTP throughput
Going from 4 to 8 eXFlash DIMMs will mostly mean more consistent throughput. The mean throughput is significantly higher in case of 8 DIMMs used, but the 95th percentile and the maximum values are not much different (the difference in standard deviation also shows this). The reason they are not much different is that these benchmark are CPU bound (check CPU idle time table later in this post or the graphs in the whitepaper). The PCI-E flash drive on the other hand can do less than half of the throughput of the eXFlash DIMMs (the most relevant is comparing the 95th percentile value).
Sysbench OLTP response time
The 95th percentile response time for the eXFlash DIMM’s case are less than 1/4 compared to the PCI-E flash device.
CPU idle percentage
The percentage of CPU being idle shows that the performance bottleneck in this benchmark was the CPU in case of eXFlash DIMMs (both with 4 and 8 DIMMs, this is why we didn’t see a substantial throughput difference between the 4 and the 8 DIMM setup). However, for the PCI-E flash, the storage device itself was the bottleneck.
If you are interested in more details, download the free white paper which contains the full analysis of sysbench OLTP and linkbench benchmarks.