最近遇到一个MySQL连接的问题,远程连接MySQL时遇到“ERROR 2013 (HY000): Lost connection to MySQL server at 'reading authorization packet', system error: 0”错误,如下所示:

[root@DB-Server ~]# mysql -h 10.13.65.93 -u onecard -p

Enter password:

ERROR 2013 (HY000): Lost connection to MySQL server at 'reading authorization packet', system error: 0

这个测试的MySQL位于阿里云Kubernetes(K8s)中Docker容器里面,而且在远程连接MySQL出现上面错误的时候,Docker也会出现下面错误。

一般出现“ERROR 2013 (HY000): Lost connection to MySQL server at 'reading authorization packet'”错误的原因较多:

1:网络异常或时延非常高的时候, 超过连接时间限制(系统变量connect_timeout)会导致这个错误。MySQL客户端与数据库建立连接需要发起三次握手协议,正常情况下,这个时间非常短,但是一旦网络异常,网络超时等因素出现,就会导致这个握手协议无法完成,MySQL有个参数connect_timeout,它是MySQL服务端进程mysqld等待连接建立完成的时间,单位为秒。如果超过connect_timeout时间范围内,仍然无法完成协议握手话,MySQL客户端会收到异常。  更多详细信息可以参考我这篇博客“MySQL参数max_connect_errors分析释疑”,但是当前这个案例中,不存在网络延时情况,如下所示:

[root@DB-Server ~]# ping 10.13.65.93

PING 10.13.65.93 (10.13.65.93) 56(84) bytes of data.

64 bytes from 10.13.65.93: icmp_seq=1 ttl=97 time=36.1 ms

64 bytes from 10.13.65.93: icmp_seq=2 ttl=97 time=36.3 ms

64 bytes from 10.13.65.93: icmp_seq=3 ttl=97 time=36.1 ms

64 bytes from 10.13.65.93: icmp_seq=4 ttl=97 time=36.0 ms

64 bytes from 10.13.65.93: icmp_seq=5 ttl=97 time=36.1 ms

64 bytes from 10.13.65.93: icmp_seq=6 ttl=97 time=36.2 ms

64 bytes from 10.13.65.93: icmp_seq=7 ttl=97 time=36.1 ms

64 bytes from 10.13.65.93: icmp_seq=8 ttl=97 time=36.2 ms

 

--- 10.13.65.93 ping statistics ---

8 packets transmitted, 8 received, 0% packet loss, time 7003ms

rtt min/avg/max/mdev = 36.092/36.205/36.354/0.205 ms

2:域名解析会导致这个问题。当客户端连接上来,服务器端都会对客户端进来的IP地址进行DNS解析,来获得客户端的域名或主机名,如果DNS解析出了问题或DNS解析相当慢,就会导致连接验证用户出现问题。而skip-name-resolve这个参数的意义就是禁止域名解析。官方文档解释如下:

For each new client connection, the server uses the client IP address to check whether the client host name is in the host cache. If so, the server refuses or continues to process the connection request depending on whether or not the host is blocked. If the host is not in the cache, the server attempts to resolve the host name. First, it resolves the IP address to a host name and resolves that host name back to an IP address. Then it compares the result to the original IP address to ensure that they are the same. The server stores information about the result of this operation in the host cache. If the cache is full, the least recently used entry is discarded.

The server handles entries in the host cache like this:

  • When the first TCP client connection reaches the server from a given IP address, a new cache entry is created to record the client IP, host name, and client lookup validation flag. Initially, the host name is set to NULL and the flag is false. This entry is also used for subsequent client TCP connections from the same originating IP.

当有一个新的客户端连接通过TCP进来时,MySQL Server会为这个IP在host cache中建立一个新的记录,包括IP,主机名和client lookup validation flag,分别对应host_cache表中的IP,HOST和HOST_VALIDATED这三列。第一次建立连接因为只有IP,没有主机名,所以HOST将设置为NULL,HOST_VALIDATED将设置为FALSE。

  • If the validation flag for the client IP entry is false, the server attempts an IP-to-host name-to-IP DNS resolution. If that is successful, the host name is updated with the resolved host name and the validation flag is set to true. If resolution is unsuccessful, the action taken depends on whether the error is permanent or transient. For permanent failures, the host name remains NULL and the validation flag is set to true. For transient failures, the host name and validation flag remain unchanged. (In this case, another DNS resolution attempt occurs the next time a client connects from this IP.)

MySQL Server检测HOST_VALIDATED的值,如果为FALSE,它会试图进行DNS解析,如果解析成功,它将更新HOST的值为主机名,并将HOST_VALIDATED值设为TRUE。如果没有解析成功,判断失败的原因是永久的还是临时的,如果是永久的,则HOST的值依旧为NULL,且将HOST_VALIDATED的值设置为TRUE,后续连接不再进行解析,如果该原因是临时的,则HOST_VALIDATED依旧为FALSE,后续连接会再次进行DNS解析。

  • If an error occurs while processing an incoming client connection from a given IP address, the server updates the corresponding error counters in the entry for that IP. For a description of the errors recorded, see Section 26.12.17.1, “The host_cache Table”

如果在处理来自给定IP地址的传入客户端连接时发生错误,则服务器会更新该IP条目中的相应错误计数器。 有关记录的错误的说明,请参见第26.12.17.1节“host_cache表”。

这个案例里面,因为MySQL位于阿里云Kubernetes(K8s)中Docker容器里面,对公司内部的IP地址进行DNS解析确实会出现问题。我们在配置文件设置skip_name_resolve后,确实解决了这个问题。然后本来以为找到了原因的我,在本地两台机器上测试时发现(一台MySQL版本为5.6.41, 一台MySQL版本为5.6.23),即使两台服务器相互不能做DNS解析,如下截图所示,但是从192.168.27.180连接DB-Server时,并不会报这个错误。Why? 即使我将connect_timeout调整为2,依然不会出现这个错误。看来MySQL的连接不像我们表面看的那样简单。还是相当复杂。只是目前的技术水平,还做不到进一步分析!

另外,在这个案例的测试过程中,发现skip_name_resolve为OFF的情况下,将connect_timeout设大,也不会出现这个错误

mysql>  show variables like '%connect_timeout%';

+-----------------+-------+

| Variable_name   | Value |

+-----------------+-------+

| connect_timeout | 10    |

+-----------------+-------+

1 row in set (0.01 sec)

 

mysql> set global connect_timeout=30;

Query OK, 0 rows affected (0.00 sec)

 

mysql> 

然后从客户端连接MySQL数据库就成功了,如下所示,只是IP地址并不是客户端的IP地址,而是Port IP。

当然这种情况下Kubernetes(K8s)中Docker下MySQL并没有挂掉,反而当系统变量connect_timeout=10的情况下,如果没有开启系统变量skip_name_resolve,每次远程连接MySQL就会出现Kubernetes(K8s)中Docker下MySQL挂掉,重启的过程,所以极度怀疑是疑因为在连接过程,Docker下MySQL挂掉重启才出现这个错误。但是对K8s了解不多,涉及太广,没法进一步分析具体原因了。

最新文章

  1. SQL SERVER 2008 评估期已过
  2. 需要了解的 Linux 网络和监控命令
  3. flex弹性布局学习笔记
  4. Java常用命令
  5. [转]c#调用API截图
  6. OC6_复合类的类存管理
  7. HDU4612 Warm up 边双(重边)缩点+树的直径
  8. 安卓扫码:简单的ZXing使用记录
  9. MVC3.0+knockout.js+Ajax 实现简单的增删改查
  10. [TJOI 2017]可乐
  11. 再回首UML之上篇
  12. 论文笔记(1):Deep Learning.
  13. 20165305 《网络对抗技术》 Kali安装
  14. Zabbix调优不完全指南(https://www.jianshu.com/p/2d911d55448f)
  15. AI VGG16
  16. mysql 允许特定IP访问
  17. LOJ#3023 老C的键盘
  18. day11 高阶函数 函数式编程
  19. React多页面应用脚手架-v1.3.0
  20. 零值比较--BOOL,int,float,指针变量与零值比较的if语句

热门文章

  1. SeetaFace教程(一) 在 VS 中的编译安装和环境配置
  2. 一步步实现 Prism + MEF(二)--- 绑定命令
  3. 图解Mysql语句的执行过程
  4. mysql的索引key_len计算方法,及个字段所占字节数
  5. argparse 在深度学习中的应用
  6. 2017 Multi-University Training Contest - Team 7 Just do it
  7. NET Core2
  8. Jaspersoft Studio简介
  9. MapReduce作业的执行流程
  10. Java 8特性尝鲜:新新IO