mysql-connect_retries_on_failure has no effect on failures such as connection refused (#4597)

For a hostgroup with a single backend, if the backend server goes down and is marked as "SHUNNED", ProxySQL returns the error "Max connect timeout reached while reaching hostgroup" only after the "mysql-connect_timeout_server_max" duration elapses. In contrast, a client that connects directly, without ProxySQL, gets a "connection refused" error (error code 111) immediately, which makes more sense.

After debugging the latest binary, it appears that ProxySQL keeps trying to obtain a usable connection, at intervals controlled by "mysql-connect_retries_delay", until "mysql-connect_timeout_server_max" is reached. This happens in the method MySQL_Session::handler_again___status_CONNECTING_SERVER. https://github.com/sysown/proxysql/blob/27e71d29729c67cbaf6beafcb3a3b4eeca7ab9b4/lib/MySQL_Session.cpp#L3177
The retry is not even controlled by "mysql-connect_retries_on_failure" (as it should be), because in the scenario described above the process always falls into this condition. https://github.com/sysown/proxysql/blob/27e71d29729c67cbaf6beafcb3a3b4eeca7ab9b4/lib/MySQL_Session.cpp#L3244
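To make the observed behaviour concrete, here is a minimal sketch of a wait that is bounded only by time. It is not ProxySQL code: `wait_for_backend`, `WaitResult` and the simulated clock are hypothetical names invented for illustration, and the real logic is an event-driven state machine rather than a loop.

```cpp
// Hypothetical model of the observed behaviour; not taken from ProxySQL.
struct WaitResult {
    int polls;        // how many times the hostgroup was asked for a server
    long elapsed_ms;  // simulated time spent before returning
};

// Uses a simulated clock, so nothing actually sleeps.
// backend_available models whether the hostgroup can hand out a server;
// for a single SHUNNED backend it stays false for the whole wait.
WaitResult wait_for_backend(long connect_timeout_server_max_ms,
                            long connect_retries_delay_ms,
                            bool backend_available) {
    WaitResult r{0, 0};
    // Guard against a non-positive delay so the simulation always advances.
    if (connect_retries_delay_ms <= 0) connect_retries_delay_ms = 1;
    while (r.elapsed_ms < connect_timeout_server_max_ms) {
        ++r.polls;
        if (backend_available) return r;           // a real connect could start here
        r.elapsed_ms += connect_retries_delay_ms;  // wait, then ask again
    }
    // At this point the client would see "Max connect timeout reached ...".
    return r;
}
```

With a 10000 ms timeout and a 1000 ms delay, `wait_for_backend(10000, 1000, false)` polls 10 times and gives up after the full 10000 ms. No retry counter appears anywhere in this model, which is the point of the report.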
Debugging further shows that the method MyHGC::get_random_MySrvC returns NULL every time during the retry for this single "SHUNNED" backend. https://github.com/sysown/proxysql/blob/27e71d29729c67cbaf6beafcb3a3b4eeca7ab9b4/lib/MyHGC.cpp#L55
According to this method, ProxySQL should attempt to bring the "SHUNNED" backend back online, but the attempt never succeeds because "mysrvc->time_last_detected_error" is always in the future, as a result of the adjustment tied to "mysql-monitor_ping_interval".
```cpp
// if Monitor is enabled and mysql-monitor_ping_interval is
// set too high, ProxySQL will unshun hosts that are not
// available. For this reason time_last_detected_error will
// be tuned in the future
if (mysql_thread___monitor_enabled) {
	int a = mysql_thread___shun_recovery_time_sec;
	int b = mysql_thread___monitor_ping_interval;
	b = b/1000;
	if (b > a) {
		t = t + (b - a);
	}
}
mysrvc->time_last_detected_error = t;
```
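The arithmetic in the quoted snippet can be worked through in isolation. The sketch below reproduces only that adjustment. `shifted_error_time` and `eligible_for_unshun` are hypothetical helper names, and the eligibility check is an assumption about how the unshun decision compares the current time with the stored timestamp, not a copy of the ProxySQL condition.

```cpp
// Reproduces the adjustment from the quoted snippet with illustrative names.
// now and the return value are in seconds; the ping interval is in milliseconds.
long shifted_error_time(long now,
                        int shun_recovery_time_sec,
                        int monitor_ping_interval_ms,
                        bool monitor_enabled) {
    long t = now;
    if (monitor_enabled) {
        int ping_interval_sec = monitor_ping_interval_ms / 1000;
        if (ping_interval_sec > shun_recovery_time_sec) {
            // Push the recorded error time into the future by the difference.
            t += ping_interval_sec - shun_recovery_time_sec;
        }
    }
    return t;
}

// ASSUMPTION: a server becomes eligible for unshunning only once the current
// time exceeds the recorded error time by more than the recovery window.
bool eligible_for_unshun(long now, long time_last_detected_error,
                         int recovery_window_sec) {
    return now > time_last_detected_error &&
           (now - time_last_detected_error) > recovery_window_sec;
}
```

For example, with an error detected at t = 1000, a recovery time of 10 s and a ping interval of 60000 ms, the stored timestamp becomes 1050. Under the assumed check, the backend is not eligible at t = 1011 but is at t = 1061, so a client whose connect timeout expires before then never sees a real connection attempt.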

I'm not sure whether this logic is intentional. However, the retries should be governed by "mysql-connect_retries_on_failure" rather than by the current time-based behaviour. Additionally, I believe it would be more useful to return the actual failure error to the client.
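As a rough illustration of the behaviour requested here, the following sketch bounds the attempts by a retry count and hands the last error back to the caller. It is a hypothetical design, not a patch: `connect_with_retries`, `ConnectOutcome` and the `try_connect` callback are invented names, and treating the setting as "one initial attempt plus N retries" is an assumption about its intended semantics.

```cpp
#include <functional>

// Error code reported in the issue for a refused connection (ECONNREFUSED on Linux).
constexpr int kConnRefused = 111;

struct ConnectOutcome {
    bool connected;  // true if some attempt succeeded
    int attempts;    // number of attempts actually made
    int last_errno;  // error of the last failed attempt, 0 on success
};

// try_connect is a hypothetical callback: it returns 0 on success or an
// errno-style code on failure.
// ASSUMPTION: total attempts = 1 initial attempt + connect_retries_on_failure.
ConnectOutcome connect_with_retries(const std::function<int()>& try_connect,
                                    int connect_retries_on_failure) {
    ConnectOutcome out{false, 0, 0};
    int retries = connect_retries_on_failure > 0 ? connect_retries_on_failure : 0;
    for (int i = 0; i < 1 + retries; ++i) {
        ++out.attempts;
        int err = try_connect();
        if (err == 0) {
            out.connected = true;
            out.last_errno = 0;
            return out;
        }
        out.last_errno = err;  // remember the real failure for the client
    }
    return out;  // caller can report last_errno instead of a generic timeout
}
```

Called with a callback that always returns `kConnRefused` and a retry setting of 3, this makes 4 attempts and reports error 111, so the client would learn the real cause after a bounded number of tries.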
