ctrld

mirror of https://github.com/Control-D-Inc/ctrld.git synced 2026-02-03 22:18:39 +00:00

Author	SHA1	Message	Date
Cuong Manh Le	3ea69b180c	cmd/cli: use config timeout when checking upstream Otherwise, for slow network connection (like plane wifi), the check may fail even though the internet is available.	2025-01-14 14:32:01 +07:00
Cuong Manh Le	3e388c2857	all: leaking queries to OS resolver instead of SRVFAIL So it would work in more general case than just captive portal network, which ctrld have supported recently. Uses who may want no leaking behavior can use a config to turn off this feature.	2024-09-30 18:20:27 +07:00
Cuong Manh Le	5c24acd952	cmd/cli: fix bug causes checkUpstream run only once To prevent duplicated running of checkUpstream function at the same time, upstream monitor uses a boolean to report whether the upstream is checking. If this boolean is true, then other calls after the first one will be returned immediately. However, checkUpstream does not set this boolean to false when it finishes, thus all future calls to checkUpstream won't be run, causing the upstream is marked as down forever. Fixing this by ensuring the boolean is reset once checkUpstream done. While at it, also guarding all upstream monitor operations with a mutex, ensuring there's no race condition between marking upstream state.	2023-12-18 21:30:36 +07:00
Cuong Manh Le	d88cf52b4e	cmd/cli: always rebootstrap when check upstream Otherwise, network changes may not be seen on some platforms, causing ctrld failed to recover and failing all requests. While at it, also doing the check DNS in separate goroutine, prevent it from blocking ctrld from notifying others that it "started". The issue was seen when ctrld is configured as direct listener, requests are flooded before ctrld started, causing the healtch process failed.	2023-11-06 20:01:25 +07:00
Cuong Manh Le	511c4e696f	cmd/cli: add upstream monitor Some users mentioned that when there is an Internet outage, ctrld fails to recover, crashing or locks up the router. When requests start failing, this results in the clients emitting more queries, creating a resource spiral of death that can brick the device entirely. To guard against this case, this commit implement an upstream monitor approach: - Marking upstream as down after 100 consecutive failed queries. - Start a goroutine to check when the upstream is back again. - When upstream is down, answer all queries with SERVFAIL. - The checking process uses backoff retry to reduce high requests rate. - As long as the query succeeded, marking the upstream as alive then start operate normally.	2023-09-22 18:45:59 +07:00

5 Commits