Current Regions
| Region | Location |
|---|---|
| US East | Atlanta |
| US West | Denver |
| EU | Paris |
| Asia-Pacific | Sydney |
Routing
Inference processing is latency-based, influenced by the ingress domain you call, not by your account’s data locality setting. Telnyx will endeavor to process in the preferred region, but does not guarantee it:| Ingress domain | Preferred region |
|---|---|
api.telnyx.com | US |
api.telnyx.eu | EU |
api.telnyx.com.au | APAC |
api.telnyx.eu) directs requests to the nearest GPU region for that domain under normal conditions. Telnyx does not guarantee processing location: during failover or capacity events, requests are processed at the next-lowest-latency region rather than failing. A region-selection API parameter is on the roadmap.
Data Residency
Processing location and storage location are controlled separately:- Processing in transit is latency-based, influenced by the ingress domain you call (see Routing above). Telnyx will endeavor to process in the preferred region, but it is not a guaranteed processing location.
- Storage at rest depends on the endpoint. The chat completions endpoint does not store request or response data. The responses endpoint stores conversations, and that storage is governed by your Data Locality setting.
Roadmap
- Region selection API parameter
- Per-region model status and latency metrics
- Edge inference for sub-50ms response times