Gather using speak

POST /calls/:call_control_id/actions/gather_using_speak

Convert text to speech and play it on the call until the required DTMF signals are gathered to build interactive menus.

You can pass a list of valid digits along with an 'invalid_payload', which will be played back at the beginning of each prompt. Speech will be interrupted when a DTMF signal is received. The Answer command must be issued before the gather_using_speak command.
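As an illustration of the menu use case, the sketch below builds a request body for a one-digit menu. The helper name build_menu_request, the shape of the options mapping, and the prompt wording are assumptions made for this example; only the body field names come from this page.

```python
import json


def build_menu_request(options, voice="AWS.Polly.Joanna"):
    """Build a gather_using_speak body for a simple one-digit menu.

    `options` maps a DTMF digit to a spoken label (hypothetical input shape).
    """
    prompt = " ".join(f"Press {digit} for {label}." for digit, label in options.items())
    return {
        "payload": prompt,
        # Spoken when the caller's input does not pass validation.
        "invalid_payload": "Sorry, that is not a valid option.",
        "voice": voice,
        # Accept only the digits that appear in the menu.
        "valid_digits": "".join(options),
        "minimum_digits": 1,
        "maximum_digits": 1,
    }


body = build_menu_request({"1": "sales", "2": "support"})
print(json.dumps(body, indent=2))
```

Setting minimum_digits and maximum_digits to 1 is one possible choice for a single-key menu; longer inputs such as account numbers would use wider bounds.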

Expected Webhooks (see callback schema below):

call.dtmf.received (you may receive many of these webhooks)
call.gather.ended
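A webhook receiver can branch on these two event names. The sketch below is only a rough outline: the callback schema is not reproduced on this page, so the envelope used here (event name at data.event_type, details under data.payload, a digit field for DTMF events) is an assumption that should be checked against the actual callback schema.

```python
import json


def handle_webhook(raw_body, collected):
    """Route one webhook by event type; return True once the gather has ended.

    ASSUMPTION: the event name is at data.event_type and the details are
    under data.payload. Neither field name is confirmed by this page.
    """
    event = json.loads(raw_body).get("data", {})
    event_type = event.get("event_type")
    payload = event.get("payload", {})
    if event_type == "call.dtmf.received":
        # Many of these may arrive, one per key press.
        collected.append(payload.get("digit"))
        return False
    if event_type == "call.gather.ended":
        return True
    # Unrelated events are ignored by this sketch.
    return False
```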

Request

Path Parameters

call_control_id string, required

Unique identifier and token for controlling the call

Body (application/json, required)

Gather using speak request

payload string, required

The text or SSML to be converted into speech. There is a 3,000 character limit.

invalid_payload string

The text or SSML to be converted into speech when the digits don't match the valid_digits parameter or the number of digits is not between minimum_digits and maximum_digits. There is a 3,000 character limit.
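Both payload and invalid_payload share the 3,000 character limit, so a client may want to check lengths before sending. This is a minimal sketch; payload_fits is a hypothetical helper, and it counts Python string characters, which may not match exactly how the service counts.

```python
MAX_PAYLOAD_CHARS = 3000  # limit stated for payload and invalid_payload


def payload_fits(text, limit=MAX_PAYLOAD_CHARS):
    """Return True when the text is within the documented character limit."""
    return len(text) <= limit


print(payload_fits("Please enter your account number."))
```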

payload_type string

Possible values: [text, ssml]

Default value: text

The type of the provided payload. The payload can either be plain text, or Speech Synthesis Markup Language (SSML).

service_level string

Possible values: [basic, premium]

Default value: premium

This parameter impacts speech quality, language options and payload types. When using basic, only the en-US language and payload type text are allowed.
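The basic restrictions can be checked client-side before the request is sent. The sketch below assumes a request body held as a dict; check_service_level is a hypothetical helper, and a missing language is treated as acceptable because this page does not state a default for it.

```python
def check_service_level(body):
    """Return a list of problems with the body under the documented service_level rules."""
    problems = []
    level = body.get("service_level", "premium")  # documented default
    if level not in ("basic", "premium"):
        problems.append(f"unknown service_level: {level!r}")
    if level == "basic":
        # basic allows only the text payload type and the en-US language.
        if body.get("payload_type", "text") != "text":
            problems.append("basic supports only payload_type text")
        if body.get("language") not in (None, "en-US"):
            problems.append("basic supports only language en-US")
    return problems


print(check_service_level({"service_level": "basic", "payload_type": "ssml"}))
```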

voice string, required

Specifies the voice used in speech synthesis.

Define voices using the format <Provider>.<Model>.<VoiceId>. Specifying only the provider will give default values for voice_id and model_id.

Supported Providers:

AWS: Use AWS.Polly.<VoiceId> (e.g., AWS.Polly.Joanna). For neural voices, which provide more realistic, human-like speech, append -Neural to the VoiceId (e.g., AWS.Polly.Joanna-Neural). Check the available voices for compatibility.
Azure: Use Azure.<VoiceId> (e.g., Azure.en-CA-ClaraNeural, Azure.en-CA-LiamNeural, Azure.en-US-BrianMultilingualNeural, Azure.en-US-Ava:DragonHDLatestNeural). For a complete list of voices, go to Azure Voice Gallery.
ElevenLabs: Use ElevenLabs.<ModelId>.<VoiceId> (e.g., ElevenLabs.eleven_multilingual_v2.21m00Tcm4TlvDq8ikWAM). The ModelId part is optional. To use ElevenLabs, you must provide your ElevenLabs API key as an integration identifier secret in "voice_settings": {"api_key_ref": "<secret_identifier>"}. See integration secrets documentation for details. Check available voices.
Telnyx: Use Telnyx.<model_id>.<voice_id>

For service_level basic, you may define the gender of the speaker (male or female).
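The voice formats above are plain dot-joined strings, so they can be assembled with small helpers. The function names below are hypothetical and exist only for this sketch; the resulting strings follow the AWS and ElevenLabs patterns listed above.

```python
def polly_voice(voice_id, neural=False):
    """Build an AWS.Polly.<VoiceId> string, appending -Neural for neural voices."""
    suffix = "-Neural" if neural else ""
    return f"AWS.Polly.{voice_id}{suffix}"


def elevenlabs_voice(voice_id, model_id=None):
    """Build ElevenLabs.<ModelId>.<VoiceId>; the ModelId part is optional."""
    parts = ["ElevenLabs"]
    if model_id:
        parts.append(model_id)
    parts.append(voice_id)
    return ".".join(parts)


print(polly_voice("Joanna", neural=True))
print(elevenlabs_voice("21m00Tcm4TlvDq8ikWAM", "eleven_multilingual_v2"))
```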

voice_settings object

The settings associated with the voice selected. One of the following:

api_key_ref string

The identifier for an integration secret (/v2/integration_secrets) that refers to your ElevenLabs API key. Warning: Free plans are unlikely to work with this integration.

voice_speed float

Possible values: >= 0.1 and <= 2

Default value: 1

The speed at which the selected voice speaks.
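Because voice_settings is a oneOf, a request carries either the api_key_ref form or the voice_speed form. The two builders below are hypothetical helpers sketching each variant; which providers accept voice_speed is not stated on this page, so that pairing is left to the caller.

```python
def api_key_settings(secret_identifier):
    """voice_settings variant that references an integration secret."""
    return {"api_key_ref": secret_identifier}


def speed_settings(voice_speed=1.0):
    """voice_settings variant that sets voice_speed within the documented range."""
    if not 0.1 <= voice_speed <= 2.0:
        raise ValueError("voice_speed must be between 0.1 and 2.0")
    return {"voice_speed": voice_speed}


print(api_key_settings("my_elevenlabs_secret"))  # placeholder identifier
print(speed_settings(1.25))
```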


language string

Possible values: [arb, cmn-CN, cy-GB, da-DK, de-DE, en-AU, en-GB, en-GB-WLS, en-IN, en-US, es-ES, es-MX, es-US, fr-CA, fr-FR, hi-IN, is-IS, it-IT, ja-JP, ko-KR, nb-NO, nl-NL, pl-PL, pt-BR, pt-PT, ro-RO, ru-RU, sv-SE, tr-TR]

The language you want spoken. This parameter is ignored when a Polly.* voice is specified.

minimum_digits int32

Default value: 1

The minimum number of digits to fetch. This parameter has a minimum value of 1.

maximum_digits int32

Default value: 128

The maximum number of digits to fetch. This parameter has a maximum value of 128.
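A client can validate the two digit bounds before sending. In this sketch, digit_bound_problems is a hypothetical helper; the checks for 1 and 128 come from the parameter descriptions, while the minimum-not-above-maximum check is an added assumption.

```python
def digit_bound_problems(minimum_digits=1, maximum_digits=128):
    """Return a list of problems with the digit bounds (empty when they look fine)."""
    problems = []
    if minimum_digits < 1:
        problems.append("minimum_digits must be at least 1")
    if maximum_digits > 128:
        problems.append("maximum_digits must be at most 128")
    # ASSUMPTION: a minimum above the maximum is treated as a client error.
    if minimum_digits > maximum_digits:
        problems.append("minimum_digits must not exceed maximum_digits")
    return problems


print(digit_bound_problems(4, 4))
```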

maximum_tries int32

Default value: 3

The maximum number of times the prompt is played back if there is no input from the user on the call.

timeout_millis int32

Default value: 60000

The number of milliseconds to wait for a DTMF response after the speech ends before replaying the prompt.

terminating_digit string

Default value: #

The digit used to terminate input if fewer than maximum_digits digits have been gathered.
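To reason about how terminating_digit and maximum_digits interact, a local model of the collection loop can help. This is not the service's implementation: collect_digits is a hypothetical function, and it assumes the terminating digit itself is left out of the gathered result.

```python
def collect_digits(presses, terminating_digit="#", maximum_digits=128):
    """Model a gather: stop at the terminating digit or once maximum_digits is reached."""
    digits = ""
    for key in presses:
        if key == terminating_digit:
            # ASSUMPTION: the terminating digit is not part of the result.
            break
        digits += key
        if len(digits) == maximum_digits:
            break
    return digits


print(collect_digits("123#45"))
print(collect_digits("123456", maximum_digits=4))
```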

valid_digits string

Default value: 0123456789#*

A list of all digits accepted as valid.
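The condition that triggers invalid_payload (digits outside valid_digits, or a count outside the minimum and maximum) can be mirrored in a few lines for testing menu logic locally. The function name input_is_valid is an assumption of this sketch; the defaults match the documented default values.

```python
def input_is_valid(digits, valid_digits="0123456789#*", minimum_digits=1, maximum_digits=128):
    """Return True when every digit is allowed and the count is within bounds."""
    in_range = minimum_digits <= len(digits) <= maximum_digits
    return in_range and all(d in valid_digits for d in digits)


print(input_is_valid("12", valid_digits="123"))
print(input_is_valid("14", valid_digits="123"))
```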

inter_digit_timeout_millis int32

Default value: 5000

The number of milliseconds to wait for input between digits.

client_state string

Use this field to add state to every subsequent webhook. It must be a valid Base-64 encoded string.
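Since client_state must be Base64, the application's own state has to be encoded before it is sent and decoded when it comes back in a webhook. A minimal sketch with the standard library follows; the helper names are hypothetical.

```python
import base64


def encode_client_state(text):
    """Encode an application state string as Base64 for the client_state field."""
    return base64.b64encode(text.encode("utf-8")).decode("ascii")


def decode_client_state(value):
    """Decode a client_state value back to text; raises on invalid Base64."""
    return base64.b64decode(value, validate=True).decode("utf-8")


state = encode_client_state("have a nice day =]")
print(state)
print(decode_client_state(state))
```

The string encoded here is the one that the client_state value in the request sample on this page decodes to.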

command_id string

Use this field to avoid duplicate commands. Telnyx will ignore any command with the same command_id for the same call_control_id.
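One way to use command_id is to generate it once per logical command and reuse it on every retry, so a resend is ignored rather than executed twice. The sketch below uses a random UUID, which is a choice made for this example; this page does not require a particular format.

```python
import uuid


def new_command_id():
    """Generate a unique identifier for one logical command."""
    return str(uuid.uuid4())


def with_command_id(body, command_id):
    """Return a copy of the request body carrying the given command_id."""
    return {**body, "command_id": command_id}


command_id = new_command_id()
first_attempt = with_command_id({"payload": "Enter your PIN."}, command_id)
retry = with_command_id({"payload": "Enter your PIN."}, command_id)  # same id on retry
print(first_attempt == retry)
```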

Responses

200: Successful response upon making a call control command.

application/json

Schema

data

object

result string

{
  "data": {
    "result": "ok"
  }
}

default: Unexpected error

application/json

Schema

errors

Error[]

Array [

code integer, required

title string, required

detail string

source

object

pointer json-pointer

JSON pointer (RFC6901) to the offending entity.

parameter string

Indicates which query parameter caused the error.

meta object

]

{
  "errors": [
    {
      "code": "string",
      "title": "string",
      "detail": "string",
      "source": {
        "pointer": "string",
        "parameter": "string"
      },
      "meta": {}
    }
  ]
}

Callbacks

Request samples

curl -L 'https://api.telnyx.com/v2/calls/:call_control_id/actions/gather_using_speak' \
-H 'Content-Type: application/json' \
-H 'Accept: application/json' \
-H 'Authorization: Bearer <TOKEN>' \
-d '{
  "payload": "say this on call",
  "invalid_payload": "say this on call",
  "payload_type": "text",
  "service_level": "premium",
  "voice": "male",
  "language": "arb",
  "minimum_digits": 1,
  "maximum_digits": 10,
  "terminating_digit": "#",
  "valid_digits": "123",
  "inter_digit_timeout_millis": 10000,
  "client_state": "aGF2ZSBhIG5pY2UgZGF5ID1d",
  "command_id": "891510ac-f3e4-11e8-af5b-de00688a4901"
}'
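The same request can be assembled in Python with only the standard library. This sketch builds the request object without sending it; build_gather_request is a hypothetical helper, and the call control ID and token passed in are placeholders.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://api.telnyx.com/v2"


def build_gather_request(call_control_id, token, body):
    """Build (but do not send) the POST request for gather_using_speak."""
    # Percent-encode the ID so it is safe as a single path segment.
    segment = urllib.parse.quote(call_control_id, safe="")
    url = f"{API_BASE}/calls/{segment}/actions/gather_using_speak"
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Accept": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


request = build_gather_request(
    "example-call-control-id",  # placeholder
    "TOKEN",  # placeholder
    {"payload": "say this on call", "voice": "AWS.Polly.Joanna", "valid_digits": "123"},
)
print(request.get_method(), request.full_url)
```

Passing the request to urllib.request.urlopen would send it; that step is left out here so the sketch runs without network access or credentials.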

Response samples

{
  "data": {
    "result": "ok"
  }
}

{
  "errors": [
    {
      "code": "string",
      "title": "string",
      "detail": "string",
      "source": {
        "pointer": "string",
        "parameter": "string"
      },
      "meta": {}
    }
  ]
}
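A client can treat the two response shapes above uniformly: return data.result on 200 and raise on anything else. The sketch below assumes the status code and raw body are already available; CallControlError and parse_response are hypothetical names.

```python
import json


class CallControlError(Exception):
    """Raised when the API returns an error document."""


def parse_response(status, raw_body):
    """Return data.result for a 200 response; raise with the error summary otherwise."""
    document = json.loads(raw_body)
    if status == 200:
        return document["data"]["result"]
    errors = document.get("errors", [])
    summary = "; ".join(f"{e.get('code')}: {e.get('title')}" for e in errors)
    raise CallControlError(summary or f"HTTP {status}")


print(parse_response(200, '{"data": {"result": "ok"}}'))
```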
