
Prerequisites

Install the OpenAI SDK:
pip install openai
The Inference API is OpenAI-compatible: any OpenAI SDK works once you point base_url at the Telnyx endpoint.

Python

import os
from openai import OpenAI

# Point the OpenAI SDK at the Telnyx endpoint; authenticate with your Telnyx API key.
client = OpenAI(
  api_key=os.getenv("TELNYX_API_KEY"),
  base_url="https://api.telnyx.com/v2/ai/openai",
)

stream = client.chat.completions.create(
  messages=[
    {"role": "user", "content": "Tell me about Telnyx"}
  ],
  model="moonshotai/Kimi-K2.5",
  stream=True,
)

# Print tokens as they arrive; delta.content is None on some chunks (e.g. the final one).
for chunk in stream:
  if chunk.choices[0].delta.content:
    print(chunk.choices[0].delta.content, end="", flush=True)

Core Concepts

Messages

Chat history passed to the model.

Roles

Every message has a role: system, user, assistant, or tool.
  • system — model behavior instructions
  • user — end-user input
  • assistant — model output
  • tool — function call results. See Function Calling.
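Put together, a multi-turn conversation is just an ordered list of role-tagged messages. A minimal sketch (the conversation content here is illustrative, not real API output):

```python
# A hypothetical multi-turn history: system instructions first, then alternating
# user and assistant turns. This is the `messages` list you pass to
# client.chat.completions.create(...).
messages = [
    {"role": "system", "content": "You are a concise support agent."},
    {"role": "user", "content": "What is Telnyx?"},
    {"role": "assistant", "content": "Telnyx is a connectivity platform."},
    {"role": "user", "content": "Does it host LLMs?"},  # the turn the model answers next
]

print([m["role"] for m in messages])
```

The model only sees what you send: to continue a conversation, append the assistant's reply and the next user turn, then send the whole list again.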

Models

Available Models lists all hosted LLMs with context lengths and capabilities.

Streaming

Set stream=True to receive tokens as server-sent events, in the same format OpenAI uses.
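As a sketch of how the streamed pieces fit together: each chunk carries a small text delta, and the full reply is the in-order concatenation of the non-empty deltas. The values below are mock stand-ins for chunk.choices[0].delta.content, not real API output:

```python
# Mock delta values as a streaming loop would see them; the final chunk's
# delta.content is None, so it must be skipped when assembling the reply.
deltas = ["Telnyx ", "is a ", "connectivity ", "platform.", None]

reply = "".join(d for d in deltas if d)
print(reply)
```

This is why the streaming loop in the Prerequisites example guards on delta.content before printing.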

What Next?

I want to…                          Go to
Build a voice assistant            No-Code Voice Assistant
Call custom code from the model    Function Calling / Streaming Functions
Ground responses in documents      Embeddings
Identify themes in data            Clusters
Migrate from OpenAI                OpenAI Migration
Browse all models                  Available Models