Together AI hosts open-weight models (Llama 3.x, Qwen, Mistral, DeepSeek, and more) behind a fast OpenAI-compatible API. Arize AX captures every Together AI call — prompts, responses, model name, and token usage — by reusing theDocumentation Index
Fetch the complete documentation index at: https://arize-ax.mintlify.dev/docs/llms.txt
Use this file to discover all available pages before exploring further.
openinference-instrumentation-openai package against the OpenAI client pointed at Together’s endpoint.
Prerequisites
- Python 3.9+
- An Arize AX account (sign up)
- A
TOGETHER_API_KEYfrom the Together AI console
Launch Arize AX
- Sign in to your Arize AX account.
- From Space Settings, copy your Space ID and API Key. You will set them as
ARIZE_SPACE_IDandARIZE_API_KEYbelow.
Install
Configure credentials
Setup tracing
Run Together AI
Expected output
Verify in Arize AX
- Open your Arize AX space and select project
together-ai-tracing-example. - You should see a new trace within ~30 seconds containing a
ChatCompletionLLM span with the prompt, response, and token usage attached. The span’sllm.model_namereflects the Together-hosted model (e.g.meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo). - If no traces appear, see Troubleshooting.
Troubleshooting
- No traces in Arize AX. Confirm
ARIZE_SPACE_IDandARIZE_API_KEYare set in the same shell that runsexample.py. Enable OpenTelemetry debug logs withexport OTEL_LOG_LEVEL=debugand re-run. - OpenAI spans missing.
OpenAIInstrumentor().instrument(...)must run before anyfrom openai import .... Make sureinstrumentation.pyis the first import in your entry point. 401from Together. VerifyTOGETHER_API_KEYis set and active. Generate a new key at api.together.ai/settings/api-keys if needed.model_not_found. The exact model slug must match Together’s catalog — see Together’s models page. Swap themodel=string for one your account can call.