Module @arizeai/phoenix-client

@arizeai/phoenix-client

This package provides a TypeScript client for the Arize Phoenix API. It is still under active development and is subject to change.

Installation

# or yarn, pnpm, bun, etc...
npm install @arizeai/phoenix-client

Configuration

The client will automatically read environment variables from your environment, if available.

The following environment variables are used:

PHOENIX_HOST - The base URL of the Phoenix API.
PHOENIX_API_KEY - The API key to use for authentication.
PHOENIX_CLIENT_HEADERS - Custom headers to add to all requests. A JSON stringified object.

PHOENIX_HOST='http://localhost:12345' PHOENIX_API_KEY='xxxxxx' pnpx tsx examples/list_datasets.ts
# emits the following request:
# GET http://localhost:12345/v1/datasets
# headers: {
#   "Authorization": "Bearer xxxxxx",
# }

Alternatively, you can pass configuration options to the client directly, and they will be prioritized over environment variables and default values.

const phoenix = createClient({
  options: {
    baseUrl: "http://localhost:6006",
    headers: {
      Authorization: "Bearer xxxxxx",
    },
  },
});

Prompts

@arizeai/phoenix-client provides a prompts export that exposes utilities for working with prompts for LLMs.

Creating a Prompt and push it to Phoenix

The createPrompt function can be used to create a prompt in Phoenix for version control and reuse.

import { createPrompt, promptVersion } from "@arizeai/phoenix-client/prompts";

const version = createPrompt({
  name: "my-prompt",
  description: "test-description",
  version: promptVersion({
    description: "version description here",
    modelProvider: "OPENAI",
    modelName: "gpt-3.5-turbo",
    template: [
      {
        role: "user",
        content: "{{ question }}",
      },
    ],
    invocationParameters: {
      temperature: 0.8,
    },
  }),
});

Prompts that are pushed to Phoenix are versioned and can be tagged.

Pulling a Prompt from Phoenix

The getPrompt function can be used to pull a prompt from Phoenix based on some Prompt Identifier and returns it in the Phoenix SDK Prompt type.

import { getPrompt } from "@arizeai/phoenix-client/prompts";

const prompt = await getPrompt({ name: "my-prompt" });
// ^ you now have a strongly-typed prompt object, in the Phoenix SDK Prompt type

const promptByTag = await getPrompt({ tag: "production", name: "my-prompt" });
// ^ you can optionally specify a tag to filter by

const promptByVersionId = await getPrompt({
  versionId: "1234567890",
});
// ^ you can optionally specify a prompt version Id to filter by

Using a Phoenix Prompt with an LLM Provider SDK

The toSDK helper function can be used to convert a Phoenix Prompt to the format expected by an LLM provider SDK. You can then use the LLM provider SDK as normal, with your prompt.

If your Prompt is saved in Phoenix as openai, you can use the toSDK function to convert the prompt to the format expected by OpenAI, or even Anthropic and Vercel AI SDK. We will do a best effort conversion to your LLM provider SDK of choice.

The following LLM provider SDKs are supported:

Vercel AI SDK: ai ai
OpenAI: openai openai
Anthropic: anthropic @anthropic-ai/sdk

import { generateText } from "ai";
import { openai } from "@ai-sdk/openai";
import { getPrompt, toSDK } from "@arizeai/phoenix-client/prompts";

const prompt = await getPrompt({ name: "my-prompt" });
const promptAsAI = toSDK({
  sdk: "ai",
  // ^ the SDK you want to convert the prompt to, supported SDKs are listed above
  variables: {
    "my-variable": "my-value",
  },
  // ^ you can format the prompt with variables, if the prompt has any variables in its template
  //   the format (Mustache, F-string, etc.) is specified in the Prompt itself
  prompt,
});
// ^ promptAsAI is now in the format expected by the Vercel AI SDK generateText function

const response = await generateText({
  model: openai(prompt.model_name),
  // ^ the model adapter provided by the Vercel AI SDK can be swapped out for any other model
  //   adapter supported by the Vercel AI SDK. Take care to use the correct model name for the
  //   LLM provider you are using.
  ...promptAsAI,
});

REST Endpoints

The client provides a REST API for all endpoints defined in the Phoenix OpenAPI spec.

Endpoints are accessible via strongly-typed string literals and TypeScript auto-completion inside of the client object.

import { createClient } from "@arizeai/phoenix-client";

const phoenix = createClient();

// Get all datasets
const datasets = await phoenix.GET("/v1/datasets");

// Get specific prompt
const prompt = await phoenix.GET("/v1/prompts/{prompt_identifier}/latest", {
  params: {
    path: {
      prompt_identifier: "my-prompt",
    },
  },
});

A comprehensive overview of the available endpoints and their parameters is available in the OpenAPI viewer within Phoenix, or in the Phoenix OpenAPI spec.

Datasets

The @arizeai/phoenix-client package allows you to create and manage datasets, which are collections of examples used for experiments and evaluation.

Creating a Dataset

You can create a dataset by providing a name, description, and an array of examples (each with input, output, and optional metadata).

import { createDataset } from "@arizeai/phoenix-client/datasets";

const { datasetId } = await createDataset({
  name: "questions",
  description: "a simple dataset of questions",
  examples: [
    {
      input: { question: "What is the capital of France" },
      output: { answer: "Paris" },
      metadata: {},
    },
    {
      input: { question: "What is the capital of the USA" },
      output: { answer: "Washington D.C." },
      metadata: {},
    },
  ],
});
// You can now use datasetId to run experiments or add more examples

Experiments

The @arizeai/phoenix-client package provides an experiments API for running and evaluating tasks on datasets. This is useful for benchmarking models, evaluating outputs, and tracking experiment results in Phoenix.

Running an Experiment

To run an experiment, you typically:

Create a dataset (or use an existing one)
Define a task function to run on each example
Define one or more evaluators to score or label the outputs
Run the experiment and inspect the results

Below is a complete example:

import { createDataset } from "@arizeai/phoenix-client/datasets";
import {
  asExperimentEvaluator,
  runExperiment,
} from "@arizeai/phoenix-client/experiments";

// 1. Create a dataset
const { datasetId } = await createDataset({
  name: "names-dataset",
  description: "a simple dataset of names",
  examples: [
    {
      input: { name: "John" },
      output: { text: "Hello, John!" },
      metadata: {},
    },
    {
      input: { name: "Jane" },
      output: { text: "Hello, Jane!" },
      metadata: {},
    },
  ],
});

// 2. Define a task to run on each example
const task = async (example) => `hello ${example.input.name}`;

// 3. Define evaluators
const evaluators = [
  asExperimentEvaluator({
    name: "matches",
    kind: "CODE",
    evaluate: async ({ output, expected }) => {
      const matches = output === expected?.text;
      return {
        label: matches ? "matches" : "does not match",
        score: matches ? 1 : 0,
        explanation: matches
          ? "output matches expected"
          : "output does not match expected",
        metadata: {},
      };
    },
  }),
  asExperimentEvaluator({
    name: "contains-hello",
    kind: "CODE",
    evaluate: async ({ output }) => {
      const matches = typeof output === "string" && output.includes("hello");
      return {
        label: matches ? "contains hello" : "does not contain hello",
        score: matches ? 1 : 0,
        explanation: matches
          ? "output contains hello"
          : "output does not contain hello",
        metadata: {},
      };
    },
  }),
];

// 4. Run the experiment
const experiment = await runExperiment({
  dataset: { datasetId },
  task,
  evaluators,
});

Hint: Tasks and evaluators are instrumented using OpenTelemetry. You can view detailed traces of experiment runs and evaluations directly in the Phoenix UI for debugging and performance analysis.

Examples

To run examples, install dependencies using pnpm and run:

pnpm install
pnpx tsx examples/list_datasets.ts
# change the file name to run other examples

Compatibility

This package utilizes openapi-ts to generate the types from the Phoenix OpenAPI spec.

Because of this, this package only works with the arize-phonix server 8.0.0 and above.

Compatibility Table:

Phoenix Client Version	Phoenix Server Version
^2.0.0	^9.0.0
^1.0.0	^8.0.0

Community

Join our community to connect with thousands of AI builders:

🌍 Join our Slack community.
📚 Read the Phoenix documentation.
💡 Ask questions and provide feedback in the #phoenix-support channel.
🌟 Leave a star on our GitHub.
🐞 Report bugs with GitHub Issues.
𝕏 Follow us on 𝕏.
🗺️ Check out our roadmap to see where we're heading next.

Modules

__generated__/api/v1
client
config
datasets
datasets/appendDatasetExamples
datasets/createDataset
datasets/createOrGetDataset
datasets/getDataset
datasets/getDatasetExamples
datasets/getDatasetInfo
datasets/getDatasetInfoByName
datasets/listDatasets
experiments
experiments/createExperiment
experiments/deleteExperiment
experiments/getExperiment
experiments/getExperimentInfo
experiments/getExperimentRuns
experiments/helpers
experiments/helpers/asExperimentEvaluator
experiments/helpers/fromPhoenixLLMEvaluator
experiments/helpers/getExperimentEvaluators
experiments/listExperiments
experiments/logging
experiments/resumeEvaluation
experiments/resumeExperiment
experiments/runExperiment
index
logger
prompts
prompts/constants
prompts/createPrompt
prompts/getPrompt
prompts/listPrompts
prompts/sdks
prompts/sdks/constants
prompts/sdks/toAI
prompts/sdks/toAnthropic
prompts/sdks/toOpenAI
prompts/sdks/toSDK
prompts/sdks/types
schemas/jsonLiteralSchema
schemas/jsonSchema
schemas/llm/anthropic/converters
schemas/llm/anthropic/messagePartSchemas
schemas/llm/anthropic/messageSchemas
schemas/llm/anthropic/toolCallSchemas
schemas/llm/anthropic/toolChoiceSchemas
schemas/llm/anthropic/toolSchemas
schemas/llm/constants
schemas/llm/converters
schemas/llm/openai/converters
schemas/llm/openai/messagePartSchemas
schemas/llm/openai/messageSchemas
schemas/llm/openai/responseFormatSchema
schemas/llm/openai/toolCallSchemas
schemas/llm/openai/toolChoiceSchemas
schemas/llm/openai/toolSchemas
schemas/llm/phoenixPrompt/converters
schemas/llm/phoenixPrompt/messagePartSchemas
schemas/llm/phoenixPrompt/messageSchemas
schemas/llm/phoenixPrompt/responseFormatSchema
schemas/llm/phoenixPrompt/toolCallSchemas
schemas/llm/phoenixPrompt/toolChoiceSchemas
schemas/llm/phoenixPrompt/toolSchemas
schemas/llm/schemas
schemas/llm/types
schemas/llm/utils
schemas/llm/vercel/messagePartSchemas
schemas/llm/vercel/messageSchemas
schemas/llm/vercel/toolChoiceSchemas
schemas/llm/vercel/toolSchemas
sessions
sessions/addSessionAnnotation
sessions/logSessionAnnotations
sessions/types
spans
spans/addDocumentAnnotation
spans/addSpanAnnotation
spans/addSpanNote
spans/deleteSpan
spans/getSpanAnnotations
spans/getSpans
spans/logDocumentAnnotations
spans/logSpanAnnotations
spans/types
types/annotations
types/core
types/datasets
types/experiments
types/projects
types/prompts
utils/assertUnreachable
utils/channel
utils/ensureString
utils/formatPromptMessages
utils/getPromptBySelector
utils/isHttpError
utils/isObject
utils/noopLogger
utils/pluralize
utils/promisifyResult
utils/safelyParseJSON
utils/safelyStringifyJSON
utils/schemaMatches
utils/toObjectHeaders
utils/urlUtils