Mar 28, 2025

Engineering

Cartesia Python SDK v2.0.0

Arjun Desai

We are excited to announce the release of v2.0.0 of our Python SDK, polishing the developer experience when using Cartesia's AI voice capabilities with Python.

Getting started with Cartesia using Python

Install the Cartesia Python SDK in your project with:

pip install cartesia

Initialize the SDK and authenticate:

from cartesia import Cartesia
import os

client = Cartesia(api_key=os.getenv("CARTESIA_API_KEY"))

Now, you can start making requests. For example, to generate audio with Cartesia's text-to-speech model:

client.tts.bytes(
    model_id="sonic-2",
    transcript="Hello, world!",
    voice={
        "mode": "id",
        "id": "694f9389-aac1-45b6-b726-9d9369183238",
    },
    language="en",
    output_format={
        "container": "wav",
        "sample_rate": 44100,
        "encoding": "pcm_f32le",
    },
)

For more examples, check out our API Explorer to generate Python code snippets for any of our APIs.

The Python SDK at a glance

Building upon industry established SDK patterns, v2 of our Python SDK delivers a great development experience structured around a primary Cartesia client, which is the entry point for accessing the various API endpoints.

Even more features

Basic Client - Instantiate and use the client with just 24 lines of code.
Async Client - The SDK exports an async client alongside standard real-time calls, allowing you to make non-blocking API requests.
Streaming - The SDK supports streaming responses and outputs a generator that you can iterate over.
WebSocket - Integrate using WebSockets to build realtime, low-latency voice applications.
Exception Handling - The API gracefully handles non-success status codes (4xx and 5xx responses).
Retries - The SDK is instrumented with automatic retries with exponential backoff.
Timeouts - The SDK defaults to a 60 second timeout. You can configure this with a timeout option at the client or request level.
Custom Client — You can override the httpx client to customize it for your use-case. Some common use-cases include support for proxies and transports.

What’s next?

We can’t wait to see what you build with the Cartesia Python SDK! Your feedback helps us improve—let us know your thoughts on our Discord or by submitting issues on GitHub.

An expressive, studio-quality voice

0:00/1:34

An expressive, studio-quality voice

0:00/1:34

An expressive, studio-quality voice

0:00/1:34

Listen to these demos:

An expressive, studio-quality voice

0:00/1:34

An expressive, studio-quality voice

0:00/1:34

An expressive, studio-quality voice

0:00/1:34

We are excited to announce the release of v2.0.0 of our Python SDK, polishing the developer experience when using Cartesia's AI voice capabilities with Python.

Getting started with Cartesia using Python

Install the Cartesia Python SDK in your project with:

pip install cartesia

Initialize the SDK and authenticate:

from cartesia import Cartesia
import os

client = Cartesia(api_key=os.getenv("CARTESIA_API_KEY"))

Now, you can start making requests. For example, to generate audio with Cartesia's text-to-speech model:

client.tts.bytes(
    model_id="sonic-2",
    transcript="Hello, world!",
    voice={
        "mode": "id",
        "id": "694f9389-aac1-45b6-b726-9d9369183238",
    },
    language="en",
    output_format={
        "container": "wav",
        "sample_rate": 44100,
        "encoding": "pcm_f32le",
    },
)

For more examples, check out our API Explorer to generate Python code snippets for any of our APIs.

The Python SDK at a glance

Even more features

Basic Client - Instantiate and use the client with just 24 lines of code.
Async Client - The SDK exports an async client alongside standard real-time calls, allowing you to make non-blocking API requests.
Streaming - The SDK supports streaming responses and outputs a generator that you can iterate over.
WebSocket - Integrate using WebSockets to build realtime, low-latency voice applications.
Exception Handling - The API gracefully handles non-success status codes (4xx and 5xx responses).
Retries - The SDK is instrumented with automatic retries with exponential backoff.
Timeouts - The SDK defaults to a 60 second timeout. You can configure this with a timeout option at the client or request level.
Custom Client — You can override the httpx client to customize it for your use-case. Some common use-cases include support for proxies and transports.

What’s next?

We can’t wait to see what you build with the Cartesia Python SDK! Your feedback helps us improve—let us know your thoughts on our Discord or by submitting issues on GitHub.

Create Your First PVC

Professional Voice Clones are available through our Playground UI and API, and across 15 languages. Bring your voice data and let’s begin fine-tuning. For more info, check out Docs.

We are excited to announce the release of v2.0.0 of our Python SDK, polishing the developer experience when using Cartesia's AI voice capabilities with Python.

Getting started with Cartesia using Python

Install the Cartesia Python SDK in your project with:

pip install cartesia

Initialize the SDK and authenticate:

from cartesia import Cartesia
import os

client = Cartesia(api_key=os.getenv("CARTESIA_API_KEY"))

Now, you can start making requests. For example, to generate audio with Cartesia's text-to-speech model:

client.tts.bytes(
    model_id="sonic-2",
    transcript="Hello, world!",
    voice={
        "mode": "id",
        "id": "694f9389-aac1-45b6-b726-9d9369183238",
    },
    language="en",
    output_format={
        "container": "wav",
        "sample_rate": 44100,
        "encoding": "pcm_f32le",
    },
)

For more examples, check out our API Explorer to generate Python code snippets for any of our APIs.

The Python SDK at a glance

Even more features

Basic Client - Instantiate and use the client with just 24 lines of code.
Async Client - The SDK exports an async client alongside standard real-time calls, allowing you to make non-blocking API requests.
Streaming - The SDK supports streaming responses and outputs a generator that you can iterate over.
WebSocket - Integrate using WebSockets to build realtime, low-latency voice applications.
Exception Handling - The API gracefully handles non-success status codes (4xx and 5xx responses).
Retries - The SDK is instrumented with automatic retries with exponential backoff.
Timeouts - The SDK defaults to a 60 second timeout. You can configure this with a timeout option at the client or request level.
Custom Client — You can override the httpx client to customize it for your use-case. Some common use-cases include support for proxies and transports.

What’s next?

We can’t wait to see what you build with the Cartesia Python SDK! Your feedback helps us improve—let us know your thoughts on our Discord or by submitting issues on GitHub.

Expressive, studio-quality voices:

0:00/1:34

Ultra-realistic, conversational voices:

0:00/1:34

Perfectly capturing diverse speakers:

0:00/1:34

Create Your First PVC

Professional Voice Clones are available through our Playground UI and API, and across 15 languages. Bring your voice data and let’s begin fine-tuning. For more info, check out Docs.

Create Your First PVC

Professional Voice Clones are available through our Playground UI and API, and across 15 languages. Bring your voice data and let’s begin fine-tuning. For more info, check out Docs.

Listen to these demos:

An expressive, studio-quality voice

0:00/1:34

An ultra-realistic, conversational voice

0:00/1:34

Capturing a localized speaker perfectly

0:00/1:34

An expressive, studio-quality voice

0:00/1:34

An expressive, studio-quality voice

0:00/1:34

Try our Python SDK today

Learn more about the implementation details

Integrate now

May 6, 2025

News

Introducing Professional Voice Cloning

Mar 28, 2025

Engineering

Cartesia Python SDK v2.0.0

Mar 25, 2025

News

Cartesia Named to 7th Annual Enterprise Tech 30 List Presented by Wing Venture Capital

May 6, 2025

News

Introducing Professional Voice Cloning

Mar 28, 2025

Engineering

Cartesia Python SDK v2.0.0

Real-time, multimodal intelligence for every device.

Models

Products

Resources

Company

Legal

Real-time, multimodal intelligence for every device.

Models

Products

Resources

Company

Legal

Real-time, multimodal intelligence for every device.

Models

Products

Resources

Company

Legal

Cartesia will be at Customer Contact Week (CCW) Las Vegas on June 9–12

Stop by our booth and meeting room for live demos of the world’s fastest voice AI—let us know you're coming, and you'll get early access to our demos and our exclusive swag!

Get updates & swag

Join dinner waitlist

Cartesia will be at Customer Contact Week (CCW) Las Vegas on June 9–12

Stop by our booth and meeting room for live demos of the world’s fastest voice AI—let us know you're coming, and you'll get early access to our demos and our exclusive swag!

Get updates & swag

Join dinner waitlist

Models

Resources

Pricing

Get started

Models

Sonic

Deployments

Resources

Docs

Blog

Customers

About

Research

Careers

Pricing

Cartesia Python SDK v2.0.0

Cartesia Python SDK v2.0.0

Getting started with Cartesia using Python

The Python SDK at a glance

Even more features

What’s next?

Listen to these demos:

Getting started with Cartesia using Python

The Python SDK at a glance

Even more features

What’s next?

Create Your First PVC

Getting started with Cartesia using Python

The Python SDK at a glance

Even more features

What’s next?

Create Your First PVC

Create Your First PVC

Listen to these demos:

Try our Python SDK today

Related articles

Related articles