Camb.ai Java SDK

Camb AI Website

The official Java SDK for interacting with Camb AI's powerful voice and audio generation APIs. Create expressive speech, unique voices, and rich soundscapes with just a few lines of Java.

✨ Features

Dubbing: Dub your videos into multiple languages with voice cloning!
Expressive Text-to-Speech: Convert text into natural-sounding speech using a wide range of pre-existing voices.
Generative Voices: Create entirely new, unique voices from text prompts and descriptions.
Soundscapes from Text: Generate ambient audio and sound effects from textual descriptions.
Access to voice cloning, translation, and more (refer to full API documentation).

📦 Installation

Gradle

Add the dependency to your build.gradle file:

dependencies {
    implementation 'ai.camb:cambai-java-sdk:0.0.1'
}

Maven

Add the dependency to your pom.xml file:

<dependency>
    <groupId>ai.camb</groupId>
    <artifactId>cambai-java-sdk</artifactId>
    <version>0.0.1</version>
</dependency>

🔑 Authentication & Accessing Clients

To use the Camb AI SDK, you'll need an API key. Pass it to the client builder:

import CambApiClient;

CambApiClient client = CambApiClient.builder()
    .apiKey("YOUR_CAMB_API_KEY")
    .build();
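Hardcoding the key as in the snippet above is fine for a quick test, but it is usually safer to read it from the environment. The following is a minimal sketch, assuming the key is stored in an environment variable named CAMB_API_KEY; both that name and the ApiKeyResolver helper are hypothetical and not part of the SDK.

```java
import java.util.Map;

/** Hypothetical helper (not part of the SDK): resolves an API key from environment variables. */
final class ApiKeyResolver {
    private ApiKeyResolver() {}

    /** Returns the trimmed value of {@code name}, or throws if it is missing or blank. */
    static String require(Map<String, String> env, String name) {
        String value = env.get(name);
        if (value == null || value.trim().isEmpty()) {
            throw new IllegalStateException("Missing environment variable: " + name);
        }
        return value.trim();
    }

    public static void main(String[] args) {
        // CAMB_API_KEY is an assumed variable name; use whatever your deployment defines.
        String apiKey = require(System.getenv(), "CAMB_API_KEY");
        // Pass apiKey to CambApiClient.builder().apiKey(apiKey).build() as shown above.
        System.out.println("API key loaded (" + apiKey.length() + " characters)");
    }
}
```

Failing fast with a clear message avoids sending requests with an empty key and debugging an authentication error later.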

Custom Hosting Provider (e.g. Baseten Mars8-Flash)

You can route TTS through a custom hosting provider such as Baseten while keeping the same SDK interface. The reference audio (reference_audio) can be a public URL or a base64-encoded audio file; Baseten caches it for faster inference.

import resources.texttospeech.requests.CreateStreamTtsRequestPayload;
import resources.texttospeech.types.CreateStreamTtsRequestPayloadLanguage;
import java.io.InputStream;

// Initialize the Baseten Mars8-Flash custom hosting provider.
// BASETEN_REFERENCE_AUDIO can be a public URL or base64-encoded audio file.
ITtsProvider ttsProvider = new BasetenProvider(
    System.getenv("BASETEN_API_KEY"),
    System.getenv("BASETEN_URL"),
    System.getenv("BASETEN_REFERENCE_AUDIO"), // reference voice
    "en-us"                                   // reference audio language
);

// Use the provider to generate speech
InputStream audioStream = ttsProvider.tts(CreateStreamTtsRequestPayload.builder()
    .text("Hello from Java via Baseten Mars8-Flash!")
    .language(CreateStreamTtsRequestPayloadLanguage.EN_US)
    .voiceId(1) // Required by the SDK's staged builder; ignored by the Baseten provider
    .build(), null);
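If the reference audio is a local file rather than a public URL, it has to be base64-encoded first. The sketch below shows one way to do that with the JDK's java.util.Base64. ReferenceAudioEncoder is a hypothetical helper, not part of the SDK, and it assumes the provider accepts a bare base64 string (no data-URI prefix), which this README does not specify.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Base64;

/** Hypothetical helper (not part of the SDK): base64-encodes reference audio. */
final class ReferenceAudioEncoder {
    private ReferenceAudioEncoder() {}

    /** Encodes raw audio bytes with the standard (non-URL-safe) base64 alphabet. */
    static String encode(byte[] audioBytes) {
        return Base64.getEncoder().encodeToString(audioBytes);
    }

    /** Reads the whole file into memory and encodes it; suitable for short reference clips. */
    static String encodeFile(Path audioFile) throws IOException {
        return encode(Files.readAllBytes(audioFile));
    }
}
```

The resulting string could then be exported as BASETEN_REFERENCE_AUDIO or passed directly as the third constructor argument in the snippet above.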

🚀 Getting Started: Examples

NOTE: For more examples and full runnable files refer to the examples/ directory.

Supported Models & Sample Rates

| Model Name | Sample Rate | Description |
| --- | --- | --- |
| mars-pro | 48 kHz | High-fidelity, professional-grade speech synthesis. Ideal for long-form content and dubbing. |
| mars-instruct | 22.05 kHz | Optimized for instruction-following and nuance control. |
| mars-flash | 22.05 kHz | Low-latency model optimized for real-time applications and conversational AI. |
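To get a feel for what the sample rates above mean in practice, the sketch below estimates the size of uncompressed PCM audio. It is illustrative arithmetic only and assumes 16-bit mono output, which this README does not state; PcmSizeEstimator is a hypothetical name.

```java
/** Illustrative arithmetic only: raw PCM size for a given sample rate. */
final class PcmSizeEstimator {
    private PcmSizeEstimator() {}

    /** Bytes of uncompressed PCM: sampleRate * channels * bytesPerSample * seconds. */
    static long pcmBytes(int sampleRateHz, int channels, int bitsPerSample, double seconds) {
        long bytesPerSecond = (long) sampleRateHz * channels * (bitsPerSample / 8);
        return Math.round(bytesPerSecond * seconds);
    }

    public static void main(String[] args) {
        // Assumption: 16-bit mono audio; actual output depends on the configured format.
        System.out.println("48 kHz, 10 s:    " + pcmBytes(48_000, 1, 16, 10.0) + " bytes");
        System.out.println("22.05 kHz, 10 s: " + pcmBytes(22_050, 1, 16, 10.0) + " bytes");
    }
}
```

Under these assumptions, ten seconds at 48 kHz is 960,000 bytes versus 441,000 bytes at 22.05 kHz, so the higher-fidelity model produces roughly 2.2 times as much raw data.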

1. Text-to-Speech (TTS)

Convert text into spoken audio using one of Camb AI's high-quality voices.

import resources.texttospeech.requests.CreateStreamTtsRequestPayload;
import resources.texttospeech.types.CreateStreamTtsRequestPayloadLanguage;
import resources.texttospeech.types.CreateStreamTtsRequestPayloadSpeechModel;
import types.OutputFormat;
import types.StreamTtsOutputConfiguration;
import java.io.InputStream;
import java.io.FileOutputStream;
import java.io.File;

// ... initialize client ...

InputStream audioStream = client.textToSpeech().tts(CreateStreamTtsRequestPayload.builder()
    .text("Hello from Camb AI! This is a test.")
    .voiceId(20303)
    .language(CreateStreamTtsRequestPayloadLanguage.EN_US) 
    .speechModel(CreateStreamTtsRequestPayloadSpeechModel.MARSPRO)
    .outputConfiguration(StreamTtsOutputConfiguration.builder().format(OutputFormat.WAV).build())
    .build());

// Save InputStream to file
File outputFile = new File("tts_output.wav");
try (FileOutputStream outputStream = new FileOutputStream(outputFile)) {
    audioStream.transferTo(outputStream);
}
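The snippet above closes the file but not the audio stream itself. As an alternative, here is a minimal sketch that closes the input stream as well and reports how many bytes were written, using java.nio.file.Files.copy. AudioFiles is a hypothetical helper, not part of the SDK.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Hypothetical helper (not part of the SDK): writes a streamed audio response to disk. */
final class AudioFiles {
    private AudioFiles() {}

    /**
     * Copies the stream to {@code target}, overwriting any existing file,
     * and closes the stream when done. Returns the number of bytes written.
     */
    static long save(InputStream audio, Path target) throws IOException {
        try (InputStream in = audio) {
            return Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
        }
    }
}
```

With the stream from the example above, usage would look like AudioFiles.save(audioStream, Path.of("tts_output.wav")). A return value of 0 is a quick signal that the response was empty.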

2. Text-to-Voice (Generative Voice)

Create completely new and unique voices from a textual description.

import resources.texttovoice.requests.CreateTextToVoiceRequestPayload;

var result = client.textToVoice().createTextToVoice(CreateTextToVoiceRequestPayload.builder()
    .text("A smooth, rich baritone voice.")
    .voiceDescription("Ideal for storytelling.")
    .build());

System.out.println("Generated voice sample URLs: " + result);

3. Text-to-Audio (Sound Generation)

Generate sound effects or ambient audio from a descriptive prompt.

import resources.texttoaudio.requests.CreateTextToAudioRequestPayload;
import java.util.Optional;

var response = client.textToAudio().createTextToAudio(CreateTextToAudioRequestPayload.builder()
    .prompt("A gentle breeze rustling through autumn leaves.")
    .duration(10)
    .audioType("sound")
    .build());

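// Fail with a clear message if the response carries no task ID.
String taskId = response.getTaskId()
    .orElseThrow(() -> new IllegalStateException("No task ID returned"));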
// Poll status and get result using client.textToAudio().getTextToAudioStatus(taskId)
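The polling step mentioned in the comment above can be sketched as a small generic loop. This is an assumption-laden illustration: TaskPoller is a hypothetical helper, not part of the SDK, and the shape of the status response (and which value means "done") is not described in this README, so the caller supplies both the status fetch and the completion check.

```java
import java.time.Duration;
import java.util.function.Predicate;
import java.util.function.Supplier;

/** Hypothetical helper (not part of the SDK): polls until a task reports completion. */
final class TaskPoller {
    private TaskPoller() {}

    /**
     * Calls {@code fetchStatus} up to {@code maxAttempts} times, sleeping {@code interval}
     * between attempts, and returns the first status accepted by {@code isDone}.
     * Throws IllegalStateException if no attempt satisfies the check.
     */
    static <T> T pollUntil(Supplier<T> fetchStatus, Predicate<T> isDone,
                           Duration interval, int maxAttempts) throws InterruptedException {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            T status = fetchStatus.get();
            if (isDone.test(status)) {
                return status;
            }
            if (attempt < maxAttempts) {
                Thread.sleep(interval.toMillis()); // wait before the next attempt
            }
        }
        throw new IllegalStateException("Task did not finish after " + maxAttempts + " attempts");
    }
}
```

In this setting, the supplier would wrap client.textToAudio().getTextToAudioStatus(taskId), and the predicate would inspect whatever completion field that response exposes. The same loop applies to other task-based endpoints such as dubbing.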

4. End-to-End Dubbing

Dub videos into multiple languages with voice cloning.

import resources.dub.requests.EndToEndDubbingRequestPayload;
import java.util.Collections;

var response = client.dub().endToEndDubbing(EndToEndDubbingRequestPayload.builder()
    .videoUrl("https://www.youtube.com/watch?v=dQw4w9WgXcQ") 
    .sourceLanguage(Languages.EN_US.getValue())
    .targetLanguages(Collections.singletonList(Languages.HI_IN.getValue()))
    .build());

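// Fail with a clear message if the response carries no task ID.
String taskId = response.getTaskId()
    .orElseThrow(() -> new IllegalStateException("No task ID returned"));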
// Poll status using client.dub().getEndToEndDubbingStatus(taskId)

⚙️ Advanced Usage & Other Features

The Camb AI SDK offers a wide range of capabilities beyond these examples, including:

Voice Cloning
Translated TTS
Audio Dubbing
Transcription
And more!

Please refer to the Official Camb AI API Documentation for a comprehensive list of features and advanced usage patterns.

📖 Examples

Check out the examples/ directory for complete, runnable examples:

examples/BasicTts.java - Basic text-to-speech example
examples/TextToAudioExample.java - Sound generation example
examples/TextToVoiceExample.java - Generative voice example
examples/DubbingExample.java - Video dubbing workflow
examples/BasetenProvider.java - Using custom hosting providers

🔗 Links

License

This project is licensed under the MIT License. See the LICENSE file for details.
