feat: Twitter Spaces Integration #1550

slkzgm · 2024-12-29T02:44:43Z

Relates to:

No specific issue linked. This PR addresses new community-driven features.

Risks

Low. Existing users who relied on Deepgram by default will still see no change unless they explicitly define a new TRANSCRIPTION_PROVIDER. Fallback logic preserves original behavior (Deepgram → OpenAI → Local).

Background

What does this PR do?

Adds an optional TRANSCRIPTION_PROVIDER setting (deepgram, openai, or local) with fallback logic.
- If not set, old behavior remains: Deepgram → OpenAI → Local.
Moves Twitter Spaces plugins from agent-twitter-client into this repo for better flexibility and less friction in plugin development.
Introduces an AI-driven Twitter Spaces flow:
1. Automatic Space launch decisions (random chance, business hours, cooldown intervals).
2. Multi-speaker logic with queue management (maxSpeakers).
3. GPT-based filler/idle messages, optional STT/TTS bridging, local audio recording.
4. Graceful shutdown and cooldown for repeated Spaces.

Flow Recap

Periodic Check
- If no Space is running, possibly launch one by shouldLaunchSpace() (random chance, business hours, cooldown).
- If a Space is running, manageCurrentSpace() handles speaker timeouts, occupancy updates, queue acceptance, etc.
Space Creation
- Generates a SpaceConfig (topics from config or GPT).
- Attaches plugins: audio recording, STT/TTS, idle monitor, etc.
- Hooks into speakerRequest, occupancyUpdate, idleTimeout, etc.
Speaker Logic
- Maintains an activeSpeakers array + a queue if at capacity (maxSpeakers).
- Enforces speakerMaxDurationMs per speaker.
- If a speaker is removed, accept next in queue if available.
Stopping
- stopSpace() finalizes the Space, logs completion, clears states, etc.
- Resumes periodic checks at a slower interval until the next launch is decided.

Configuration

A) `.env` / Environment Variables

# Transcription Provider
TRANSCRIPTION_PROVIDER=         # Default is local (possible values: deepgram, openai, local)
OPENAI_API_KEY=sk-...
DEEPGRAM_API_KEY=...

B) `character.json` → `"twitterSpaces"` Field

{
  // ...
  "twitterSpaces": {
    "maxSpeakers": 2,
    "topics": [
      "Blockchain Trends",
      "AI Innovations"
    ],
    "typicalDurationMinutes": 45,
    "idleKickTimeoutMs": 300000,
    "minIntervalBetweenSpacesMinutes": 60,
    "businessHoursOnly": true,
    "randomChance": 0.3,
    "enableIdleMonitor": true,
    "enableSttTts": true,
    "enableRecording": false,
    "voiceId": "21m00Tcm4TlvDq8ikWAM",
    "sttLanguage": "en",
    "gptModel": "gpt-3.5-turbo",
    "systemPrompt": "You are a helpful AI co-host assistant.",
    "speakerMaxDurationMs": 240000
  }
}

maxSpeakers: number of concurrent speakers allowed.
topics: if none are provided, GPT generates them dynamically.
randomChance: probability for each check cycle to spawn a new Space.
speakerMaxDurationMs: maximum time each speaker can speak before removal.

What kind of change is this?

Features (new Twitter Spaces integration and optional transcription provider).
Improvements (unified plugin development, more config options, fallback logic maintained).

Documentation changes needed?

Yes, minimal. We must mention:

The new TRANSCRIPTION_PROVIDER in .env (optional).
The new twitterSpaces config section in character.json.

Testing

Where should a reviewer start?

Check transcription.service.ts to review the fallback logic.
Check new or relocated Twitter Spaces integration files for the Space lifecycle (launch, speaker management, idle detection, etc.).

Detailed testing steps

Define TRANSCRIPTION_PROVIDER in .env (or leave it empty to keep old fallback).
Provide valid API keys if choosing deepgram or openai.
Define twitterSpaces.randomChance in character json to 1 (for 100% rate of starting a space)
Run the agent; verify that Spaces launch automatically, respect the chosen transcription provider, and handle multi-speaker logic as expected.

No special database migrations are needed. Basic local runs and logs confirm correct functioning.

Future Improvements

More robust decision logic for accepting speakers, switching, and timeouts.
Realtime API plugin for smoother, on-the-fly conversation handling.
Solo Broadcast Mode: launch Spaces focused on a single host monologue with no external speakers.
True VAD (Voice Activity Detection) to detect when a speaker finishes talking, instead of relying on manual mute/unmute cues.
Advanced scheduling triggers (e.g., event-based or calendar-based).
Analytics & insights for post-Space summaries or usage metrics.

github-actions

Hi @slkzgm! Welcome to the ai16z community. Thanks for submitting your first pull request; your efforts are helping us accelerate towards AGI. We'll review it shortly. You are now a ai16z contributor!

odilitime

please add back the documentation

packages/client-twitter/src/environment.ts

…, moving twitter spaces plugins to Eliza repo

lalalune · 2025-01-01T12:13:00Z

Some conflicts that need review, we should prioritize getting this in since it's a pretty big push

slkzgm added 2 commits December 29, 2024 03:29

feat: Twitter spaces integration

91edfee

Adding pnpm-lock.yaml

baaec2d

github-actions bot reviewed Dec 29, 2024

View reviewed changes

odilitime changed the base branch from main to develop December 29, 2024 19:45

odilitime requested changes Dec 29, 2024

View reviewed changes

packages/client-twitter/src/environment.ts Show resolved Hide resolved

slkzgm added 3 commits December 30, 2024 00:03

Adding back comments and documentation

eeeb31c

feat: add optional TRANSCRIPTION_PROVIDER setting with fallback logic…

e479f05

…, moving twitter spaces plugins to Eliza repo

Cleanup on packages/client-twitter/src/plugins/SttTtsPlugin.ts

daa0e13

slkzgm requested a review from odilitime December 30, 2024 20:19

fix: Merge conflicts with develop

0c06d22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Twitter Spaces Integration #1550

feat: Twitter Spaces Integration #1550

slkzgm commented Dec 29, 2024 •

edited

Loading

github-actions bot left a comment

odilitime left a comment

lalalune commented Jan 1, 2025

feat: Twitter Spaces Integration #1550

Are you sure you want to change the base?

feat: Twitter Spaces Integration #1550

Conversation

slkzgm commented Dec 29, 2024 • edited Loading

Relates to:

Risks

Background

What does this PR do?

Flow Recap

Configuration

A) .env / Environment Variables

B) character.json → "twitterSpaces" Field

What kind of change is this?

Documentation changes needed?

Testing

Where should a reviewer start?

Detailed testing steps

Future Improvements

github-actions bot left a comment

Choose a reason for hiding this comment

odilitime left a comment

Choose a reason for hiding this comment

lalalune commented Jan 1, 2025

slkzgm commented Dec 29, 2024 •

edited

Loading

A) `.env` / Environment Variables

B) `character.json` → `"twitterSpaces"` Field