ElevenLabs Speech-to-Text Skill: Usage Guide

2026-03-28 Source: 网淘吧

ElevenLabs Speech-to-Text

Transcribe audio files with ElevenLabs' Scribe v2 model. It supports more than 90 languages and can separate speakers (diarization).

Quick Start

# Basic transcription
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3

# With speaker diarization
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --diarize

# Specify language (improves accuracy)
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --lang en

# Full JSON output with timestamps
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --json

Options

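Flag	Description
`--diarize`	Identify the different speakers
`--lang CODE`	ISO language code (e.g. en, pt, es)
`--json`	Output the full JSON, including word-level timestamps
`--events`	Tag audio events (laughter, music, etc.)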
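
The flags above can be combined in one call. As a minimal sketch, the hypothetical helper below (`transcribe_with` is not part of the skill) turns two optional settings into the `--lang` and `--diarize` flags from the table and passes them to whatever script path it is given. It assumes only that the script accepts the audio file first and the flags after it, as in the Quick Start commands.

```shell
# Hypothetical wrapper, not part of the skill.
# Usage: transcribe_with SCRIPT FILE [LANG_CODE] [yes|no]
transcribe_with() {
  local script="$1" file="$2" lang="${3:-}" diarize="${4:-no}"

  # Rebuild the positional parameters as the argument list for the script.
  set -- "$file"
  if [ -n "$lang" ]; then
    set -- "$@" --lang "$lang"
  fi
  if [ "$diarize" = "yes" ]; then
    set -- "$@" --diarize
  fi

  "$script" "$@"
}
```

For example, `transcribe_with {baseDir}/scripts/transcribe.sh meeting.mp3 en yes` would run the script with `meeting.mp3 --lang en --diarize`.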

Supported Formats

All major audio/video formats: mp3, m4a, wav, ogg, webm, mp4, and more.
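
A script that feeds many files to the skill may want a quick check before calling it. The sketch below uses a hypothetical helper, `is_listed_format`, that matches a file's extension against the six formats named above. Because the list is not exhaustive, a non-match only means the format is not named in this guide, not that the skill rejects it.

```shell
# Hypothetical helper, not part of the skill.
# Returns 0 if FILE has one of the extensions named in this guide.
is_listed_format() {
  local file="$1" ext

  # A name without a dot has no extension to check.
  case "$file" in
    *.*) ;;
    *) return 1 ;;
  esac

  # Take the text after the last dot and lowercase it, so "MP3" matches "mp3".
  ext="$(printf '%s' "${file##*.}" | tr '[:upper:]' '[:lower:]')"

  case "$ext" in
    mp3|m4a|wav|ogg|webm|mp4) return 0 ;;
    *) return 1 ;;
  esac
}
```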

API Key

Set the ELEVENLABS_API_KEY environment variable, or configure the key in clawdbot.json:

{
  skills: {
    entries: {
      "elevenlabs-stt": {
        apiKey: "sk_..."
      }
    }
  }
}
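
When the environment-variable route is used, a calling script can fail early if the key is missing. This is a minimal sketch with a hypothetical helper, `require_api_key`. It checks only that ELEVENLABS_API_KEY is non-empty, and it knows nothing about a key configured in clawdbot.json, so it suits setups that rely on the variable alone.

```shell
# Hypothetical helper, not part of the skill.
# Returns 1 with a message on stderr when ELEVENLABS_API_KEY is unset or empty.
require_api_key() {
  if [ -z "${ELEVENLABS_API_KEY:-}" ]; then
    echo "ELEVENLABS_API_KEY is not set" >&2
    return 1
  fi
}
```

A script might call `require_api_key || exit 1` before its first transcription.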

Examples

# Transcribe a WhatsApp voice note
{baseDir}/scripts/transcribe.sh ~/Downloads/voice_note.ogg

# Meeting recording with multiple speakers
{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize --lang en

# Get JSON for processing
{baseDir}/scripts/transcribe.sh podcast.mp3 --json > transcript.json
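
The same pattern extends to a folder of recordings. The sketch below defines a hypothetical function, `transcribe_all`, that runs a given script on every .mp3 file in a directory and saves each result next to its source as a .txt file. It assumes the script writes the transcript to standard output, which the redirect in the last example suggests but this guide does not state for plain-text mode.

```shell
# Hypothetical batch helper, not part of the skill.
# Usage: transcribe_all SCRIPT DIRECTORY
transcribe_all() {
  local script="$1" dir="$2" f

  for f in "$dir"/*.mp3; do
    # If the glob matched nothing, "$f" is the literal pattern; skip it.
    [ -e "$f" ] || continue

    # Assumption: the transcript is printed on stdout.
    "$script" "$f" > "${f%.mp3}.txt" || echo "failed: $f" >&2
  done
}
```

For example, `transcribe_all {baseDir}/scripts/transcribe.sh ~/Downloads` would write `voice.txt` beside `voice.mp3`.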
