VexaScribe Features
AI transcription in 99 languages. Speaker detection, timestamps, AI summaries, and built-in translation (133 languages). Upload files or send a bot to your meetings. From $2/month.
What VexaScribe is, in 80 words
VexaScribe is a web app that turns audio and video into searchable, timestamped, speaker-labeled transcripts using OpenAI Whisper. Drop a file (up to 5 GB) or send a bot to your Zoom, Google Meet, or Teams meeting. Get a transcript in 99 languages in ~5–10 minutes per hour of audio, optional AI summary with action items, and exports to TXT, DOCX, SRT, VTT, or JSON. 30 minutes free, then $2–$20/month. No credit card to start.
What VexaScribe doesn't do
Five things VexaScribe is genuinely not built for, with the tool we'd actually recommend in each case. If your use case is on this list, save yourself the trial signup.
No real-time live captioning
Transcripts are generated after upload, not as you speak. A 1-hour file takes 5–10 minutes to process — fine for meetings you watch back, wrong for live events.
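To plan around that delay, here is a rough estimate in Python. It assumes the quoted 5–10 minutes per hour scales linearly with file length, which is our simplification and not a guarantee; the function name is made up for this example.

```python
from __future__ import annotations


def processing_time_minutes(audio_minutes: float) -> tuple[float, float]:
    """Return a (low, high) estimate of processing time in minutes.

    Assumption: the 5-10 minutes quoted per hour of audio scales
    linearly with length. Queueing and upload time are ignored.
    """
    if audio_minutes < 0:
        raise ValueError("audio_minutes must be non-negative")
    hours = audio_minutes / 60
    return (5 * hours, 10 * hours)


low, high = processing_time_minutes(90)
print(f"90 min of audio: roughly {low:.1f} to {high:.1f} min of processing")
```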
Use instead: Otter Live, Google Meet's built-in captions, or Web Captioner for free browser-based live captions.
No public REST API
VexaScribe is a web app for humans, not a backend service. There's no developer API, no SDK, no webhook for programmatic uploads.
Use instead: OpenAI Whisper API ($0.006/min), Deepgram Nova-3 (~$0.0043/min), or AssemblyAI (~$0.012/min).
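For a quick budget comparison, the sketch below multiplies the per-minute prices quoted above by your monthly volume. The prices are taken from this page and two of them are approximate, so check each vendor's current pricing; the function and dictionary names are only for illustration.

```python
from __future__ import annotations

# Per-minute prices as quoted on this page (USD). Treat as approximate.
PRICE_PER_MINUTE_USD = {
    "OpenAI Whisper API": 0.006,
    "Deepgram Nova-3": 0.0043,
    "AssemblyAI": 0.012,
}


def monthly_cost(hours_per_month: float) -> dict[str, float]:
    """Return the estimated monthly cost per provider, rounded to cents."""
    minutes = hours_per_month * 60
    return {
        name: round(price * minutes, 2)
        for name, price in PRICE_PER_MINUTE_USD.items()
    }


for name, cost in monthly_cost(20).items():
    print(f"{name}: ${cost:.2f} for 20 hours of audio")
```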
No video editing
You can export SRT/VTT subtitles to drop into your editor, but VexaScribe won't cut clips, remove filler words, or burn captions onto video.
Use instead: Descript or Vrew for transcript-based video editing; Premiere/Final Cut/DaVinci for traditional NLE workflows.
No custom vocabulary tuning
You can't upload a dictionary of brand names, drug names, or technical jargon to bias the model toward those terms. Whisper is used as-is, with no per-account fine-tuning.
Use instead: AssemblyAI's “word boost” or Deepgram's “keywords” param for proper-noun-heavy domains.
No on-premise / enterprise self-hosting
Audio is processed in our cloud — there's no air-gapped or HIPAA-BAA-signed deployment available. For attorney-client, clinical therapy, or classified content where a breach creates direct legal liability, no cloud tool (ours included) is the right call.
Use instead: install OpenAI Whisper locally (free, runs on your machine, audio never leaves), or for legal-grade 100% accuracy use human transcription (Rev, GoTranscript) at $1.25–$1.99/min.
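If you go the local route, a minimal Python sketch looks like the one below. It assumes you have installed the open-source `openai-whisper` package and ffmpeg; the two helper names are ours, and this is not how VexaScribe itself is implemented.

```python
from __future__ import annotations


def transcribe_locally(path: str, model_name: str = "large-v3") -> list[dict]:
    """Transcribe a file on this machine; the audio is never uploaded."""
    # Imported lazily so the rest of this file works without the package.
    # Requires: pip install openai-whisper (plus ffmpeg on your PATH).
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(path)
    # Each segment is a dict with "start", "end" (seconds), and "text".
    return result["segments"]


def to_lines(segments: list[dict]) -> list[str]:
    """Render segments as '[start-end] text' lines."""
    return [
        f"[{s['start']:.1f}s-{s['end']:.1f}s] {s['text'].strip()}"
        for s in segments
    ]


# Usage (needs a real audio file and a downloaded model):
#   print("\n".join(to_lines(transcribe_locally("interview.mp3"))))
```

Note that the open-source Whisper package produces timestamps but no speaker labels, so diarization would need a separate tool.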
Honest accuracy — what the numbers really mean
VexaScribe uses OpenAI Whisper (specifically large-v3 class models). Marketing pages love to say “99% accuracy” — that's not honest. Real-world Whisper accuracy depends heavily on audio quality, accent, and number of speakers. Here's what to expect.
Transcription accuracy (Whisper)
- Clean studio English, single speaker: ~92–97%
- Accented English (non-native, regional): ~85–92%
- Noisy environments (cafes, phone, outdoor): ~80–90%
- Clean Spanish, French, German, Italian, Portuguese, Dutch: ~88–94%
- Korean, Japanese, Indonesian, Turkish, Arabic, Polish: ~85–92%
Source: Open ASR Leaderboard + Whisper paper benchmarks (LibriSpeech, FLEURS, Common Voice).
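Benchmarks like these usually report word error rate (WER), and an "accuracy" percentage is often just 1 minus WER. Assuming that reading applies to the figures above (the page doesn't spell it out), here is a small Python sketch for checking a transcript against a hand-corrected reference. The function name is ours.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Levenshtein distance over words, keeping one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing (deletion)
                curr[j - 1] + 1,     # extra transcript word (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


reference = "send the quarterly report to the finance team by friday"
transcript = "send the quarterly report to the finance team by monday"
wer = word_error_rate(reference, transcript)
print(f"WER {wer:.0%}, accuracy about {1 - wer:.0%}")
```

One wrong word in ten is already 90% "accuracy", which is why a few points of difference are easy to notice when reading a transcript.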
Speaker diarization accuracy
- 2 speakers, no overlap: 95%+
- 3–4 speakers, occasional overlap: ~88–94%
- 5–6 speakers, meeting dynamics: ~80–90%
- 7–15 speakers, panel or focus group: ~70–82%
- Up to 50 speakers (max supported): variable
Best accuracy with 2–6 distinct speakers. You can rename Speaker 1/2/3 in the editor afterward.
What moves the needle
Three things that matter more than picking the “best” transcription tool:
- A decent mic (USB headset or lapel beats laptop built-in by 5–15 accuracy points).
- One speaker at a time — overlap kills both transcription and diarization.
- Low background noise. Record in a closed room, not next to a fan or HVAC vent.
If you need legal-grade 100% accuracy (court filings, regulated research), use human transcription services like Rev or GoTranscript at $1.25–$1.99/min. AI gets you to ~95% at 1–2% of the cost, which is fine for most use cases and wrong for some.
Core features
99 supported languages
Transcribe audio and video in 99 languages with automatic language detection.
Speaker detection
Automatic diarization identifies and labels different voices. Ideal for interviews and meetings.
Timestamps
Every transcript includes precise timestamps. Click one to jump to that moment.
5 export formats
Export as TXT, DOCX, SRT, VTT, or JSON. Choose the format that best fits your workflow.
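If you export JSON and want subtitles later, converting is straightforward. The Python sketch below assumes each segment has `start`, `end`, `text`, and an optional `speaker` field; those field names are hypothetical, so check them against an actual export before relying on this.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list) -> str:
    """Build SRT text from segment dicts (field names are assumed)."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        text = seg["text"]
        if seg.get("speaker"):
            text = f"{seg['speaker']}: {text}"
        start = srt_timestamp(seg["start"])
        end = srt_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


example = [
    {"start": 0.0, "end": 2.5, "speaker": "Speaker 1", "text": "Hello."},
    {"start": 2.5, "end": 4.0, "text": "Hi there."},
]
print(segments_to_srt(example))
```

VTT is nearly the same format: it starts with a `WEBVTT` header line and uses a period instead of a comma before the milliseconds.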
Fast processing
The AI transcribes in minutes. A 1-hour recording is processed in about 5–10 minutes.
Built-in editor
Review and edit your transcripts directly in the browser. Fix errors and rename speakers.
Meeting bot
Send an AI bot to Zoom, Meet, or Teams. It records, transcribes, and generates structured summaries. Uses 3× credits.
AI summaries
Turn any transcript into key points, action items, and decisions. Included in all paid plans.
Transcript translation
Translate any transcript into 133 languages with Google Translate, at no extra cost.
Bulk Upload — 50 Files at Once
Upload up to 50 audio or video files in one go. All processed in parallel — not one at a time. Mix formats freely and download everything as a ZIP.
Supported formats
Audio formats
Video formats
Export formats (5)
TXT: plain text
DOCX: Word document
SRT: subtitles
VTT: web subtitles
JSON: structured data
Use cases
Meeting transcription
AI bot in Zoom, Meet, or Teams
Podcast transcription
Turn episodes into show notes and blog posts
Interview transcription
Transcribe with speaker detection
Lecture transcription
Turn recordings into study notes
Video to text
Extract transcripts and create subtitles
MP3 to text
Convert audio files into text documents
Audio transcription
General audio-to-text conversion
Powered by advanced AI
VexaScribe uses state-of-the-art speech recognition models trained on millions of hours of audio.
Accuracy on clear audio
Supported languages
Processing time per hour
Feature availability by plan
All plans include a free trial. No credit card required.
| Feature | Free trial | Starter ($2/month) | Pro ($10/month) |
|---|---|---|---|
| Audio and video transcription | ✓ | ✓ | ✓ |
| 99 languages | ✓ | ✓ | ✓ |
| Speaker detection | ✓ | ✓ | ✓ |
| Timestamps | ✓ | ✓ | ✓ |
| Export: TXT, DOCX, SRT, VTT, JSON | ✓ | ✓ | ✓ |
| Transcript translation (133 languages) | ✓ | ✓ | ✓ |
| Built-in editor | ✓ | ✓ | ✓ |
| AI summaries | ✗ | ✓ | ✓ |
| Meeting bot (Zoom, Meet, Teams) | ✗ | ✓ | ✓ |
| Bulk transcription | ✓ | ✓ | ✓ |
Frequently asked questions
Ready to transcribe?
Try VexaScribe free with 30 minutes of transcription. No credit card required.