Try free →
How-to

Transcribe Zoom recording cheap (skip the $20/mo subscription)

May 5, 2026 · 4 min read

Zoom AI Companion is bundled with paid Zoom plans (Pro and above) and gives you a transcript automatically. Otter integrates with Zoom and gives you live captions plus a transcript. Both work — but they assume you record meetings constantly. If you record one Zoom every two weeks, you're paying $20-30/mo for an empty seat.

Here's the cheap path.

Cost comparison for one 1-hour Zoom

PathCostNotes
Zoom Pro (yearly)$15/mo + Zoom planNeed Pro+ to use AI Companion at all
Otter Pro$20/moAuto-joins Zoom meetings, live captions
Read.ai / Fireflies / Avoma$15-30/moMeeting bot joins, transcribes after
Rev (manual upload)$15$0.25/min × 60
LessRec (manual upload)$3$0.05/min × 60

The 4-step workflow

1. Record the Zoom locally

In Zoom: Settings → Recording → Record to this computer. The host gets the option to start recording mid-call. After the meeting, Zoom drops three files into your local Recordings folder: video_*.mp4, audio_only.m4a, and a chat.txt.

You only need the audio_only.m4a file. It's about 30-50 MB per hour at default Zoom audio quality (44 kHz mono).

2. (Optional) compress further if file is huge

Zoom recordings are sometimes saved at high bitrate. To shrink:

ffmpeg -i audio_only.m4a -b:a 64k -ac 1 zoom-meeting.mp3

1-hour Zoom audio drops to ~28 MB. No accuracy loss for Whisper.

3. Upload to LessRec

Open lessrec.com, drag-drop the file. First 10 minutes free without signup. For a 1-hour file, buy a $5 pack (100 minutes, no expiry) — leaves you with 40 minutes credit for the next Zoom.

4. Wait 1-2 minutes, download

Whisper large-v3 INT8 transcribes 1 hour of clean Zoom audio in ~50-90 seconds. You get .txt, .docx, and .srt. Open the .docx in Google Docs or Word, drop a few section headers, and you have a meeting note.

What about live captions during the call?

If you need real-time captions while the meeting is happening — LessRec doesn't do that. Use Zoom's built-in live transcription (free for hosts on paid plans) or Otter's live caption feature.

If you only need the transcript after the meeting (the more common case), the upload-and-wait flow is the right tool.

Speaker labels (who said what)

Zoom AI Companion attributes lines to participant names because it knows who's on the call. LessRec doesn't have that context — it transcribes the audio file alone, so all speakers get merged into one text stream.

Workaround: skim the transcript, add speaker names manually based on the audio (5-10 minutes for a 1-hour meeting). Not ideal, but if you only do this occasionally it's faster than paying $20/mo for the rest of the year. Speaker diarization (auto-labeling) is on the LessRec roadmap for Q3 2026.

Privacy: where does the audio go?

Your file lives on a Hetzner server in Helsinki (EU GDPR jurisdiction) for 7 days, then deletes automatically. We don't train on your data, ever. We don't sell it, don't share it, don't index it.

If you're transcribing sensitive meetings (HR conversations, legal calls, founder discussions) and need stricter compliance — email hello@lessrec.com for an EU-only enterprise tier with shorter retention.

When Zoom AI Companion is better

When LessRec is better

Try with 10 free minutes

Drop your Zoom audio file, get text in under 90 seconds.

Upload now →

FAQ

Can I just upload the .mp4 video?

Yes — LessRec auto-extracts audio from .mp4, .mov, .webm. But uploading the audio-only .m4a is faster (smaller file, faster upload).

What if my Zoom recording is on Zoom Cloud, not local?

Open Zoom web, go to Recordings, download the audio file, then upload to LessRec. Zoom Cloud doesn't expose a public URL we can pull from directly.

What if the meeting was in Spanish/Russian/Chinese/etc?

Whisper handles 100+ languages. Just upload — it auto-detects. For low-resource languages (Tamil, Bengali) try the 10 free minutes first to verify accuracy.

Multiple speakers all in one audio track — does Whisper handle it?

Yes, but no speaker labels (yet). All voices merge into one transcript. For a 2-3 person Zoom meeting it's usually obvious from context who's speaking when.