Screen Recording with Real-time Dual Speaker Subtitles
English · Chinese · Spanish · French · German · Japanese · Korean · Arabic · and 95+ more
Features
Capture system audio and microphone input simultaneously. CloudListen distinguishes "System" and "Me" as two separate speakers for clear conversation records.
Powered by Whisper AI, CloudListen transcribes speech in over 100 languages including English, Mandarin, Spanish, French, German, Japanese, Arabic, and more — with automatic language detection.
Run whisper-server.py locally on your machine. All transcription happens on-device — no audio data leaves your machine, ensuring complete privacy.
Real-time subtitles appear directly in Chrome's side panel. Non-intrusive, always accessible, and synchronized with your recording.
Choose between Deepgram Nova-2 or AssemblyAI Universal for cloud transcription. Both offer real-time streaming with automatic connection handling.
Auto-saves up to 5 recording sessions. Select multiple sessions and export them as Markdown files for later review.
Multilingual
Whisper AI powers CloudListen's local transcription engine. It was trained on 5 million hours of multilingual audio and supports automatic language detection for 100+ languages.
How It Works
FAQ
CloudListen supports real-time transcription in over 100 languages via Whisper AI. Includes all major world languages: English, Chinese (Mandarin and Cantonese), Spanish, French, German, Japanese, Korean, Arabic, Portuguese, Russian, Hindi, and many more. CloudListen automatically detects the spoken language.
Yes. When you run whisper-server.py locally, all transcription happens on your machine. No audio data is sent to external servers. This makes CloudListen suitable for privacy-sensitive environments.
CloudListen captures two audio sources simultaneously: your system audio (e.g., a video call, lecture, or presentation) and your microphone input. Each source is transcribed separately and displayed with distinct speaker labels ("System" and "Me") in the Chrome side panel.
CloudListen supports Deepgram Nova-2 and AssemblyAI Universal for cloud transcription. Both provide real-time streaming with automatic connection handling. For fully offline operation, use local Whisper AI via whisper-server.py.
Setup Guide
Click the CloudListen extension icon. Choose one of two modes:
Download and run the Whisper server for fully offline transcription:
pip install faster-whisper flask
python3 whisper-server.py
Server starts at http://localhost:8180. Extension auto-detects it in Local Whisper mode.
All audio stays on your machine — complete privacy.
Click "Start Recording" in the Chrome side panel. Enable audio sharing in the tab picker. System audio and microphone are captured simultaneously — displayed as "System" and "Me" subtitles in real-time.
Click "Select to Export" in the side panel. Choose sessions and export as Markdown files for later review.