On-device transcription
Apple silicon native ASR + diarization. The audio never leaves your laptop.
Private, on-device transcription with an AI that knows what was said, who said it, and exactly when — and cites every line.
macOS 15+ · Apple silicon · Free during beta
No upload. No third party in the room. Nothing leaves your machine until you ask it to.
kiromi notices the call start — Zoom, Meet, Teams, FaceTime — and puts a recording offer in your menu bar. One click and you're capturing.
FluidAudio transcribes locally on Apple silicon at 1.2× realtime. Mic and system audio stay on separate tracks so speakers stay distinct.
Ask anything. Each answer cites the exact transcript line — click to play the original audio. Filter by person, project, or date.
FluidAudio's neural transcription runs natively on Apple silicon. Audio, transcripts, and voice profiles stay on your machine — no upload, no third-party processing, no audio crossing the internet unless you explicitly turn on cloud features.
Ask anything across all your meetings. Each claim links back to the exact transcript timestamp — click and you're playing the original audio. Scope conversations to a meeting, person, or date range.
Tag a speaker once and kiromi extracts a voice embedding stored in your macOS keychain. Future meetings auto-attribute that voice — across Zoom, Meet, Teams, FaceTime, and the phone.
Each feature is small, focused, and works whether the network is up or not.
Apple silicon native ASR + diarization. The audio never leaves your laptop.
Every claim is hyperlinked to the exact moment in the recording.
Tag once, recognized forever. Encrypted in the keychain, deletable in one click.
Spotted automatically and tracked across meetings until they're closed.
Pulls invite metadata to give speakers their right names from the start.
Find any phrase, scope by person or project, jump to the audio.
See where each speaker spoke and what they said — at a glance.
Markdown, JSON, plain text. Your transcripts are your transcripts.
Disable network entirely and detection, recording, transcription, and search keep working.
Voice prints are biometric data, and we treat them that way: encrypted at rest in the macOS keychain, local by default, deletable in one click, with a signed consent receipt for every recording.
A few representative quotes from the private beta. Identifying details have been blurred to keep things polite.
"I stopped pretending to take notes. The cited answers are the unlock — every claim links back to the exact moment in the call."
"We had to verify that nothing left the machine before we could run it. kiromi's network panel and the airplane-mode test sold our security team."
"The voice-profile stuff is uncanny. It correctly attributes my co-founder across calls she joined on three different devices."