Vibing — Just Speak It!

Powered by VibeVoice
More thanyour intelligent voice input method, but alsoyour intention expression assistant, andyour interface to the AI-native world.

How to Use

Transcription

Press Right ⌥ Option (Mac) or Ctrl + Win (Win) to start / stop recording

Listening...
Recognizing...
Done
Copied to Clipboard
⌘V / Ctrl+V — Paste anywhere
① Press to Start ② Recording ③ Press Again to Stop ④ Recognizing ⑤ Done ⑥ Copied to Clipboard
Mac
~
1
2
3
4
5
6
7
8
9
0
-
=
del
tab
Q
W
E
R
T
Y
U
I
O
P
[
]
\
caps
A
S
D
F
G
H
J
K
L
;
'
return
shift
Z
X
C
V
B
N
M
,
.
/
shift
fn
↑↓
Windows
~
1
2
3
4
5
6
7
8
9
0
-
=
Bksp
Tab
Q
W
E
R
T
Y
U
I
O
P
[
]
\
Caps
A
S
D
F
G
H
J
K
L
;
'
Enter
Shift
Z
X
C
V
B
N
M
,
.
/
Shift
Ctrl
⊞ Win
Alt
Alt
Menu
Ctrl

How to Cancel

Press ESC at any stage to immediately cancel — nothing is transcribed or copied

Listening...
Recognizing...
✕ Cancelled
① Press to Start ② Recording ③ Press ESC to Cancel ④ Cancelled
Mac
esc
~
1
2
3
4
5
6
7
8
9
0
-
=
tab
Q
W
E
R
T
Y
U
I
O
P
[
]
\
caps
A
S
D
F
G
H
J
K
L
;
'
return
shift
Z
X
C
V
B
N
M
,
.
/
shift
fn
↑↓
Windows
esc
~
1
2
3
4
5
6
7
8
9
0
-
=
Tab
Q
W
E
R
T
Y
U
I
O
P
[
]
\
Caps
A
S
D
F
G
H
J
K
L
;
'
Enter
Shift
Z
X
C
V
B
N
M
,
.
/
Shift
Ctrl
⊞ Win
Alt
Alt
Menu
Ctrl
More Usage Methods →

Installation Guide

Step-by-step setup instructions for macOS — accessibility, screen recording & microphone permissions.

Mac Setup Guide →

Video Introduction

Key Features

Long-Form Voice Input

Over 5 minutes of continuous speech in a single recording.

Personalized Hotwords

Custom vocabulary for names, jargon, and domain-specific terms.

Context-Aware Intent Understanding

Understands what you mean, not just what you say.

Multilingual

Speak in any of 50+ languages with automatic detection.

Mixed-Language Input

Switch between languages freely within a single sentence.

LLM-Powered Rewriting

AI rewrites your speech into polished, context-appropriate text.

Translation

Real-time voice translation across languages.

Works Everywhere You Type

Vibe Coding

src
train.py
model.py
dataset.py
configs
config.yaml
# Training Pipeline import torch import torch.nn as nn from torch.utils.data import DataLoader from model import VoiceTransformer device = torch.device("cuda" if torch.cuda.is_available() else "cpu") model = VoiceTransformer(d_model=512, nhead=8) model.to(device) optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
Claude Code ~/project
>
Listening...
Recognizing...
Vibing…
Done

Presentation

Q1 Business Review
FY2026 · Quarterly Performance
Revenue grew 23% YoY

Q1 Business Review

FY2026 · Quarterly Performance

Revenue grew 23% year over year

Listening...
Recognizing...
Vibing…
Done

Document

HomeInsertDesignLayoutReferencesReview

Q1 Product Planning — Meeting Notes

Date: March 25, 2026
Attendees: Product, Engineering, Design

1. Core Objectives
This quarter we focus on cross-platform voice input,
ensuring seamless typing across VS Code, Office, and
presentation environments.

2. Key Decisions

Listening...
Recognizing...
Vibing…
Done

Chat & Message

💬
👥
📅
📁
Engineering
general
voice-engine
frontend
releases
# general
A
Alice Chen10:32 AM
新的语音引擎通过了所有集成测试,可以开始 review 了!
B
Bob Kim10:45 AM
太好了!下午我来看 PR,有没有 breaking changes?
Y
YouNow
Type a message...
Recording (Translate)
Recognizing...
Translating…
Done

Privacy

Data processing: To provide more accurate transcription, context-aware rewriting, and translation results, Vibing sends your audio and contextual information (such as screenshots, text in the active input field, and the current application name) to our servers. This data is used solely to process your request and return results. It is not retained after processing is complete.

Privacy commitment: Your data is never stored or used for model training, analytics, or any other purpose beyond fulfilling your immediate request.