Voice Input
Voice Input is one of Owlfy’s core features. Triggered by clicking the voice key, it provides an intelligent text input experience that goes far beyond simple transcription.
Operation Method
Click the voice key → Start speaking → Click again to end → Owlfy applies role polishing → Output appears at the cursor position.
Default Voice Keys:
- Windows: Right Alt key or Mouse wheel
- macOS: Fn key or Mouse wheel
You can customize the voice key in Settings.
Input Flow
Click voice key → Speak → Click again → Role polishing applied → Text output
This flow ensures your spoken words are refined before being entered, making voice input suitable for professional and formal contexts.
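The toggle behavior of this flow can be pictured as a two-state machine. The sketch below is only an illustration, not Owlfy's implementation: `VoiceInputSession`, `press_voice_key`, `hear`, and the `polish` callback are hypothetical names, and the lambda merely stands in for role polishing.

```python
from enum import Enum, auto
from typing import Callable, List


class State(Enum):
    IDLE = auto()
    RECORDING = auto()


class VoiceInputSession:
    """Hypothetical model of the click-to-start, click-to-stop flow."""

    def __init__(self, polish: Callable[[str], str]) -> None:
        self.state = State.IDLE
        self.polish = polish          # stand-in for role polishing
        self.heard: List[str] = []    # speech captured during this recording
        self.output: List[str] = []   # text "typed" at the cursor

    def press_voice_key(self) -> None:
        if self.state is State.IDLE:
            # First click: start a fresh recording.
            self.heard = []
            self.state = State.RECORDING
        else:
            # Second click: stop, polish, then emit the text.
            raw = " ".join(self.heard)
            self.output.append(self.polish(raw))
            self.state = State.IDLE

    def hear(self, words: str) -> None:
        # Speech only counts while a recording is in progress.
        if self.state is State.RECORDING:
            self.heard.append(words)


# Assumed toy polisher: trim, capitalize, add a full stop.
session = VoiceInputSession(polish=lambda text: text.strip().capitalize() + ".")
session.press_voice_key()                  # start recording
session.hear("free for dinner tomorrow")
session.press_voice_key()                  # stop, polish, output
print(session.output)
```

Because polishing runs on the second click, nothing reaches the cursor until the recording has ended, which matches the order of steps in the flow.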
Role Polishing
Choose from multiple roles to polish your voice input. This transforms casual speech into polished, context-appropriate text.
Built-in Roles
| Role | Effect | Example |
|---|---|---|
| Oral Correction | Converts spoken language into formal written language | “You know, I think we should probably…” → “I believe we should consider…” |
| Workplace Professional | Rewrites casual speech into polished, high-EQ workplace expressions | “Free for dinner tomorrow?” → “Would you be available for dinner tomorrow evening?” |
| Custom Role | Create your own roles for personalized needs | Define any polishing rules you need |
Creating Custom Roles
- Click “Role Management” in the voice input interface
- Click “Add Role”
- Set role name and polishing rules
- Save and use
Multi-language Output
Specify an output language to have your speech translated in real time as you record.
Example:
- Speak in Chinese: “明天一起开会讨论一下这个方案”
- Set output language to English
- Output: “Let’s meet tomorrow to discuss this proposal.”
This is perfect for:
- Cross-language communication
- Real-time interpretation during meetings
- Quickly drafting messages in foreign languages
Smart Snippets
Map long text to short phrases for quick input.
How it works:
- Set a phrase mapping in Settings, e.g.:
  - Phrase: “Shipping address”
  - Full text: “123 Main Street, New York, NY 10001, (555) 123-4567”
- During voice input, say: “Shipping address”
- The full text is entered instantly
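Conceptually, the steps above amount to a dictionary lookup from a spoken phrase to its full text. The following is a minimal sketch under stated assumptions: the `SNIPPETS` table, `normalize`, and `expand` are hypothetical names, and matching the whole utterance (ignoring case and surrounding punctuation) is a guess at reasonable behavior, not a description of how Owlfy matches phrases.

```python
import string

# Hypothetical snippet table: spoken phrase -> full text.
SNIPPETS = {
    "Shipping address": "123 Main Street, New York, NY 10001, (555) 123-4567",
}


def normalize(phrase: str) -> str:
    """Trim whitespace and surrounding punctuation, then lowercase."""
    return phrase.strip().strip(string.punctuation).strip().lower()


# Build the lookup once, keyed by the normalized phrase.
_LOOKUP = {normalize(key): value for key, value in SNIPPETS.items()}


def expand(transcript: str) -> str:
    """Return the mapped text if the whole utterance is a snippet phrase,
    otherwise return the transcript unchanged."""
    return _LOOKUP.get(normalize(transcript), transcript)


print(expand("shipping address."))
print(expand("Hello there"))
```

Normalizing both the stored phrase and the transcript keeps small recognition differences, such as a trailing period or different capitalization, from breaking the match.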
Common use cases:
- Shipping/billing addresses
- Company boilerplate text
- Frequently used email signatures
- Standard legal disclaimers
- Complex code snippets
Privacy Guarantee: Snippets are stored locally with no network transmission.
Voice Input Settings
Microphone
- Select your recording device
- Test microphone levels
- Ensure clear audio input
Mute System During Recording
When enabled, system audio is automatically muted during voice input and restored afterward. This prevents background sounds from interfering with recognition.
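The mute-then-restore behavior follows a familiar save/restore pattern. The sketch below illustrates that pattern only: `FakeSystemAudio` is a hypothetical stand-in that tracks a mute flag, not a real OS audio API, and `muted_during_recording` is an assumed name rather than anything Owlfy exposes.

```python
from contextlib import contextmanager
from typing import Iterator


class FakeSystemAudio:
    """Hypothetical stand-in for an OS audio API; it only tracks a mute flag."""

    def __init__(self, muted: bool = False) -> None:
        self.muted = muted


@contextmanager
def muted_during_recording(audio: FakeSystemAudio, enabled: bool = True) -> Iterator[None]:
    previous = audio.muted            # remember the state before recording
    if enabled:
        audio.muted = True
    try:
        yield
    finally:
        # Restore the earlier state even if recording fails part-way.
        audio.muted = previous


audio = FakeSystemAudio(muted=False)
with muted_during_recording(audio):
    state_while_recording = audio.muted   # muted while the block runs
state_after_recording = audio.muted       # previous state is back afterwards
```

Restoring the saved state, rather than always unmuting, means a system that was already muted before recording stays muted afterward.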
Voice Bar Sound Effects
Play subtle sounds when the voice bar appears and disappears, giving you clear auditory feedback on recording state.
Usage Tips
Speak Punctuation
You can speak punctuation marks directly:
"Hello comma are you there question mark" → "Hello, are you there?"
Number Input
"One two three" → "123"
"Twelve thirty" → "12:30"
Quick Editing Commands
While recording, you can speak editing commands:
- "Delete last sentence"
- "Clear all"
- "New line"
Privacy Guarantee
Voice input content is processed in real time and not retained long-term. Your voice data is:
- Used only for the current recognition session
- Not stored on servers
- Never used for model training
Related Documentation
- Sonic Execution: Learn about Sonic Execution
- AI Assistant: Learn about AI Assistant