Skip to content

Feature Guide

Owlfy is a next-generation Voice AI Operating System that redefines how humans interact with computers through full voice interaction. Command your computer with natural language and let AI seamlessly integrate into daily operations for 10x efficiency gains.

Product Slogan: Speak. It’s Done.


Owlfy’s core interactions are completed through the voice key:

OperationFunctionUse Case
Click voice keyLaunch “Voice Input”Voice-to-text input
Hold voice keyLaunch task executionSonic Execution (quick tasks) or AI Assistant (complex tasks)

Default Voice Keys:

  • Windows: Right Alt key or Mouse wheel
  • macOS: Fn key or Mouse wheel

You can customize the voice key in Settings.

When it’s inconvenient to speak, use the text command bar:

  • macOS: Option + Space
  • Windows: Ctrl + Space

Type your request and press Enter — equivalent to holding the voice key.


Triggered by holding the voice key and speaking a quick command. Instantly completes computer operations.

Say the name to instantly open apps, websites, or directories.

TypeExampleResult
Application”Open WeChat”Launches WeChat
Website”Open GitHub”Opens github.com
Directory”Open Downloads”Opens Downloads folder

Custom Aliases: Set your own aliases for apps, websites, and directories in Settings.

Say search content and specify a search engine for quick results.

CommandEffect
”Search for iPhone 16 review”Opens default search engine
”Google search climate change”Searches Google
”Bing search AI trends”Searches Bing

Select text, hold the voice key, and speak your need. AI processes instantly.

InputCommandOutput
”Boss, I can’t do this project, it’s too hard.""Rewrite this in a professional tone""I’m encountering some technical roadblocks with this project and would appreciate discussing potential resources or alternative approaches."
"Do it yourself.""Make this sound more tactful""Could you please handle this yourself?"
"Had amazing hotpot today and it totally made my day.""Turn this into an Instagram caption""Hotpot kind of night and absolutely no regrets. So good, so comforting. 🍲✨ #FoodieFinds #HotpotLove”
InputCommandOutput
”Let’s review the design at 10 AM tomorrow. Bob, print 3 copies. Alice, book room 2.""List the action items”• 10:00 AM: Design review
• Bob: Print 3 copies
• Alice: Book Room 2
”Q3 revenue increased 20% year over year thanks to strong cloud growth, but margins dipped slightly because of higher R&D spending.""Make this shorter""Cloud growth lifted Q3 revenue, but higher R&D spending hurt margins.”
InputCommandOutput
const sum = arr.reduce((a, b) => a + b, 0);”Explain this code""This uses the reduce method to iterate over ‘arr’, accumulating the sum of its elements starting from 0.”
const sum = arr.reduce((a, b) => a + b, 0);”Turn this JavaScript into Python”sum_result = sum(arr)
InputCommandOutput
”FOMO""What does this mean?""Fear of missing out — the anxious feeling that other people may be having rewarding experiences without you."
"Despite of the heavy rain, but we still arrived on time for the meeting.""Fix the grammar""Despite the heavy rain, we still arrived on time for the meeting.”
InputCommandOutput
”Bob expensed $150 for taxi, Alice spent $200 on meals, and John paid $80 for taxi.""Turn this into a table”| Name | Type | Amount |
| --- | --- | --- |
| Bob | Taxi | $150 |
| Alice | Meals | $200 |
| John | Taxi | $80 |

Select one or more files, hold the voice key, and speak your requirements. AI quickly processes them.

Image Processing: Compress, convert formats, remove backgrounds, adjust resolution, OCR, flip, rotate, crop, color adjustment, multiple images to GIF/PDF.

Audio/Video Processing: Compress, add watermark, speed up/slow down, smart silence removal, concatenate, format conversion, extract audio, frame extraction, voice recognition.

Document Processing: Merge/split PDF, PDF watermark, PDF to image, Markdown to PDF/Word, draw charts, edit Excel/Word/PPT, AI generate PPT, document to Markdown.


Triggered by clicking the voice key. Not just transcription — an intelligent text input experience.

  1. Click the voice key → Start speaking
  2. Click again to end
  3. Owlfy applies role polishing
  4. Output appears at the current cursor position

Choose from multiple roles to polish your voice input:

RoleEffect
Oral CorrectionConverts spoken language into formal written language
Workplace ProfessionalRewrites casual speech into polished, high-EQ workplace expressions
Custom RoleCreate your own roles for personalized needs

Example:

  • Say: “Free for dinner tomorrow?”
  • Select “Workplace Professional” → Output: “Would you be available for dinner tomorrow evening?”

Specify the output language to achieve real-time translation while recording.

Example:

  • Say in Chinese: “明天一起开会讨论一下这个方案”
  • Set output language to English → Output: “Let’s meet tomorrow to discuss this proposal.”

Map long text to short phrases for quick input.

Example:

  • Set: “123 Main Street, New York, NY 10001, (555) 123-4567” = “Shipping address”
  • Say: “Shipping address” → Full address and contact info is entered instantly

Privacy Guarantee: Snippets are stored locally with no network transmission.


Triggered by holding the voice key and describing a complex task. Owlfy has a powerful AI scheduling engine that completes tasks through planned execution.

Intelligently invokes a vast array of Skills to handle:

  • Images, Audio & Video: Format conversion, compression, cropping, smart editing, material extraction, dubbing, etc.
  • Documents: Writing, summarizing, translating, rewriting, merging, splitting, format conversion, etc.
  • System Operations: Organizing files, cleaning junk, shutdown/lock screen, etc.
  • Web Automation: Searching, extracting web content, downloading resources, automating web operations, etc.

Breaks large tasks into executable plans, then tackles them one by one through code, CLI commands, APIs, MCP, or Skills — delivering the final result.

Example:

  • Task: “Summarize the key points from all PDFs in my Downloads folder and create a Word document”
  • AI Assistant plan:
    1. Scan Downloads folder for PDF files
    2. Extract text from each PDF
    3. Summarize key points
    4. Generate a formatted Word document

For tasks that can be completed locally through programs, scripts, or tools — such as document processing, image processing (non-AIGC), and audio/video processing (non-AIGC) — AI Assistant prioritizes local execution. Files are not uploaded to the cloud, ensuring data security.


Let your computer learn to work “automatically and spontaneously.”

Supports one-time or recurring tasks.

Example Scenarios:

  • Check and summarize important emails every hour
  • Auto shutdown at 21:00 every day
  • Every Monday at 9:00 AM: Summarize last week’s meeting notes and create this week’s to-do list

Break spatial limits. Send tasks via mobile instant messaging apps (such as WhatsApp, Messenger). The Owlfy assistant completes tasks on your controlled computer and feeds results back to your phone in real time.


SettingDescription
Run on system startupStart Owlfy automatically when the computer boots
Mouse wheel as voice keyUse mouse middle button as the voice shortcut
Keyboard voice keySet which key triggers voice functions (Mac default: Fn, Windows default: Right Alt)
Text command bar shortcutSet shortcut for the text command bar
Screenshot shortcutSet shortcut for the screenshot tool
Interface languageSwitch software display language
Appearance themeLight, Dark, or follow system
System permissionsView and enable Accessibility, Screen Recording, Full Disk Access (mainly for macOS)
SettingDescription
Clipboard shortcutSet shortcut to open clipboard history
History countView current clipboard history capacity
SettingDescription
AI Assistant sandbox modeDetermines the scope of operations AI can perform on your computer
AI Assistant WorkspaceIn restricted mode, AI can only read/write files in the specified folder
AI Assistant approval policyWhether AI asks for confirmation before high-risk or restricted operations
Confirm before AI Assistant tasksWhen enabled, tasks sent to AI Assistant require confirmation before execution
SettingDescription
MicrophoneSelect recording device and test microphone
Mute system during recordingAutomatically mute system sound during voice input to avoid interference
Voice bar sound effectsPlay sounds when voice bar appears and disappears
SettingDescription
Login/LogoutManage account login status
Membership infoView membership status, validity, and renewal options
My devicesView devices logged into your account
Edit device nameRename devices for easier identification
Refresh device listRe-fetch latest device status
Remove deviceRemove a specific device from your account
Redemption codeEnter codes to redeem points or membership benefits
SettingDescription
Current versionView version number
Check for updatesManually check for new versions
Install updateInstall when a new version is downloaded
Official homepage / social mediaLinks to official website and social accounts

Owlfy follows the principle of “Local-First, On-Demand, No Excessive Retention.”

Your personal settings are saved locally, not stored long-term on servers.

  • Local content: Aliases, smart snippets, custom roles, etc.
  • More controllable: Content follows the current device, not becoming a platform-side long-term profile.
  • Device change reminder: Re-setup is usually needed when changing computers.

Usage history is kept locally by default.

  • Local-first: Chat history and operation logs are saved only on your device.
  • Easy to trace: Conveniently review past actions and conversations.
  • Reduced leakage: History does not automatically become long-term cloud archives.
  • For real-time recognition only: Voice content is used only for the current recognition and processing session.
  • No long-term retention: Voice data is not kept long-term.
  • Not used for model training: Voice content is never used to train any model.
  • Permissions serve functions: Used for reading selected text, screenshots, voice input, etc.
  • Not unlimited collection: Granting permission does not mean continuous monitoring.
  • On-demand activation: Permissions only take effect when you use the corresponding feature.
  • Local-first storage for personal settings and history
  • Minimal necessary use — data only used for the current function
  • Voice is not retained — only for real-time recognition
  • No participation in training — voice data not used for model training
  • Perceivable permissions — users know what permissions are needed and why
  • Only enable necessary permissions
  • Check content before sharing conversations or screenshots
  • Periodically clean up local records
  • Allow time for re-setup when changing devices with many local configurations

Owlfy’s privacy policy is not “collect as much as possible,” but:

  • Store locally when possible
  • Use only for the current function
  • Voice is for recognition only — not saved, not trained

Owlfy uses a stamina system to manage AI feature usage quotas.

ItemDescription
Stamina UsageUsed for AI-related operations
Daily Stamina50 stamina gifted daily
Stamina AcquisitionEarn extra stamina through referrals, etc.
MethodReward
Daily check-in50 stamina/day
Invite friends1000 stamina/person
Promote OwlfyUp to lifetime membership

ItemDescription
Supported PlatformsWindows, macOS (Intel/Apple Silicon), Linux
Developer ToolsBuilt-in DevTools for debugging
MCP EcosystemSupports installing third-party MCP skills

Find “App Management” in Settings to add custom aliases for apps.

Find “Website Management” in Settings to add URLs and aliases.

Click “Role Management” in the voice input interface to add exclusive roles.

Wait for the next day’s gifted stamina, or earn extra stamina by inviting friends.


  • Current Version: 2.0
  • Release Date: 2026
  • Developer: Silicon Geek Co., Limited