Mallo glossary

Clear definitions for Mac dictation, voice typing, speech recognition, cleanup passes, and the workflow language around using Mallo well.

Start here

Core terms for the Mallo workflow

Mac Dictation · 2 min read

Dictation

Dictation is a speech-to-text workflow where spoken words are converted into written text.

Speech Recognition · 2 min read

Local-First Speech Recognition

Local-first speech recognition keeps audio processing on your device by default instead of sending every utterance to a remote server.

Speech Recognition · 2 min read

Speech Recognition

Speech recognition is the system that turns spoken audio into text tokens a computer can work with.

Mac Dictation · 2 min read

Voice Typing

Voice typing means speaking instead of pressing keys so spoken words become typed text inside an app.

All terms

Vocabulary for setup, typing, and cleanup

12 published terms

C

Voice Workflow · 2 min read

Cursor Insertion

Cursor insertion means generated text lands directly at the active caret position inside the app you are already using.

D

Mac Dictation · 2 min read

Dictation

Dictation is a speech-to-text workflow where spoken words are converted into written text.

Text Cleanup · 2 min read

Dictionary Replacement

Dictionary replacement is a rule-based text cleanup step that swaps known terms into the forms you want after speech is recognized.
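
A rule-based replacement pass of this kind can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not Mallo's actual implementation: the `REPLACEMENTS` table, the `apply_dictionary` function, and the choice of case-insensitive whole-word matching are all hypothetical.

```python
import re

# Hypothetical rules: recognized form -> preferred form.
REPLACEMENTS = {
    "whisper cpp": "whisper.cpp",
    "mac os": "macOS",
    "git hub": "GitHub",
}

def apply_dictionary(text: str, rules: dict) -> str:
    """Swap known terms into their preferred forms (whole words, case-insensitive)."""
    if not rules:
        return text
    # Try longer terms first so a multi-word rule wins over a shorter overlap.
    keys = sorted(rules, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(k) for k in keys) + r")\b",
        re.IGNORECASE,
    )
    # Look up matches by lowercase key, since matching ignores case.
    lookup = {k.lower(): v for k, v in rules.items()}
    return pattern.sub(lambda m: lookup[m.group(0).lower()], text)
```

For example, `apply_dictionary("I pushed it to git hub from Mac OS", REPLACEMENTS)` yields `"I pushed it to GitHub from macOS"`. Running the pass after recognition keeps the rules independent of whichever speech model produced the text.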

H

Voice Workflow · 1 min read

Hold-to-Talk

Hold-to-Talk means dictation runs only while you keep a shortcut pressed, giving you tight start-and-stop control.

L

Speech Recognition · 2 min read

Local-First Speech Recognition

Local-first speech recognition keeps audio processing on your device by default instead of sending every utterance to a remote server.

M

Mac Dictation · 2 min read

Multilingual Dictation

Multilingual dictation means a speech-to-text workflow can handle more than one language in everyday writing.

S

Speech Recognition · 2 min read

Speech Recognition

Speech recognition is the system that turns spoken audio into text tokens a computer can work with.

Speech Recognition · 1 min read

Speech Model

A speech model is the engine that predicts text from audio and largely determines speed, language fit, and accuracy tradeoffs.

Speech Recognition · 1 min read

Speech-to-Text

Speech-to-text is the process of converting spoken audio into written text.

T

Voice Workflow · 1 min read

Toggle Dictation

Toggle dictation starts with one shortcut press and keeps listening until the user stops it with another action.
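
The difference from Hold-to-Talk can be shown as a small state machine driven by shortcut key events. This is a hedged sketch, not Mallo's code: the `DictationTrigger` class and the `"hold"` and `"toggle"` mode names are hypothetical, and it assumes the stopping action in toggle mode is a second press of the same shortcut.

```python
from dataclasses import dataclass

@dataclass
class DictationTrigger:
    """Tracks whether dictation is listening, given shortcut key events."""
    mode: str              # "hold" or "toggle" (hypothetical mode names)
    listening: bool = False

    def key_down(self) -> None:
        if self.mode == "hold":
            # Hold-to-Talk: listen for as long as the key stays down.
            self.listening = True
        elif self.mode == "toggle":
            # Toggle: each press flips the state (assumed stop action).
            self.listening = not self.listening

    def key_up(self) -> None:
        if self.mode == "hold":
            # Releasing the key ends dictation in hold mode.
            self.listening = False
        # In toggle mode, releasing the key changes nothing.
```

In hold mode, a press-and-release pair brackets one utterance. In toggle mode, the release is ignored, so listening continues until the next press.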

V

Mac Dictation · 2 min read

Voice Typing

Voice typing means speaking instead of pressing keys so spoken words become typed text inside an app.

W

Speech Recognition · 1 min read

whisper.cpp

whisper.cpp is an on-device inference runtime used to run Whisper-family speech models locally.
