All Tips
Tool Update2026-03-25
Microsoft builds own AI models: MAI-Transcribe beats Whisper
Microsoft's new MAI team (under Mustafa Suleyman, ex-DeepMind) has introduced three models:
MAI-Transcribe-1 — Speech to text in 25 languages, faster and more accurate than OpenAI's Whisper. Ideal for meeting minutes.
MAI-Voice-1 — Text to speech, generates 60 seconds of audio in one second. Natural-sounding voices.
MAI-Image-2 — Image generation as an alternative to DALL-E.
What this means: Microsoft is distancing itself from pure OpenAI dependency. For users this means more choice and potentially lower prices on Azure.
Tool: Microsoft MAI Models
ToolsMicrosoftSprache
Share: