Microsoft AI Launches Three New Foundational Models to Compete in the LLM Market

New Model Portfolio

Microsoft AI, the research laboratory of the technology giant, has unveiled three new foundational AI models. The suite includes MAI-Transcribe-1, a speech‑to‑text system; MAI-Voice-1, an audio‑generation engine; and MAI-Image-2, an image‑generation model. All three models are now accessible through Microsoft Foundry, with the transcription and voice models also available in MAI Playground.

Performance and Capabilities

MAI-Transcribe-1 can transcribe speech in 25 different languages and is reported to be 2.5 times faster than Microsoft’s Azure Fast offering. MAI-Voice-1 can generate 60 seconds of audio in a single second and supports the creation of custom voice profiles. MAI-Image-2, initially released on MAI Playground on March 19, covers image generation in Microsoft’s multimodal AI lineup.

Strategic Positioning

The launch signals Microsoft’s continued push to develop its own stack of multimodal AI models and to compete with rival AI labs, even as it remains tied to OpenAI. The models were developed by the MAI Superintelligence team, an AI research group that was formed and announced in November 2025 and is led by Mustafa Suleyman, the CEO of Microsoft AI. Suleyman emphasized a “Humanist AI” approach that puts humans at the center and focuses on practical communication use cases.

Microsoft positions the new models as cost‑effective alternatives to offerings from Google and OpenAI, aiming to attract developers seeking affordable, high‑performance AI services.

Pricing and Availability

The models are priced to undercut competing solutions. MAI-Transcribe-1 starts at $0.36 per hour of audio, MAI-Voice-1 begins at $22 per 1 million characters, and MAI-Image-2 is priced at $5 per 1 million tokens of text input and $33 per 1 million tokens of image output.
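
For readers who want to turn these rates into a budget, the arithmetic can be sketched as follows. This is a rough illustration only: it assumes the quoted starting prices apply linearly with no tiers, minimums, or discounts, and the function names are hypothetical rather than part of any Microsoft API.

```python
# Starting prices quoted in the article, in USD.
# Assumption: simple linear billing; real invoices may differ.
TRANSCRIBE_USD_PER_HOUR = 0.36
VOICE_USD_PER_MILLION_CHARS = 22.0
IMAGE_INPUT_USD_PER_MILLION_TOKENS = 5.0
IMAGE_OUTPUT_USD_PER_MILLION_TOKENS = 33.0


def transcribe_cost(audio_hours: float) -> float:
    """Estimated MAI-Transcribe-1 cost for a number of audio hours."""
    return audio_hours * TRANSCRIBE_USD_PER_HOUR


def voice_cost(characters: int) -> float:
    """Estimated MAI-Voice-1 cost for a number of input characters."""
    return characters / 1_000_000 * VOICE_USD_PER_MILLION_CHARS


def image_cost(text_input_tokens: int, image_output_tokens: int) -> float:
    """Estimated MAI-Image-2 cost from text-input and image-output tokens."""
    input_cost = text_input_tokens / 1_000_000 * IMAGE_INPUT_USD_PER_MILLION_TOKENS
    output_cost = image_output_tokens / 1_000_000 * IMAGE_OUTPUT_USD_PER_MILLION_TOKENS
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical monthly workload, chosen only for illustration.
    print(f"100 hours of transcription: ${transcribe_cost(100):.2f}")
    print(f"500,000 characters of speech: ${voice_cost(500_000):.2f}")
    print(f"1M input + 1M output image tokens: ${image_cost(1_000_000, 1_000_000):.2f}")
```

Under these assumptions, transcription cost scales with audio duration, voice cost with character count, and image cost with the sum of separately priced input and output tokens.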

Despite the independent model release, Microsoft reaffirmed its ongoing partnership with OpenAI, noting that a recent renegotiation of that partnership enables the company to pursue superintelligence research while still collaborating with OpenAI.

Hardware and Ecosystem

Microsoft continues a dual strategy on hardware, producing its own chips while also sourcing components from external vendors, ensuring flexibility in supporting its AI services across its cloud and product ecosystem.

Source: TechCrunch