TechCrunch Multiverse Computing, a Spanish startup focused on compressing large language models, has launched the CompactifAI app and a self‑serve API portal. The app lets users run a tiny, locally stored model called Gilda on compatible devices, routing to cloud‑based models when hardware limits are reached. The new API gives developers direct access to the compressed models without using third‑party marketplaces. Multiverse cites privacy, resilience and lower compute costs as key benefits, and already counts the Bank of Canada, Bosch and Iberdrola among its more than 100 global customers. The company, fresh from a $215 million Series B, is reportedly preparing another large funding round.
Read more →