Enterprise

Databricks expands Mosaic AI to help enterprises build with LLMs

Comment

Databricks logo on building
Image Credits: Smith Collection/Gado / Getty Images

A year ago, Databricks acquired MosaicML for $1.3 billion. Now rebranded as Mosaic AI, the platform has become integral to Databricks’ AI solutions. Today, at the company’s Data + AI Summit, it is launching a number of new features for the service. Ahead of the announcements, I spoke to Databricks co-founders CEO Ali Ghodsi and CTO Matei Zaharia.

Databricks is launching five new Mosaic AI tools at its conference: Mosaic AI Agent Framework, Mosaic AI Agent Evaluation, Mosaic AI Tools Catalog, Mosaic AI Model Training and Mosaic AI Gateway.

“It’s been an awesome year — huge developments in GenAI. Everybody’s excited about it,” Ghodsi told me. “But the things everybody cares about are still the same three things: How do we make the quality or reliability of these models go up? Number two, how do we make sure that it’s cost-efficient? And there’s a huge variance in cost between models here — a gigantic, orders-of-magnitude difference in price. And third, how do we do that in a way that we keep the privacy of our data?”

Today’s launches aim to cover the majority of these concerns for Databricks’ customers.

Zaharia also noted that the enterprises that are now deploying large language models (LLMs) into production are using systems that have multiple components. That often means they make multiple calls to a model (or maybe multiple models, too), and use a variety of external tools for accessing databases or doing retrieval augmented generation (RAG). These compound systems speed up LLM-based applications, save money by using cheaper models for specific queries or caching results and, maybe most importantly, make the results more trustworthy and relevant by augmenting the foundation models with proprietary data.

“We think that is the future of really high-impact, mission-critical AI applications,” he explained. “Because if you think about it, if you’re doing something really mission critical, you’ll want engineers to be able to control all aspects of it — and you do that with a modular system. So we’re developing a lot of basic research on what’s the best way to create these [systems] for a specific task so developers can easily work with them and hook up all the bits, trace everything through and see what’s happening.”

As for actually building these systems, Databricks is launching two services this week: the Mosaic AI Agent Framework and the Mosaic AI Tools Catalog. The AI Agent Framework takes the company’s serverless vector search functionality, which became generally available last month and provides developers with the tools to build their own RAG-based applications on top of that.

Ghodsi and Zaharia emphasized that the Databricks vector search system uses a hybrid approach, combining classic keyword-based search with embedding search. All of this is integrated deeply with the Databricks data lake and the data on both platforms is always automatically kept in sync. This includes the governance features of the overall Databricks platform — and specifically the Databricks Unity Catalog governance layer — to ensure, for example, that personal information doesn’t leak into the vector search service.

Talking about the Unity Catalog (which the company is now also slowly open sourcing), it’s worth noting that Databricks is now extending this system to let enterprises govern which AI tools and functions these LLMs can call upon when generating answers. This catalog, Databricks says, will also make these services more discoverable across a company.

Ghodsi also highlighted that developers can now take all of these tools to build their own agents by chaining together models and functions using Langchain or LlamaIndex, for example. And indeed, Zaharia tells me that a lot of Databricks customers are already using these tools today.

“There are a lot of companies using these things, even the agent-like workflows. I think people are often surprised by how many there are, but it seems to be the direction things are going. And we’ve also found in our internal AI applications, like the assistant applications for our platform, that this is the way to build them,” he said.

To evaluate these new applications Databricks is also launching the Mosaic AI Agent Evaluation, an AI-assisted evaluation tool that combines LLM-based judges to test how well the AI does in production, but also allows enterprises to quickly get feedback from users (and let them label some initial datasets, too). The Agent Evaluation includes a UI component based on Databricks’ acquisition of Lilac earlier this year, which lets users visualize and search massive text datasets.

“Every customer we have is saying: I do need to do some labeling internally, I’m going to have some employees do it. I just need maybe 100 answers, or maybe 500 answers — and then we can feed that into the LLM judges,” Ghodsi explained.

Another way to improve results is by using fine-tuned models. For this, Databricks now offers the Mosaic AI Model Training service, which — you guessed it — allows its users to fine-tune models with their organization’s private data to help them perform better on specific tasks.

The last new tool is the Mosaic AI Gateway, which the company describes as a “unified interface to query, manage, and deploy any open source or proprietary model.” The idea here is to allow users to query any LLM in a governed way, using a centralized credentials store. No enterprise, after all, wants its engineers to send random data to third-party services.

In times of shrinking budgets, the AI Gateway also allows IT to set rate limits for different vendors to keep costs manageable. Additionally, these enterprises then also get usage tracking and tracing for debugging these systems.

As Ghodsi told me, all of these new features are a reaction to how Databricks’ users are now working with LLMs. “We saw a big shift happen in the market in the last quarter and a half. Beginning of last year, anyone you talk to, they’d say: we’re pro open source, open source is awesome. But when you really pushed people, they were using Open AI. Everybody, no matter what they said, no matter how much they were touting how open source is awesome, behind the scenes, they were using Open AI.” Now, these customers have become far more sophisticated and are using open models (very few are really open source, of course), which in turn requires them to adopt an entirely new set of tools to tackle the problems — and opportunities — that come with that.

More TechCrunch

Fisker is just a few days into its Chapter 11 bankruptcy, and the fight over its assets is already charged, with one lawyer claiming the startup has been liquidating assets…

The fight over Fisker’s assets is already heating up

A hacker is advertising customer data allegedly stolen from the Australia-based live events and ticketing company TEG on a well-known hacking forum. On Thursday, a hacker put up for sale…

Hacker claims to have 30 million customer records from Australian ticket seller giant TEG

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Elon…

Tesla makes Musk best-paid CEO of all time and Fisker bites the dust

Dot is a new AI companion and chatbot that thrives on getting to know your innermost thoughts and feelings.

Dot’s AI really, really wants to get to know you

The e-fuels startup is working on producing fuel for aviation and maritime shipping using carbon dioxide and other waste carbon streams.

E-fuels startup Aether Fuels is raising $34.3 million, per filing

Fisker was facing “potential financial distress” as early as last August, according to a new filing in its Chapter 11 bankruptcy proceeding, which the EV startup initiated earlier this week.…

Fisker faced financial distress as early as last August

Cruise, the self-driving subsidiary of General Motors, has agreed to pay a $112,500 fine for failing to provide full information about an accident involving one of its robotaxis last year.…

Cruise clears key hurdle to getting robotaxis back on roads in California

Feel Therapeutics has a pretty original deck, with some twists we rarely see; the company did a great job telling the overall story.

Pitch Deck Teardown: Feel Therapeutics’ $3.5M seed deck

The Rockset buy fits into OpenAI’s broader recent strategy of investing heavily in its enterprise sales and tech orgs.

OpenAI buys Rockset to bolster its enterprise AI

The U.S. government announced sanctions against 12 executives and senior leaders of the Russia-based cybersecurity giant Kaspersky. In a press release, the Department of the Treasury’s Office of Foreign Assets…

US government sanctions Kaspersky executives

Style DNA, an AI-powered fashion stylist app, creates a personalized style profile from a single selfie. The app is particularly useful for people interested in seasonal color analysis, a process…

Style DNA gets a generative AI chatbot that suggests outfit ideas based on your color type

Rates of depression, anxiety and suicidal thoughts are surging among U.S. teens. A recent report from the Center of Disease Control found that nearly one in three girls have seriously…

Khosla-backed Marble, built by former Headway founders, offers affordable group therapy for teens

Cover says what sets it apart is the underlying technology it employs, which has been exclusively licensed from NASA’s Jet Propulsion Laboratory.

A new startup from Figure’s founder is licensing NASA tech in a bid to curb school shootings

Spotify is introducing a new “Basic” streaming plan in the United States, the company announced on Friday. The new plan costs $10.99 per month and includes all of the benefits…

Spotify launches a new Basic streaming plan in the US

Photographers say the social media giant is applying a ‘Made with AI’ label to photos they took, causing confusion for users.

Meta is tagging real photos as ‘Made with AI,’ say photographers

Website building platform Squarespace is selling Tock, its restaurant reservation service, to American Express in a deal worth $400 million — the exact figure that Squarespace paid for the service…

Squarespace sells restaurant reservation system Tock to American Express for $400M

Featured Article

Change Healthcare confirms ransomware hackers stole medical records on a ‘substantial proportion’ of Americans

The February ransomware attack on UHG-owned Change Healthcare stands as one of the largest-ever known digital thefts of U.S. medical records.

20 hours ago
Change Healthcare confirms ransomware hackers stole medical records on a ‘substantial proportion’ of Americans

Google said today that it globally paused its experiment that aimed to allow new kinds of real-money games on the Play Store, citing the challenges that come with the lack…

Google pauses its experiment to expand real-money games on the Play Store

Venture firms raised $9.3 billion in Q1 according to PitchBook data, which means this year likely won’t match or surpass 2023’s $81.8 billion total. While emerging managers are feeling the…

Kevin Hartz’s A* raises its second oversubscribed fund in three years

Google is making reviews of all your movies, TV shows, books, albums and games visible under one profile page starting June 24, according to an email sent to users last…

Google is making your movie and TV reviews visible under a new profile page

Zepto, an Indian quick commerce startup, has more than doubled its valuation to $3.6 billion in a new funding round of $665 million.

Zepto, a 10-minute delivery app, raises $665M at $3.6B valuation

Speak, the AI-powered language learning app, has raised new money from investors at double its previous valuation.

Language learning app Speak nets $20M, doubles valuation

SpaceX unveiled Starlink Mini, a more portable version of its satellite internet product that is small enough to fit inside a backpack.  Early Starlink customers were invited to purchase the…

SpaceX debuts portable Starlink Mini for $599

Ali Rathod-Papier has stepped down from her role as global head of compliance at corporate card expense management startup Brex to join venture firm Andreessen Horowitz (a16z) as a partner…

Brex’s compliance head has left the fintech startup to join Andreessen Horowitz as a partner

U.S. officials imposed the “first of its kind” ban arguing that Kaspersky threatens U.S. national security because of its links to Russia.

US bans sale of Kaspersky software citing security risk from Russia 

Apple has released Final Cut Pro for iPad 2 and Final Cut Camera, the company announced on Thursday. Both apps were previously announced during the company’s iPad event in May.…

Apple releases Final Cut Pro for iPad 2 and Final Cut Camera

Paris has quickly established itself as a major European center for AI startups, and now another big deal is in the works.

Poolside is raising $400M+ at a $2B valuation to build a supercharged coding co-pilot

The space industry is all abuzz about how SpaceX’s Starship, Blue Origin’s New Glenn, and other heavy-lift rockets will change just about everything. One likely consequence is that spacecraft will…

Gravitics prepares a testing gauntlet for a new generation of giant spacecraft

LTK (formerly LiketoKnow.it and RewardStyle), the influencer shopping app with 40 million monthly users, announced on Thursday the launch of a free direct message tool for creators to instantly share…

Influencer shopping app LTK gets an automatic direct message tool

YouTube appears to be taking a firm stance against Premium subscribers who attempt to use a VPN (virtual private network) to access cheaper subscription prices in other countries. This week,…

YouTube confirms crackdown on VPN users accessing cheaper Premium plans