AI Hustle: News on Open AI, ChatGPT, Midjourney, NVIDIA, Anthropic, Open Source LLMs: Kolena Has Secured $15M for AI Model Testing Tools!

Jaeden Schafer & Jamie McCauley 10/10/23 - 10m

Transcript
Show Notes

Welcome to the OpenAI podcast, the podcast that opens up the world of AI in a quick and

concise manner.

Tune in daily to hear the latest news and breakthroughs in the rapidly evolving world

of artificial intelligence.

If you've been following the podcast for a while, you'll know that over the last six

months I've been working on a stealth AI startup.

Of the hundreds of projects I've covered, this is the one that I believe has the greatest

potential.

So today I'm excited to announce AIBOX.

AIBOX is a no-code AI app building platform paired with the App Store for AI that lets

you monetize your AI tools.

The platform lets you build apps by linking together AI models like ChatGPT, Midjourney

and ElevenLabs. Eventually it will integrate with software like Gmail, Trello and Salesforce

so you can use AI to automate every function in your organization.

To get notified when we launch and be one of the first to build on the platform, you

can join the wait list at AIBOX.AI. The link is in the show notes.

We are currently raising a seed round of funding.

If you're an investor that is focused on disruptive tech, I'd love to tell you more

about the platform.

You can reach out to me at jaden at AIBOX.AI, I'll leave that email in the show notes.

So San Francisco-based startup Kolena has recently announced a $15 million funding round led

by Lobby Capital, with additional investments from SignalFire and Bloomberg Beta.

This infusion of capital brings the company's total raised to $21 million, and

it is aimed primarily at scaling its research team, forming

alliances with regulatory bodies and amplifying sales and marketing initiatives.

If you listened to my recent interview that I did yesterday with Bradley from Project

Voice, it was really interesting because I asked him, you know, like what are some things

that you look for in startups that a lot of people are missing?

And he said one of the biggest things was that startups today need to be at

least thinking about regulatory frameworks and what's going to happen in the future.

And he's like, if you're not looking at that, you're really going to be left behind because

right now there's so much regulation coming down the pipe, there's so much changing in

the space.

And he's like, you know, a startup today could become completely irrelevant if the wrong

regulation comes down the pipe.

So, you know, no one can predict exactly what's going to happen in that, but startups need

to be aware and following and, you know, essentially let investors know that this is something

that they're cognizant of because this is incredibly important.

So to me, I find that interesting when, you know, Kolena right here just raised $15 million

and one of the big things that they're doing, right, is that their company

is set to tackle the trust deficit in AI with advanced model testing frameworks.

So one of the biggest things they're putting their $15 million toward is forming alliances with

regulatory bodies.

So very, very interesting.

So Kolena was actually founded back in 2021 by Mohamed Elgendy, Andrew Shi and Gordon

Hart, and the trio has extensive experience in AI departments across a bunch of different

industries,

at companies like Amazon, Palantir, Rakuten and Synapse. Their venture essentially is looking to address

a really fundamental issue in AI deployment:

the absence of trust among both its developers and the public.

So this is what they said, and this was actually Elgendy, quote:

The use cases for AI are enormous, but AI lacks trust from both builders and the public.

This technology must be rolled out in a way that makes digital experiences better, not

worse.

The genie isn't going back in the bottle, but as an industry, we can make sure we make

the right wishes.

So I think, going beyond some of the, you know, conventional metrics,

what really sets Kolena apart is its comprehensive, quote unquote, model

quality framework.

So this is a tool set designed to provide robust, customizable and enterprise friendly

model testing solutions.

So unlike other platforms that concentrate solely on, you know, component level testing,

Kolena offers end-to-end testing of AI and machine learning products.

So Elgendy again said, quote, first and foremost, we want to provide a new framework for model

quality, not just a tool that simplifies current approaches.

And he really was emphasizing a couple of things. With Kolena's

user interface, teams can construct test cases to rigorously assess a model's performance

across multiple criteria.

So it also kind of highlights potential gaps in AI model test data coverage and tracks

associated development risks.

So the platform aims to shift the focus from, you know, blanket metrics like accuracy scores

to more nuanced evaluations.

You know, Elgendy elaborated, quote, for example, a model with a 95% accuracy in detecting cars

isn't necessarily better than one with an 89% accuracy.

Each has their own strengths and weaknesses: detecting cars in varying weather conditions

or occlusion levels, spotting a car's orientation, et cetera.
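
To make the 95% versus 89% point concrete, here is a minimal Python sketch of per-slice evaluation. It is not Kolena's API or method. The slice names ("clear", "fog"), the counts, and the helper functions are all hypothetical, made up only to show how a higher overall accuracy can hide a weaker result on one condition.

```python
from collections import defaultdict


def make_records(per_slice):
    """Expand {slice: (num_correct, num_total)} into (slice, is_correct) pairs."""
    records = []
    for name, (num_correct, num_total) in per_slice.items():
        records += [(name, True)] * num_correct
        records += [(name, False)] * (num_total - num_correct)
    return records


def overall_accuracy(records):
    """Single blanket metric: fraction of all test examples predicted correctly."""
    return sum(ok for _, ok in records) / len(records)


def accuracy_by_slice(records):
    """Accuracy computed separately for each test-case slice."""
    counts = defaultdict(lambda: [0, 0])  # slice -> [correct, total]
    for name, ok in records:
        counts[name][0] += int(ok)
        counts[name][1] += 1
    return {name: c / n for name, (c, n) in counts.items()}


# Hypothetical results: 100 car images per model, 90 in clear weather, 10 in fog.
model_a = make_records({"clear": (90, 90), "fog": (5, 10)})  # 95 of 100 correct
model_b = make_records({"clear": (80, 90), "fog": (9, 10)})  # 89 of 100 correct

for label, records in [("A", model_a), ("B", model_b)]:
    print(label, overall_accuracy(records), accuracy_by_slice(records))
```

In this made-up data, model A wins on the overall number but gets only half of the foggy images right, while model B gets nine out of ten. Which one is "better" depends on the conditions the product actually has to handle.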

So AI engineers reportedly spend around 20% of their time analyzing and developing

models, according to one survey.

And another report indicates that only about 54% of models make the transition from pilot

to full scale production.

And I think this really reveals a big need for effective model testing

solutions. Industry giants, of course, like Amazon, Google and Microsoft

all offer similar services.

Other startups like Prolific, Robust Intelligence, Deepchecks and Bobidi are also

entering the arena with a bunch of different innovative approaches

as well.

However, Kolena claims to have an edge by offering customers full control over data types, evaluation

logic and some other testing components.

Its focus on privacy eliminates the need for users to upload their

data or models,

as Kolena only retains test results, which can be deleted upon request.

So looking ahead, Kolena remains pretty discreet about its current customer

base. Some people are saying that's because they don't have a good customer base.

Some people are saying, you know, it's just confidentiality reasons.

But I think the company is adopting a quote unquote selective approach, focusing on partnerships

with mission critical enterprises.

Now really, like if I'm going to be honest, what I read between the lines on that is a

selective approach means they don't really have a ton of users.

That's not a bad thing.

That doesn't mean they're not a great company and they're not going to be, you know, successful.

But they probably don't have a massive user base.

A selective approach also could mean that their technology is maybe not robust enough to roll

out to a larger audience, right?

Maybe they have had bigger companies and a big user base request to use them, but their

technology isn't strong enough to do that yet.

It's not bulletproof and they don't want to open themselves up.

So a selective approach means they can pick one or two clients that have very specific

needs kind of in the space.

They can make sure that they're fulfilling those needs and they're able to

beta test it, get rid of bugs, add the needed features and really

kind of test the waters.

I think it's actually a really good approach.

I have nothing against it, but I mean, I'm just, you know, trying to be transparent with

what I think that means.

In any case, the startup also plans to introduce team bundles aimed at mid-sized

organizations and budding AI startups in the second quarter of next year.

And they said, quote, minimizing risks from an AI and machine learning system requires

rigorous testing before deployment, yet enterprises don't have strong tooling or processes around

model validation.

Kolena focuses on comprehensive and thorough model evaluations.

We give machine learning managers, product managers and executives unparalleled visibility

into a model's test coverage and product-specific functional requirements, allowing them to

effectively influence product quality from the start.

So I think as AI really continues to permeate every facet of our lives, tools

that can evaluate, validate and benchmark AI models not only become really indispensable

but they also might dictate the future credibility of the technology, especially of specific

models.

So I think with this new funding, Kolena seems well positioned to play a pivotal role

in this evolving landscape, and I'm really excited to follow them on their journey

and see where they go.

If you are looking for an innovative and creative community of people using ChatGPT, you need

to join our ChatGPT creators community.

I'll drop a link in the description to this podcast.

We'd love to see you there where we share tips and tricks of what is working in ChatGPT.

It's a lot easier than a podcast as you can see screenshots, you can share and comment

on things that are currently working.

So if this sounds interesting to you, check out the link in the comment.

We'd love to have you in the community.

Thanks for joining me on the OpenAI podcast.

It would mean the world to me if you would rate this podcast wherever you listen to your

podcasts and I'll see you tomorrow.

Machine-generated transcript that may contain inaccuracies.

Tune in to learn how Kolena, the trailblazing company, has secured an impressive $15 million in funding to revolutionize AI model testing tools. In this episode, we delve into the innovation driving AI model testing and how this investment is set to reshape the industry. Don't miss this insightful conversation on the future of AI testing technology!

Get on the AI Box Waitlist: https://AIBox.ai/
Join our ChatGPT Community: https://www.facebook.com/groups/739308654562189/
Follow me on Twitter: https://twitter.com/jaeden_ai