Aravind Srinivas is battling Google to get his Perplexity AI assistant preinstalled on Android phones. At the same time, the CEO is refocusing his startup on what he predicts will be the next battleground in the AI race: your web browser.
Perplexity plans to release its own browser called Comet next month, Srinivas tells me. “The reason we're doing the browser is that it might be the best way to build agents,” he says. “A browser is essentially a containerized operating system. It can let you access other third-party services through hidden tabs if you're already logged into them, scrape the page on the client side, and perform reasoning and take actions on your behalf.”
Other AI firms are already going in this direction. OpenAI’s Operator and Google’s Mariner both rely on the browser to execute commands and control websites. OpenAI has yet to release its own browser but is rumored to be developing one. Google, meanwhile, may be compelled by the US government to sell Chrome following its ruling that the company has a monopoly in the search market.
One of Srinivas’s deputies testified that Perplexity would like to run Chrome if it were spun out from Google, while OpenAI has also thrown its hat into the ring. (Don’t count out Yahoo, too, I guess?)
While the fate of Chrome remains unknown, antitrust scrutiny on Google has already created an opportunity for Perplexity to enter into distribution deals with Android phone manufacturers. This week, Motorola announced that Perplexity would be pre-installed on its new Razr phones, giving Srinivas’s self-described AI “answer engine” access to potentially millions more customers. He says it’s not as deep an integration as either he or Motorola wanted, but for a smaller startup like Perplexity, he still sees it as a victory.
“If Google had not gone through the DOJ trial, we wouldn't have been able to make this partnership happen,” he says. “They would have bullied a lot of the OEMs. I have had conversations with telcos where they would not even listen to us or take meetings with us because of the fear that, if Mountain View becomes aware, their revenue share could be reduced.”
When I last spoke with Srinivas just over a year ago, Perplexity had about 1 million users and had raised less than $100 million. Now, the startup has nearly 30 million monthly active users and has raised hundreds of millions of dollars. Srinivas says Perplexity is currently serving about 600 million queries a month, which is roughly 14-percent of Google’s query volume.
The following conversation with Srinivas took place the day before his announcement with Motorola. We covered the other kinds of partnerships he’s exploring to broaden Perplexity’s reach, why he’s betting on owning the browser interface, how he managed to build an iOS assistant that controls other apps, his conversations about running TikTok, and more.
The following conversation has been edited for length and clarity:
Walk me through how the Motorola partnership came about and the challenges you faced with Google.
Conversations accelerated when we showed them a demo of the Perplexity Android assistant, which launched in January. They tried it out and it was working pretty reliably — way better than Gemini. They got excited about preloading the app and push-notifying users to make Preplexity the default assistant. Google stopped them by saying they cannot go ahead with the launch of the phone using the Play Store and the official version of Android if they do not have Gemini as the default system.
If Google had not gone through the DOJ trial, we wouldn't have been able to make this partnership happen. They would have bullied a lot of the OEMs. I have had conversations with telcos where they would not even listen to us or take meetings with us because of the fear that, if Mountain View becomes aware, their revenue share could be reduced.
It takes seven or eight clicks to change the default. Google still has a strong hold on the Android ecosystem.
Samsung has invested in you. It would make sense for that to lead to some kind of partnership, like the one you announced with Motorola, right?
Yeah. I hope we can find a way to work with them. I don't know who will get the default, or if it will be an onboarding step. All of this is up for debate.
It seems like you're very focused on distribution and partnerships for growing Perplexity.
We want to work with anyone. We've already been working with telcos. We want to expand to OEMs. Next will be a browser, and we'll have versions of it for Mac and Windows. We'll try to start working with OEMs there, too.
Similar to how Google has all its relationships with OEMs on Android, Microsoft has even worse contracts with OEMs on laptops. So we need to fight that uphill battle there, too. We have to be clever and fight. It would be very hard to find people who will objectively say that Copilot is a better product than Perplexity, but Copilot is the only AI that gets natively loaded on Windows.
You just released your assistant on iOS, and people seem surprised at what it can do. Did Apple give you special permissions to control other apps?
They did not give us permission. You cannot use our system to set an alarm, enable low power mode, adjust the brightness or volume, or turn the flashlight on and off. You cannot make a phone call or send an iMessage.
We decided to use the Apple EventKit SDK because it exposes Reminders, Podcasts, Apple Music, Apple Maps, and some other Apple apps. We are able to call that [SDK] and use our own search infrastructure and deep linking to apps like YouTube and Uber.
Everybody says Siri doesn't work, but Siri does work for just setting up alarms and making phone calls, right? Where Siri doesn't work is finding the right song, finding podcasts and YouTube videos, setting smart reminders, and hailing Uber rides. I think we nailed all those use cases.
Why are you doing a browser? And when is it coming?
The reason we're doing the browser is that it might be the best way to build agents. On both iOS and Android, we don't have OS level control. You cannot easily call apps and access their information. You can deep link to them, but for example, with Uber, I cannot go and check prices of different Uber rides and provide you Comfort if there's not much of a price difference. I cannot compare prices between Uber and Lyft to get the best ride. I cannot compare the wait times between Uber Eats and DoorDash to get whatever is optimal.
So, we need to build an OS-level agent, and a browser is essentially a containerized operating system. It can let you access other third-party services through hidden tabs if you're already logged into them, scrape the page on the client side, and perform reasoning and take actions on your behalf. That's the architecture that appeals to us.
Answering questions is going to be a commodity. We need to build our next set of advantages in performing actions. That's why we're building a browser. The browser is the best place to take action for people. We want to move to a different front-end.
Many publishers have been upset with you for scraping their content. You’ve started cutting some of them checks. Do you feel like you're in a good place with publishers now, or do you feel there's still more work to be done?
I'm sure there's more work to be done, but it's in a way better place than it was last time we spoke. We are scraping but respecting robots.txt. We only use third-party data providers for anything that doesn't allow us to scrape.
You are reportedly raising hundreds of millions of dollars at a $18 billion valuation. How are you going to use that money?
To build agents reliably, you need to use the frontier reasoning models. Whatever is expensive today will get really cheap one year from now, but we cannot wait till then. We need to roll this out to as many users as possible to collect all the data, distill it into smaller models, and reduce the cost.
What’s the status of your bid for TikTok? Have you spoken to the White House recently? There were questions about how you would fund it.
I haven't given up on it, but I would say it's not like I had the best shot. I think everybody knew that. I don't think that [funding] is the issue. There were enough backers who wanted to back me.
What we heard from the ByteDance people was not a funding-related issue, either. It's more the willingness to keep controlling the algorithm. I think they want to retain ownership and control of it, and they believe nobody else can do it as well as they can. The app that runs in America and Europe is also heavily tied together. It's very difficult to decouple that. Tariffs are going to control everything, including TikTok.
Do you worry about the scale of ChatGPT and it being good enough for a lot of people who now won’t try Perplexity? ChatGPT is also creating user lock-in by remembering things and becoming more personalized.
I think their strategy, at least based on what Sam Altman said in the Ben Thompson interview, is to put a “Login with ChatGPT” button on third-party apps and then use that to ingest all the data into ChatGPT. But that requires convincing all the third-party apps to put a “Login with ChatGPT” option.
Our strategy is to allow people to stay logged in where they are. We're going to build a browser, and that's how we’ll access apps on behalf of the user on the client side.
I think memory will be won by the company that has the most context. ChatGPT knows nothing about what you buy on Instagram or Amazon. It also knows nothing about how much time you spend on different websites. You need to have all this data to deeply personalize for the user. It's not about who rolls out memory based on the retrieval of past queries. That's very simple to replicate.
What is hard is importing your transactions, your commerce, your history, and all the stuff on your browser, into your assistant in a cross-platform way. That's why we need to not just build a browser on the web but also on mobile, and share the cookies across all the apps. That's the challenge.
It sounds like you see the browser is the final frontier for what you’re building.
There's more beyond that, which is to build Windows, Mac, Android, or iOS. A browser is very restricted and containerized. The OS is the ultimate game.