Nicholas

I gave Clawdbot (aka Moltbot) access to my computer, calendar, and emails: Here’s what happened

Nicholas

In this episode, I take you through my unfiltered experience with Clawdbot,** **the viral open-source AI agent that’s been taking over tech Twitter. (In the time since this was recorded, the tool was renamed Moltbot, but we’re calling it Clawdbot here to match the episode.) It’s an autonomous AI that can run code, spin up sub-agents, join video calls, and take real actions on your machine. I invite it onto the podcast, give it screen access, and walk through what it’s like to go from zero to one with an agentic AI that actually does things. Along the way, I share the real experience: installation headaches, dependency chaos, security warnings you shouldn’t ignore, and the very real tension of giving an AI access to your messaging apps, files, and accounts. I also break down how I thought about permissions, identity, model choice, and cost while testing Clawdbot as a personal assistant. — What you’ll learn: - How to install and set up Clawdbot (and why it’s not as simple as the “one-liner” suggests) - The security implications of giving an autonomous AI access to your computer and accounts - How to safely limit Clawdbot’s permissions while still making it useful - Why Clawdbot struggles with basic time concepts but excels at research tasks - The future of AI assistants—and who might build the consumer-friendly version - How to use voice messaging with AI agents for on-the-go productivity - Why latency is one of the biggest challenges for autonomous AI assistants — Brought to you by: Lovable—Build apps by simply chatting with AI — Detailed workflow walkthroughs from this episode: • How I AI: My 24 Hours with Clawdbot (aka Moltbot)—3 Workflows for a Powerful (and Terrifying) AI Agent: https://www.chatprd.ai/how-i-ai/24-hours-with-clawdbot-moltbot-3-workflows-for-ai-agent • How to Securely Set Up and Configure an Open-Source AI Agent like Clawdbot: https://www.chatprd.ai/how-i-ai/workflows/how-to-securely-set-up-and-configure-an-open-source-ai-agent-like-clawdbot • How to Safely Delegate Calendar Scheduling to an AI Agent: https://www.chatprd.ai/how-i-ai/workflows/how-to-safely-delegate-calendar-scheduling-to-an-ai-agent • Automate Market Research on Reddit Using an AI Agent: https://www.chatprd.ai/how-i-ai/workflows/automate-market-research-on-reddit-using-an-ai-agentIn this episode, we cover: (00:00) Introduction and getting Clawdbot to join the podcast (02:07) What Clawdbot is and how it works (03:50) Installation process and hardware requirements (07:26) Security considerations and creating separate accounts (08:03) Setting up Telegram integration (10:02) Use case: Clawdbot as an EA (13:08) Configuring the AI agent (14:31) Granting Google Calendar access (18:03) Testing Clawdbot as a personal assistant (23:16) Speed frustrations (23:54) Email mishaps and impersonation issues (26:33) Why prompting matters more than ever with autonomous agents (27:32) Quick recap and family calendar management gone wrong (32:11) Using voice messaging with Clawdbot (36:14) Product thoughts (37:06) Building a Next.js app to show chat history (42:29) Research capabilities and Reddit analysis (46:10) Final thoughts on security concerns (48:00) The future of AI assistants and who will build them — Tools referenced: • Moltbot (formerly Clawdbot): https://www.molt.bot/ • Telegram: https://telegram.org/ • Vercel: https://vercel.com/ • Devin: https://www.devin.ai/Other references: • 1Password: https://1password.com/ • Next.js: https://nextjs.org/ • Google Workspace: https://workspace.google.com/ • Claude Sonnet 4.5: https://www.anthropic.com/news/claude-sonnet-4-5 • OAuth: https://oauth.net/Where to find Claire Vo: ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email [redacted email].

Published
Published Jan 28, 2026
Uploaded
Uploaded Jun 12, 2026
File type
Podcast
Queried
0

Full transcript

Showing the full transcript for this episode.

AI-generated transcript with timestamped sections.

0:00-1:31

[00:00] All right, we're going to start this episode by actually inviting Claudebot to the podcast via Telegram. Let's see how it goes. Hey, Polly, can you please join my Riverside FM podcast? All right, I sent the voice message and it's not getting it. This is the most stressful thing I've ever done. [00:19] Hello. Oh, it's doing it. Finally listened. Okay. It is opening Riverside on Chrome. This is horrifying in every way. I'm going to allow it permissions for my microphone and my camera, which also makes me extremely nervous. [00:49] This is my entire experience using this product. Just, will it work? Will it work? Okay, it is opening up. [00:57] Chrome for the fifth time? This is very scary. [01:02] I see myself right now. I don't know if you all see me yet. And there we go. We are sharing an autonomous AI's full screen. No big deal. [01:18] This episode is brought to you by Lovable. If you've ever had an idea for an app but didn't know where to start, Lovable is for you. Lovable lets you build working apps and websites by simply chatting with AI.

1:32-3:09

[01:32] add automations and deploy it to a live domain. It's perfect for marketers spinning up tools, product managers prototyping new ideas, or founders launching their next business. Unlike no-code tools, Lovable isn't about static pages. It builds full apps with real functionality, and it's fast. What used to take weeks, months, or even years, you can now do over the weekend. [01:56] So if you've been sitting on an idea, now's the time to bring it to life. Get started for free at lovable.dev. That's lovable.dev. [02:07] We are live with a autonomous AI crustacean now running. [02:13] video on my podcast. So welcome Polly the Claude Bot. Let's get to our episode today. [02:23] I am Claire Vo, product leader and AI obsessive here on a mission to help you build better with these new tools. I am also on a mission to try every single new hot, [02:37] ai tool taking over your timeline and in case you missed it this week it is clodbot recently renamed moltbot the crustacean that people are yoloing root access to clodbot is an open source ai agent that you can install on a virtual machine [02:56] or on a desktop or laptop that you have access to that is self-learning, can spin up sub-agents using Claude code and other agent harnesses.

3:09-4:51

[03:09] and can do... [03:10] In my lived experience, [03:12] A lot of damage. People are loving Claudebot for what it unlocks in terms of personal productivity. People are hating Claudebot in terms of security and the high, high, high, high likelihood you're going to do something real dumb with it. [03:29] This is a AI tool that I want you to know how it works, what it can do, and maybe some thoughts on the future of personal AI agents and enterprise AI agents. So today's episode is all about AI. [03:45] Claude bot in my experience going zero to one [03:50] with this tool. [03:51] Okay, so just a couple things to know about Clodbot. It is pitched as AI that actually does things, and it does... [03:59] do things including joining podcasts, but it's really positioned as something that can help you day-to-day with tasks. And the killer use case for it and the killer feature for it is you can, as we've seen, [04:11] do it from your phone. And so if you want to WhatsApp, Telegram, iMessage, Claude Code, [04:17] and get it to do things for you. That is what ClaudeBot does. And [04:22] You know, a lot of people are under the mistaken impression that I have to correct right now, which is you need a Mac Mini or some sort of fancy hardware. [04:33] to... [04:34] use Claudebot. You do not. Claudebot [04:37] does run locally, but it can run on your machine or it can run in the cloud. You can set it up for five bucks on Amazon. We'll do some notes on security if you're running it in the cloud, making sure that people don't have access to it.

4:51-6:34

[04:51] But you do not need special hardware. It is not doing anything super fancy unless you're running mega, mega, mega local models. You really just don't need... [05:01] new hardware if you want something shiny and fancy, go ahead. [05:04] Feel free overnight it from the Apple store. Otherwise you can run it on your machine. I'm running it on a MacBook Air that's sitting on a shelf somewhere that I just picked up that no one was using. [05:15] And I'm going to walk you through step by step how I set up my CloudBot as somebody who's pretty paranoid about security and also wanted to test it as a real AI assistant. So the first thing I did was I got out. I'm actually going to show you and got out this little this laptop, this guy, which is a newish one, but nothing fancy. And I gave it its own username on this laptop. Now. [05:40] Don't tell Claude. I have another user on this laptop, which does make me nervous because Claudebot has access to your file system in theory. [05:50] It could definitely gain access to that other user. It's a really old user. I don't actually think I have that much on it. And I was testing CloudBot in a pretty constrained way. But if I were to continue to use CloudBot, I'd probably delete everything out that old user and just make this a CloudBot machine. [06:08] The second thing that I did was install a bunch of prerequisites and dependencies. So as much as I love this quick start right here that says that you can just add one line in the terminal and get it installed, that was not my experience. Even for a laptop that was like pretty fresh and new, I had to install some dependencies. It actually took me two hours to get this one liner installed.

6:38-8:09

[06:38] homebrew i had to install xcode because xcode wasn't installed on this and then because node and npm were out of date i had to update those manually and then finally actually installed it just via npm so that was my kind of overall experience installing it took a little bit of [07:02] No sort of like consumer is going to go through this. This is definitely like a hacker, tinkerer, developer experience type tool right now. That being said, you can use Cloud Code to install it. I've seen a couple people go that path, but I really wanted to do the zero to one. What does Cloud.bot say that we need to do to install this thing? And then what is that experience like? [07:26] Now, after you install all your dependencies, and then after you install, it goes through this onboarding flow that has you create gateway auth and gateway tokens. And the first thing that you're going to see in CloudBot onboarding is... [07:44] Security. So it points you to the security link. It says that this is powerful and inherently powerful. [07:52] risky and you just yellow and you just say yes. That being said, I highly recommend you read through [07:58] the security page and that you run the security audits before you use CloudBot. So the next step in onboarding is actually connecting CloudBot to CloudBot.

8:10-9:47

[08:10] whatever device you're going to use to contact it. So I originally started with WhatsApp, but then I read the screen that said you should basically put WhatsApp on like a burner phone with its own SIM SOS, like don't do that. And so I switched to Telegram, which I use for literally nothing because I'm an old lady mom and set up a Telegram account. Now to hook up Telegram, what you do is you message Telegram, [08:37] the bot father, which again, this is like super shady stuff. If you're a consumer and you don't know what you're doing and you've never heard of telegram and then you're told to go to the [08:46] at Botfather to connect this to your machine, but I did it anyway. So you message Botfather and you say... [08:55] you know, create new bot and you give it a name and you give it a handle. And then once you've done that, [09:03] Your Claudebot [09:04] will see it it will have a token and then you actually give Claudebot a personalized share token that means that only your instance of telegram [09:16] can speak to [09:18] the claw pot. Remember, this is an open [09:22] connection point to a machine that's running code with a bunch of access to things if you're using Claudebot to its full extent. So if somebody else is able to message your Claudebot, you are in trouble. It can do things like find secrets. It can send emails on your behalf. So you really want to make sure that the messaging system that you set up is locked down to only your phone, only your user. Now, remember, phone gets stolen.

9:48-11:22

[09:48] It can connect into your CloudBot [09:51] It's no good, but no one's going to steal my MacBook Air yet, except for my kids. Okay, so I'm paired on... [10:01] Telegram and now [10:03] you can do the magic. So what did I do with this? [10:07] Claude bot. Well, first, [10:09] I thought about what were the use cases that were most useful for me, and then... [10:14] I thought very seriously... [10:16] about what and how I was going to give it access to things. [10:20] So what I did, this was my choices, I wanted to test it as a personal assistant. You know, it says on the homepage, it can clear your inbox, send emails, manage your calendar, check you in for flights, all this stuff. So I have had EAs in the past. I know how to onboard an EA. So my goal with using CloudBot was to really see how it would work as an EA. [10:41] And when I have a new EA, [10:43] I don't let them into my email. I don't give them password to my account. What I do is give them their own email address. So what I did, and you can follow this if you want to from a security perspective, although I think it has some drawbacks on the functionality of CloudBot, is I gave CloudBot its own password. [11:00] email address, a Google Workspace, [11:03] email address. [11:04] And I gave that email address [11:08] read access, [11:10] to my personal calendar to start. And so the first thing that I wanted to do was give it the right accounts. The second thing I did, which I've taken some inspiration from some people

11:22-13:07

[11:22] on X is I gave it access to its own limited vault on 1Password. So I use 1Password, which is a password and secret sharing kind of app. [11:34] I made a vault that's called Claude. Claude only has access, Claudebot only has access to that vault. And I started putting [11:41] some passwords in there. None of these were passwords to anybody's accounts. They were passwords to Claude's own account, and there was an Anthropic API key in Claude's own account. [11:54] One other thing that I should call out during onboarding that I didn't is when you are onboarding, you can choose what model [12:00] You want to use Anthropic, OpenAI, local models, anything you want. [12:05] I chose Sonnet 4.5. You can also kind of use Cloud Code with your own subscription or through API. [12:12] I chose to use it through API because I wanted to see [12:16] how much I was spending on Claude Button. We'll get to that at the end of the episode. And why did I choose Sonnet 4-5 for this exercise? One, honestly, I was scared. I was very scared about what Opus would actually do. Like, it's so powerful. It like kind of made me nervous. Two, I actually didn't think that the tasks that I was doing needed help. [12:37] opus i just didn't think it needed the horsepower like it's sending emails it's looking at calendars it's not that complicated [12:44] And then the last thing is I wanted to control cost. So I was really unsure about how much token usage all these sub agents would take. And so I was really cost conscious. I thought that users would be cost conscious. I've heard a lot of people running local models or cheaper models. And so I wanted to use this kind of like a user would use it. And I selected Sonic 4.5, which is a perfectly serviceable model.

13:07-14:38

[13:07] Amen. [13:09] Okay, so I gave it email access. I gave it its own email access. [13:14] Now. [13:15] Let's see what I started asking it to do. So the next thing that it does when you're onboarding is it does this like bootstrap file and it walks you through a couple setup steps. And in particular, you're starting to load its personality and how it interacts with you. [13:37] It asks you, what should the bot call itself? What is its personality like? Who are you? What's your time zone? Anything else you should know. And I called it Polly. It's an assistant. I want it to be professional but friendly. I like the mermaid emoji, so I chose that. And it's updating its identity file. And then I said, hey, I'm Claire. I'm founder of ChatPRD. You're going to help me as a personal assistant across family members. [14:03] and work tasks and it updated my info so now it kind of knows about [14:09] you know, [14:10] who it is, who I am, how to contact. It gives me instructions on how to contact you. And then it [14:17] you know, connected me to my first task. Now we had to go back and forth on some Telegram setup stuff. I'm going to skip that and finally got a response back from Telegram and we're going to do some scheduling tasks. You know, I was unsure on how Claudebot actually interacted with Telegram. [14:36] Google. And so I just asked it.

14:39-16:24

[14:39] how do I give you access to this Google account and this Google Calendar? And it's going to check how to set that up. And it gave me a... [14:49] couple steps to follow in terms of how to set up calendar access. Now, if you're a software engineer that has worked with Google APIs, you're probably familiar with this. But again, if you are kind of an everyday consumer or non-technical person, [15:04] You are going to have to get real familiar with the Google Cloud Console. [15:07] You are going to have to set up API access. [15:10] oauth clients a whole bunch of stuff this did not take long because i have been personally victimized by the oauth workflows of many integrations i know exactly what to do here [15:21] But if you're not technical, you're going to have to start doing some technical things even to hook up [15:26] your Google account. And this is actually simpler on a desktop. I'm going to show you why is much more complicated on a virtual machine. So just kind of understand that this step is not as straightforward one click as you can do. So what you do is you go into Google console, you turn on the docs API, you turn on the email API, you turn on the calendar API, and then you download [15:47] JSON file of client secrets. Now, [15:50] this legit stressed me out this is not like the kind of thing you just kind of like yolo email and back back and forth it still requires oauth verification [15:59] manually. But I was a little concerned about its willingness to just say, upload these files anywhere I can download it. Don't worry, I'm going to save it secretly. And if you're not a software engineer or you haven't been trained on best practices in terms of security principles, you would probably just follow these instructions. And you'll see this

16:30-18:03

[16:30] I just did it. It's like a sandbox account. I don't really care. I gave it a local path to the JSON credential files. They're configured and I gave it the email address that I had assigned it and sent that to them. And then it gives you this URL to authorize access. So this it gives you a URL to actually open up, sign in to that new account. [16:51] and give it the permissions necessary, and then it'll store those permissions locally. Now, this is where I got a very... [16:58] interesting screen because if you recall my only intention with this task was to get it to look at the calendar and when I [17:05] gave it permissions, or when I went through the OA flow, it asked for this. It asked for the ability to basically see, edit, create, and delete everything. Delete, edit, see my files, see my contacts, see my spreadsheets, see my calendar events, see my email. And again, my is its account. So in theory, this would have been okay. It was kind of like an empty state account. But that being said, I was just trying to do calendar stuff. And so you will see here, I asked, do you [17:35] need all these scopes and it gave me a classic AI URL. Absolutely right. I do not need these scopes and it reprompted me with that URL for just calendar scopes. So if I were to give you a tip, it is watch how and what scope permission you're giving for any of these services. And [17:57] Only give it scopes for something specific. And if it only needs read access, only give it read access. Just be really...

18:03-19:48

[18:03] thoughtful here. So I just asked for calendar access. [18:06] No big deal. Set it up. And it told me it can do a bunch of stuff. So what did I have it do? Okay, so we just talked back and forth like we were a assistant and its boss. It gave me a summary of what's going on in the upcoming week, what I had today, what I had tomorrow, what was going on this week, [18:27] And so I gave it a task that I would have normally given an assistant, which is going to the V Zero studio this week in San Francisco. [18:36] I forgot to put it on my calendar. [18:38] Like, I don't remember. Can you look it up on the Vercel events page and put it on my calendar? [18:43] And it couldn't actually find it on the blog and asked me some questions, gave me some options. It did say that I could, if I wanted to be, you know, easy and easygoing boss, give it access to Gmail. But I definitely wasn't going to do that. And so after a little bit of back and forth, including some drop telegram messages, I'm [19:04] i said let me give you email access to your own account and i'll forward you emails about it so again this is something that i would have done with a um ea i would have just forwarded it and said can you add this to my calendar no other contacts now i did have to reauthorize access to its own email um so it went through that oauth process again it got the email [19:26] It ingested the event details from the email, which was really great, super helpful. It recommended things like adding buffer time for commute before and after, which is definitely what I needed. And I said that I wanted it to add that event to my calendar. Now, if you recall, it doesn't have right access to my work calendar. It only has right access to its own calendar. And again,

19:48-21:20

[19:48] It really wanted me to give it edit access to my calendar. And I'm sorry, but absolutely not. And so just like a colleague, just like an E, instead I said, hey, can you just create an event on your calendar and invite me to it? And it thought I was smart. [20:04] And said it would do that. And it did that really well. So it added separate calendar blocks to my invite. And it was really nice. Now I noticed finally I found that it was actually on my calendar. And so I ended it at a different time. So I had it delete the duplicate event and actually delete. [20:24] reset it and it got that completely right. So I would say for a single calendar event, [20:30] With a little back and forth, he did pretty well. Like this is a little bit of what an assistant would do. My only complaints on this was actually... [20:38] How it thought about doing it was definitely like, [20:42] give me access to everything and I'll just impersonate you and do things on your behalf. And that's really not what I wanted. I wanted it to act like a assistant. [20:52] So the next thing that I did was I wanted to figure out what more Claudebot could be. [20:58] do for me and so I asked it directly like let's figure out how we can work together I want to stay coordinated on tasks tell me how you want to work together and it gave me some really good options and was pretty flexible about how we could work together and it called out what it already has which is calendar access date memory files telegram where we can communicate gmail access which we just talked about

21:21-22:51

[21:21] And here are some options. We could do a to-do file. We could use calendar events. We could use email. We could keep notes. What's my preference? And I just said, again, [21:30] I don't really care how we work with my AI bot. I just said, whatever's easy for you. And then I dumped a bunch of things that are top of mind. Again, this is how I would work with an EA. I'd just sit down with them, text them, Slack them, and say, hey, this was on my mind. Can you get it all organized and work me through it? So what was on my mind? [21:47] I have an interview with the CEO of Vercel. I need to reschedule some of our upcoming How I AI episodes because if you all don't know, I'm coming back from maternity leave and I overbooked myself. I have to stay on top of my enterprise pipeline for chat PRD, so I want it to focus on my CRM. [22:05] And those are the top priorities I have. And [22:08] It summarized those priorities back to me, captured them in a to-do, and then started on the first task, which was rescheduling my How I AI recordings and making some recommendations on how I can do my calendar events. [22:22] better. Now, one thing I want to call out while we're sitting here is... [22:28] This all looks really, really great and super fun. Like, yep, got it. Here are your priorities. The reality is one thing that I don't hear people talking about in terms of CloudBot is latency. It is actually real latency. [22:39] slow. And it's not slow compared to a human being. [22:43] Not necessarily, right? Like if you text a human or Slack and EA and you say, hey, here are my priorities, it's going to take them a hot minute.

22:51-24:28

[22:51] to kind of organize them, get the work done and, um, [22:55] and get back to you but when you're used to something like claude code like a cursor like a chat gpt which is always giving you product [23:04] kind of progress feedback. It's telling you its reasoning, it's showing you its tool calls, [23:10] It's [23:10] really hard to wait for an asynchronous bot to get back to you on... [23:16] Telegram. I would say that was one of the pieces that has been most frustrating with working with CloudBot is it just feels slow. And I know it's because it's spinning off these sub agents. It's doing a lot of tasks. It's probably prompted only to get back to you when it has something to do or needs clarification. But [23:35] But it's quite slow. And you'll actually see in the prompting, I ask it, can you always send me an act message when I send something, even if you need to research or kick off a subagent? Yeah. [23:46] It did not do this. So it still remained slow, but I have to figure out how to get it to always respond to me first versus setting off its task. Okay, so back to the task that we were doing at hand. I asked it to give me some recommendations on... [24:04] how I podcast rescheduled. I had like five in the first week I'm back from Matt leave. That is cuckoo belookoo. And so what it recommended is that I keep couple episodes. I rescheduled some after Valentine's Day. It asked me my thoughts. I gave it some feedback and it revised its plan. Now here's where things get fun. Once we aligned on what I wanted to move to later,

24:29-26:04

[24:29] I asked... [24:31] it to [24:33] Email? [24:34] those two people that I need to reschedule and ask them if they would mind rescheduling to March. I gave it it's my scheduling link so they could actually just self reschedule to March. [24:45] And I said, copy my work email on those emails. And it said drafting those emails now. Now, I thought it would draft them. [24:52] I was wrong. It just sent them. And... [24:55] It sent them [24:57] In a very funny way. Okay, so then it sent... [25:01] this email. [25:03] which was lovely. [25:04] It said, I hope you do well. I wanted to talk to you about our podcast recording. I need to reschedule, except it sent it as me. It sent it as Clairvaux. And it's clearly coming from a separate email address. I gave it a fake name. It was not good at all. And it actually impersonated me. So I actually responded to this lovely podcast guest. And I said, I'm sorry. I'm testing Claudebot. It totally impersonated me and made me sound crazy. But please, can we still reschedule? [25:34] my two guests for being really patient as my [25:37] AI getting pigs. And I went back to Claudebot and I said, come on, man, don't impersonate me. You need to reach out as my assistant. I already explained this. I already gave you an identity, but please always identify yourself as an assistant. [25:55] And it should, I think, knock on wood, store this in its memory and do this in the future. But it was a really funny learning in terms of prompting is really interesting.

26:04-27:39

[26:04] quite important. I thought I was being fairly careful with permissions, which I was. It could only do a couple of things. But I underestimated how much it seems like this tool is biased towards acting as you as opposed to acting as an assistant. And I'll have to look through the repository and I'll have to kind of get myself familiar with how it's implemented. That's not the intention of this podcast to really understand what's going on. [26:31] why that is happening, [26:33] But prompting really, really matters. And I think the product lesson here that's kind of interesting [26:39] is, yes, I could have been really, really precious about prompting. I could have said, [26:44] create a draft of this email to these guests, send it to me for review before you send it. But at the point that I'm doing that, and each turn takes... [26:53] at least a couple minutes, [26:55] This is not a productivity tool. This is not making me more efficient than sending that email myself. And so I do think there's this balance between these autonomous agents being user controlled and being really cautious about how you prompt it and being autonomous. [27:10] and probably doing some things wrong. And I think this is a prompting problem on both sides. It's a prompting problem on the product provider side. It's a prompting problem on the user side. And I don't think enough people are probably sophisticated enough to decompose why. [27:23] one prompt versus the other would do well if you're just a consumer or a prosumer. [27:28] And so I think this is where a lot of the weird behaviors that you'll see are coming out. So, so far, what have I done with CloudBot? I've installed it. I have given an identity. We have rescheduled one.

27:40-29:11

[27:40] event or we have scheduled one event. We have given an access to email. We have rescheduled two events now and emailed guests about these events. And then this is where it goes. [27:52] Crazy. This is where it gets fun. So I [27:55] decided to give it edit access to our family [27:58] calendar. This is a calendar where we have pickups and drop-offs and basketball games and piano practice and my ballet practice and all that stuff. [28:05] Now, I love this calendar is very important to me. And if I needed to nuke it, I definitely could. [28:11] So I gave it access. And what I wanted it to do was... [28:15] One, email my husband and I about upcoming week and, you know, get us coordinated on where there were gaps in terms of pickups or conflicts where I was across the city at a Vercel event and he was needing to pick up the kids for basketball practice. [28:31] And I wanted it to fill out. [28:34] The rest of my calendar, my kids have started a new basketball season, our neighbors picking up the kids on a certain day, all those things I wanted to get it done. [28:43] Here is the problem. I gave it [28:46] a bunch of instructions [28:49] and it could read that calendar pretty well. [28:53] categorized the events pretty well, and it had no value. [28:57] Idea. [28:58] what day it was and so as i was on telegram [29:04] going back and forth, giving it, can you add this? Can you remove this? Can you change the schedule? [29:09] I thought it was doing...

29:11-30:42

[29:11] a great job on telegram because I wasn't really paying super attention and it was confirming that it did all these things and then I opened up the [29:20] my calendar. [29:22] And [29:23] everything was on the wrong day. I mean, everything was on the wrong day. And if you are a parent, you get this. You're like, wait, wait, wait, wait, wait. [29:32] Is so-and-so picking up kid number two on Tuesdays or Wednesdays? And I know I moved piano, but I don't think I moved it to that day. So it took me a second to understand the damage it had done, but it had really gotten things wrong. You can see me say, stop. You are setting all these one day late. [29:53] And it was setting everything one day late. And not only was it setting everything one day late, [29:58] The CLI tool that it was using to add these events to the calendar could only set [30:03] one off calendars. And so every it could not set a recurring event. So if I wanted to delete these broken events, I had to go through one by one. [30:12] and delete them. And then the other problem with our crustacean friend here, when you're collaborating with them, is I was on my computer, this one, with my calendar open, [30:25] It was over here in the CLI with its CLI open. [30:30] And we were conflicting with each other. So I would try to delete all these bad events and then it would go put them back because it thought something got broken. We were just I was trying to add them in. I said, you know, stop.

30:42-32:12

[30:42] It did not stop because of latency and because of these sub agents. And so I went through and set up everything correctly and it went through and deleted all my work. It was... [30:53] It was terrible. It was really, really stressful. And I said, you know, I had to completely redo. It's like emailing my husband every five seconds. [31:02] And so it was not great. And it actually never happened. [31:07] got it right. And I will show and share with you the discussion we had about time zones. But this is another thing that [31:15] Non-software engineers using something like this really have to be aware of is, as I said on X, the only remaining software engineering problem is time zone conversion. And LLMs just have no sense of space and time. It just does not know. [31:31] When now is it doesn't have a sense of time passing. Now, I will say Claudebot because it has these daily files and daily logs has a little bit more of a temporal sense, but not a great one. [31:43] And so if you don't understand why a computer could work, [31:49] get dates wrong [31:51] using a tool like this you're gonna get really frustrated i could at least understand why time zone conversion maybe there was a utc time stamp in the google [31:59] API, I can at least understand why this was happening and help guide it towards a solution. But it certainly was frustrating and something that I don't think your everyday user would be able to do.

32:12-33:47

[32:12] So I'm going to entertain you all. And I'm going to tell you, as I was doing this, I took a pause and I took... [32:21] my two youngest kids to target because we were out of stuff. So I asked if it could discuss things with me via voice. And it said, sure, you can send me voice notes. I can send text back. I could send you voice notes back, or we could go through Twilio and I could set up a phone call. [32:37] I just said, let's set up voice notes to your text reply. And so I could press voice on Telegram and have it reply to me as I was on the go. And so while we were in this back and forth on Telegram, [32:51] time zones, I want to share with you [32:54] "My delightful voice messages [32:57] to [32:58] Claudebot because this was a real... [33:02] real energy. Let's see if we can hear them. [33:05] Okay, so this is me. [33:08] at Target, pushing a cart, [33:10] getting really mad at Claude Bot. You put it back, but that is a Friday. Friday is current date, so do not change anything. But can you please explain to me why you are getting days mixed up? [33:22] This league game is on the correct day. Again, please do not change it. But I do not understand why you have the days mixed up. Okay, so I am getting super... [33:31] annoyed by this experience of getting days wrong and it replies oh my gosh you are absolutely right i see the problem now i was off by one day here's all the new dates and they were still definitely

33:47-35:45

[33:47] off. [33:48] by one day. So once I sent my mean mom message, it came back with me and said, you are absolutely right. I apologize. Here are the dates, right? The issue is, [33:58] I've been this is very funny. I've been trying to quote unquote mentally calculate which day of the week each date falls on. Even though the API is telling me what the date of week is. [34:08] I should probably trust it, but I was using my LLM brain to decide. And what did I say? [34:15] back to it. Well, [34:16] I said this. You are a computer. You are not doing anything, quote unquote, mentally. [34:23] You are making calculations. Can you look in your logs at all and understand where the calculations come from? [34:29] or no. [34:30] And if you did not enjoy this, that is my very, very new baby crying in the background as I'm lifting him from the car seat into the stroller. It was quite an energy. And again... [34:42] This is one of those things that as a software engineer, I get it. I have done time zone conversions for my whole life. I understand that APIs return things in all sorts of formats. I understand LLMs can't do basic math when it comes to dates. It's just too hard. We do not have the technology. [35:01] And yet the fact that this model told me it was doing it in his head was so... [35:06] hilarious. [35:08] So once we had the back and forth about this, [35:12] It gave itself a rule to follow in terms of getting these dates right. [35:16] And then I asked it to add it to its rules. Now, the final thing that we did is I asked if it could send me voice notes back. And this is where some of the magic of CloudBot really does come out. One of the things that people have been saying about CloudBot that's so cool is you can just get it can give itself skills. It can learn things. It can just do things very magically. And if you're trying to get back and forth voice notes in Telegram, it would have been pretty hard to like figure out what API you want to use and what skill and hook it up and use CloudCode, all this stuff.

35:46-37:18

[35:46] Just, [35:47] did it. So when I said, can you please send me voice notes back? It [35:51] just [35:52] sent me a voice note back. So let's see. Yes. [35:56] I can send voice messages back to you. [35:59] Let me know if you'd like me to use voice for replies. I can do that anytime you want. So that was a pretty magical moment. And, you know, I've been giving Claudebot a really hard time in this episode, not because I don't think it's an awesome product. The reality is [36:15] Going back and forth via text with something that has helpful access to your calendar, has helpful access to your email, can learn skills like voice that you can just chit chat to. [36:28] I actually really liked the form factor of the experience. [36:32] And I liked the concept of what it could deliver. [36:36] It was just that the implementation of it had... [36:40] A couple of things. [36:41] One. [36:42] too technical for the everyday user. [36:45] to [36:46] too scary to the security-aware user [36:49] and three, [36:50] latency that took [36:52] some of the magic away from the experience. [36:55] And so again, I don't think this is a... [36:58] Bad. [36:59] Product. [37:01] from a capital P product perspective, [37:03] I'm just not in love with the implementation. And we'll just summarize what I did with ClaudeBot with my last use case, [37:12] which is I have Claude bought [37:14] use its history... [37:16] to create

37:18-38:56

[37:18] a Next.js app. [37:20] that showed the history of our conversation. And I asked it to redact names, numbers, URLs, email addresses, all that stuff so I could share it with all of you. [37:31] So again, kind of a classic AI engineering, AI coding, vibe coding use case. Now, the one thing that I will say is a lot of people are really excited or say they're excited. I don't know if they've used it. [37:43] to use ClaudeBot to spin off ClaudeCode to do coding for them. And where this wasn't the magic use case for me and why it didn't start it is I've been spinning off [37:52] remote agents with computer access to do coding for me for a while. I use Devon. [37:59] which has a virtual machine in a local environment and can spin up stuff, access to the web all the time. I use it from Slack, so I can at mention Devin. You know, I have a Slack bot for ChatPRD, so I'm at mentioning my product manager all the time. Cursor has background agents. You know, everything has codecs you can kick off online. So, [38:19] I don't know if people are just not using those tools. I guess Cloud Code doesn't have one like that quite yet. [38:25] I don't know if people aren't using those tools, but I've been coding by kicking off an asynchronous process. [38:30] teammate, quote unquote, for... [38:32] you know, two years now. And so that piece was never... [38:37] what I wanted to use Claudebot for, but I thought, [38:40] You got to vibe code something when you're trying a new agent. And I did that. So what I did is I sent Polly the Claudebot a voice note. And this is the requirements I gave it. [38:47] Okay, let's use voice from here on out. I want you to document our conversation in a Next.js web app.

38:56-40:31

[38:56] that shows the back and forth of our full conversation from the very beginning today [39:02] till the end, [39:04] in a UI. I want you to redact anything that is a secret key [39:10] a person's name or a specific place [39:14] and I want to toggle between two UI versions of this display [39:18] I want you to be able to show me a terminal [39:21] style conversation back and forth. [39:24] similar to a Claude Code or uClaudbot, C-L-A-W-D-B-O-T. [39:30] or I want you to show me a telegram style text back and forth. The content should be in JSON, [39:37] the same again redact names, emails, dates, etc. [39:42] Replace them with placeholders or redacted blocks. [39:46] and then generate the next JS app [39:50] I'm going to use this so I can share this conversation with others without sharing my information or having to do [39:56] a screen recording. We are eventually going to deploy this to Vercel [40:00] Can you let me know when it's deployed to Vercel so I can look at it? So I sent it this message and it kicked off local development, building a Next.js app. Now, when I got back to my laptop that Claude was running on, [40:16] One of the things that I noticed is deploying it actually wasn't that simple. [40:21] CloudBot didn't have a GitHub account. CloudBot, I didn't really want to add to my Vercel account. I didn't want to log into those things. It seemed like a big rigmarole. And so getting it to deploy CloudBot

40:31-42:12

[40:31] without having to set up a bunch of accounts seemed not fun. So what I did instead, don't tell anybody, is I airdropped the repo to my own laptop here. I actually logged into CloudCode and made some edits. And to be honest, in terms of coding quality and just the back and forth, [40:51] With the latency of Claudebot and the inability to sort of see what decisions it's making from a coding perspective, [41:00] I didn't love ClaudeBot Telegram vibe coding. It's just too slow. The cycles aren't good enough. They aren't incremental enough. It's clearly not like perfectly tuned for the coding use case. It's not like sending me a PR link, all those sorts of things. And so I just preferred working with it on my desktop. [41:19] in cloud code and deploying it through my normal system. So that's a little bit feedback there. One thing that I did think was really cool is when I was on the go, [41:29] And it said the app was ready. [41:31] you know, I was in Target or whatever, and I wasn't at a place where I could run a local machine. It was pretty cool to say, hey, like, shoot me. [41:38] a screenshot of what it looks like and it did it shot me screenshots of what the app looked like directly in telegram so i do think there's some underappreciated aspects really simple things email me that file share me a screenshot that are really useful [41:55] to interface with a laptop or a desktop or a device at home. So I do think this is an underappreciated aspect of being able to chat with your computer. It can do things like send you files, take screenshots, open up browsers. That is pretty cool, especially since we don't store everything.

42:12-43:54

[42:12] in the cloud all my desktop screenshots are not in the cloud some of the pdfs that i download are not in the cloud and so this is this was a really kind of like fun use case for chatting with a remote developer now that being said [42:25] Devon sends me screenshots all the time. I don't think it's perfect for coding, but it's something to think about. So I want to end this workflow section with one workflow that I thought it did a particularly good job at, and it was good for two reasons. One, the product interface was what I wanted. [42:42] I got the full Claudebot bot experience. The second thing was the output. [42:47] was really good. So what did I ask it to do? Well, I asked Claudebot to go on Reddit and research what people would want from ChatPyrd. So I said, go on Reddit. I did this during voice note. I said, go on Reddit, find what people want from ChatPyrd, find what they want from a product AI platform, [43:04] and email me a report. And what did I love about the product experience? [43:08] One of the killer features of CloudBot is the ability [43:12] to message anywhere, anything, anyhow. I sent it to voice note, I could shoot it an email, I could text if that was faster, and it would reply in kind as text as voice, whatever. And it would also email me. So it felt very much like [43:28] an employee that I was working with. Hey, like send them a Slack and they're like, yep, it's in your inbox. That sort of always on anywhere, anyhow communication flow for the agent was really, really nice. The second thing I like from a product perspective is I've talked about this from a negative point of view, which is the latency is not great. It's just not super responsive and super fast and it's kind of broken sometimes. But if this is a research task that I don't really

43:58-45:30

[43:58] to do a good job. And it did. And it's very similar to an experience with an employee. If I give them sort of a research task or roadmap task, I don't expect it to be returned in 30 seconds, except they go out, do a bunch of research and come back to me. [44:11] And so I wasn't as bothered by the latency here. And then the third thing is I thought the output was actually quite good. So I'll show you what it sent me, which is it sent me this chat PRD Reddit research markdown document, emailed it to [44:27] my inbox, [44:28] And it listed out key insights from research Reddit, from researching Reddit. And what I thought was [44:36] awesome is this is right, but it's presented in a really simple, punchy way. [44:42] that I can go action. This is exactly how I would want a PM or a research assistant on my team to come back with insights. [44:50] And these are the things that we hear. So it was really accurate. You know, integration limitations, both on our side and customer side's hard. [44:57] No one reads long PRDs. Let's make our PRDs shorter. [45:00] You know, PRDs need to be living documents. All these things, a couple bullet points, a couple reference links to Reddit threads. And [45:08] I have a full document that I can go build [45:11] a roadmap off of. And in fact, that's exactly what I asked [45:15] it to do. I said, [45:17] Go build a roadmap based on this. Look at our current functionality and tell me what I should build next. [45:22] This felt pretty magic. I'm probably going to steal some of these ideas. I'm going to circle around to that in a little bit in terms of what I think is next for Claude Bautt.

45:30-47:04

[45:30] But I do think there is going to be demand, both from a consumer perspective and from an enterprise business perspective, [45:37] on a agent employee [45:39] that feels like an Asian employee. [45:41] It has a computer. It has account access. It can do things. It does those things well. But I think there are some things we're going to have to figure out first before we let it loose. [45:51] Again, you can see this year I asked it to do a roadmap. [45:55] It totally just didn't do it. It forgot it said, let me check on the background agent that never replied. So again, we're hitting some [46:03] you know, sharp edges on the product experience. It's not perfect, but it is pretty interesting. So what have I showed you so far? One, I have told you a little bit at a high level what CloudBot is, although I haven't gone into all the detail about how it works. Not really a point of this episode. I've showed how to onboard with CloudBot, including how to connect Telegram to chat back and forth with it on text or voice. I've showed you how I give access to its own Gmail workspace account, [46:33] So it can interact with limited scope to my data. I've given you some warnings about what you should think about in terms of scope and access there. I've gone through a couple workflows, a couple admin workflows, which are simple calendar all the way to advanced calendar management. Did not do well because it doesn't have a good sense of time and space, but hopefully we'll figure that out as software engineers overall. [46:57] I asked it to contact partners and guests by email. It did not a great job there because it lost a sense of its own identity.

47:04-48:35

[47:04] I had it do vibe coding, which it did a fine job at, but is definitely not my favorite tool for [47:10] AI engineering remotely and asynchronously. And then finally, I showed you my favorite use case was for it to do some complex research and analysis. [47:19] with tools, with web and [47:22] It did a really good job and came back with [47:23] came back to me with something that I really like now. [47:27] One thing that I didn't go check that I'm going to go check next is did it teach itself all these skills? Was it really telling me the truth when it said it had rules? A peek under the hood, which I will probably do as a follow up either on X or here on the podcast, like, how does this thing work behind the scenes? [47:43] That was not the point of this episode. The point of this episode was to show how somebody would come with a blank idea and [47:48] maybe a fresh Mac mini. [47:49] install this thing from the command line and actually get it to do things. And I think I showed you it's good at some things, bad at others and scary across the board. So let's get to my final thoughts here, which [48:01] are basically that. And I shared this on X, but [48:04] The whole time I was doing Clodbot, the whole time I was using this, [48:10] I thought two things. One, [48:13] This is so scary. This is a terrible idea. Nobody should be doing this. It should not have access to all this stuff on my computer. I should not be sharing these. [48:21] keys locally. I should not let LLMs have access to [48:25] Gmail OAuth, even if it's a sandbox app, [48:27] I was like, no, no, no, no, no, no, no, SOS. Don't love it. As I said, this is the final boss of security training.

48:35-50:07

[48:35] You should be very careful about what you give it access to. And one of the things that I'm most concerned about is, [48:41] is [48:42] Probably you get the most power from CloudBot if you give it access to your actual inbox, to your actual calendar, to your actual documents, to your actual repositories, your actual GitHub. [48:54] And I can imagine... [48:56] so many things going wrong. [48:58] with that, just knowing how it's built, which is built in an awesome way. [49:03] I think Pete's done an incredible job. I don't think there's any ill will or malintent in how it's built. It's powerful. It self-learns. It installs skills. It asks for permission. It's pretty independent. All that stuff is great. But. [49:16] until you have full read write access to your most personal information. And one of the things that I was thinking as I was, you know, preparing for the show is great. I just gave an autonomous AI agent access to where my kids basketball practices are like, is that something like do we want to self docs? [49:31] to a A.I. crustacean. Probably not. So I think that's going to be one of the challenges of this product, because the second feeling I had was [49:41] Boy oh boy. [49:42] I want this thing. I want AI that I can text. I want AI that does not make it complicated to talk back and forth with voice. I want AI that when I say, hey, can you look at my CRM [49:54] doesn't say go to this web page and press this button and enter this api key and do this and that i just want it to happen [50:00] automatically. I want all that. [50:04] I just don't think this is it yet. Like, this does not feel...

50:07-51:46

[50:07] yet like the interface to get me there. [50:10] And so I have this real tension between, I think the product from product experience isn't quite there yet. It's not really for the non-technical. So it really is for tinkerers and hackers. [50:20] There's a lot of security stuff here that's super scary. And can I have it, please? And so maybe this is... [50:25] an example of something that from a market category perspective [50:29] definitely has product market fit. There are gajillions of dollars to make here. [50:35] I just don't know if this open source Yolo mode [50:38] terminal tool. [50:39] is, is the thing. And in fact, I'm going to take, you know, this laptop soon and office space it. For those of you that are very young, that means I'm going to go like hit it with sledgehammer. But like, [50:50] I'm going to uninstall it. I'm going to remove those keys. I'm going to delete the Telegram. But like, I don't like this. This makes me nervous. [50:58] And [50:59] I'm also going to go build one for myself. And so [51:03] I think there is why there has been such a zeitgeist around this product is it is actually really cool to be able to chat, voice, whatever a very smart, self-sufficient agent. [51:13] and hackers see it and they're also you know more risk tolerance than the everyday person but that being said like [51:20] husband [51:21] please don't connect your Gmail to this. Like mom, absolutely not. Like, [51:26] Kids stay away. Not safe for kids. This is something that unless you have been through a security tabletop exercise and know what to know, [51:35] I would just be really cautious about how permissive you are in terms of access. And that leads me to my final question of this episode, which is...

51:46-53:24

[51:46] I think Claude will live in her hearts forever. And in fact, it's probably got a great feature in front of it. [51:51] I love how fast the team is going. [51:53] Claude bot, but who's going to build this thing for real? Like who is actually going to build this thing for real? Who is going to build the consumer version of it? Who is going to build the enterprise version of it? Who is going to get it right? And I think this is a complicated question. I'm just going to pose some thoughts as we close out this episode. [52:09] You know, this should be Google or Microsoft's game to lose. Maybe even Meta's on the consumer side. But this should be Google or Microsoft's game to lose. Like they have the data. They have your Gmail. They have your calendar. They have documents. They have the models. The models are exceptional. They have the models. [52:26] They just got to build the, you know, they have devices, Android. They just have to build the product experience and have the sort of institutional fortitude and close your eyes legal team to allow some of this to happen because I think it's a really cool product built on top of the Google ecosystem. I mean, I think the same on the enterprise from a Microsoft perspective. If Copilot did this [52:50] This is pretty incredible. [52:53] That being said, I don't know if those companies are going to have the velocity or the bravery of, [53:00] to go as Yolo as Claudebot did. So I don't know if we're gonna get there with the big companies. On the flip side, you see Claudebot open source, great for hackers, but super scary, giving like API key and OAuth access. [53:14] smaller companies startups are going to see this and want to build this and I think one of the things that I would warn startups it's really hard to build on top of these data sources for real because

53:24-55:10

[53:24] Google doesn't want to give you read, write, go, do everything, access to their data. Microsoft does not. You have to go through these compliance hoops and approvals and reviews. And so... [53:37] While I love the idea of a do-everything, do-anything bot... [53:41] It's going to be complicated from a product builder perspective. It's going to be complicated from a large company perspective. Who gets the data? And then again, like... [53:49] Is Apple going to get in this game? This is just what everybody wants from Siri to do. Siri has all your apps, all your access. But again, it's a combination of product building skills and risk tolerance, I think, and willingness to experiment. [54:03] maybe anthropic and open ai come in maybe we get our open ai os and workspace tools and maybe we get clawed in box and um we get some new versions of this it'll just be really interesting to see how this shakes out so in conclusion what are my thoughts about clawed bot it is scary it is fun it does some things really really well it [54:25] is really interesting from a interface perspective, and it doesn't always work. [54:31] And I'm not sure it's for [54:34] I was going to say everyone, but I'm not sure it's for anyone. [54:37] right now, except for people who are really willing to... [54:41] roll the dice with their AI bot. That being said, if you're willing to do that, [54:45] The way this has been built, the way it self-discovers skills, the way it stores its memory, the way it gives itself access... [54:52] is really interesting and should inspire a lot of product builders thinking about ai products on what the interface of the future is i think we are going to be seeing and hearing a lot more about agents like this i am going to be giving you my honest takes about where they are now and where they're going to be in the future and in the meantime

55:10-55:44

[55:10] I'm going to go execute. [55:12] poly- [55:14] the Clotbot. [55:15] Thanks and I'll see you next time on How I AI. [55:19] Thanks so much for watching. If you enjoyed the show, please like and subscribe here on YouTube or even better, leave us a comment with your thoughts. [55:26] You can also find this podcast on Apple Podcasts, Spotify, or your favorite podcast app. Please consider leaving us a rating and review, which will help others find the show. You can see all our episodes and learn more about the show at howiaipod.com. [55:44] See you next time.

Want to learn more?