About this transcript: This is a full AI-generated transcript of Microsoft Build 2026 — Opening Keynote from Microsoft, published June 3, 2026. The transcript contains 21,078 words with timestamps and was generated using Whisper AI.
"*music* *music* *music* *music* All right, good morning. *music* Good morning. It's fantastic to be back in San Francisco at Build. It's always fun to be at developer conferences at times of great change. You know, developer conferences are always about understanding tech shifts, understanding the..."
[00:00:00] *music*
[00:00:16] *music*
[00:00:18] *music*
[00:00:28] *music*
[00:00:37] All right, good morning.
[00:00:39] *music*
[00:00:42] Good morning.
[00:00:44] It's fantastic to be back in San Francisco at Build.
[00:00:50] It's always fun to be at developer conferences at times of great change.
[00:00:55] You know, developer conferences are always about understanding tech shifts, understanding the new stack.
[00:01:01] But it's also about really coming to grips with the new opportunity, right?
[00:01:07] For us as developers, for the companies that we work at, as well as the broader world.
[00:01:12] So today we're going to unpack this, this conference is all about that.
[00:01:17] And if there's one key takeaway, it would be this.
[00:01:21] How do you all participate fully in this frontier intelligence ecosystem?
[00:01:28] Right?
[00:01:29] It's not about any one piece of technology that you'll hear about or even the platform itself.
[00:01:35] It's about the value that you can build, you can compound, you can create on top of the platform.
[00:01:42] That's what a developer conference needs to be about and that's what we will focus on.
[00:01:47] So let's kind of get into it.
[00:01:49] Let's take a look at this AI stack that this conference is going to unpack in a great amount of detail.
[00:01:56] It starts, of course, with the compute fabric, right?
[00:01:59] That ubiquitous compute fabric that spans the edge and the cloud.
[00:02:05] You then have this layer which is the emerging layer where you have the models, you have context, you have the tools the models can access.
[00:02:15] And then on top of that, you have the runtime where you deploy your agents and applications that you built on top of the context layer, on top of the model layer.
[00:02:27] And then, of course, you have the best tooling to do all of this.
[00:02:31] And then you have the security, the compliance and the governance, right?
[00:02:35] That's the simplest form of the tech stack.
[00:02:38] But let's kind of start where it always starts, which is infrastructure.
[00:02:43] And in fact, let's start at the edge with Windows.
[00:02:47] Because when you step back, the amount of compute there is at the edge is actually astounding, right?
[00:02:56] I mean, think about every NPU, GPU, CPU even, every PC.
[00:03:03] If you sort of aggregate that, that's a lot of compute power.
[00:03:06] So we asked ourselves one simple question, right?
[00:03:11] If we can deliver unmetered intelligence to every desk and every home, right?
[00:03:17] It takes us all the way back to the very beginning.
[00:03:20] But that's what we said.
[00:03:21] Can we do that in this era of AI?
[00:03:24] And in fact, we're in some sense delivering that already today, right?
[00:03:29] When you look at something like, say, Outlook Summarize, it's actually using onboard AI locally.
[00:03:34] Same thing with PowerPoint all text or team super resolution.
[00:03:40] It's just not Microsoft stuff.
[00:03:42] For example, Adobe After Effects or Premiere are both using Windows ML across NPUs and GPUs for local processing.
[00:03:52] And so today, one of the things that I'm really excited about in order to tap into all this compute power is to expand the scope of Windows ML and Windows AI, right?
[00:04:04] So you now have the full install base of GPUs that you can get to.
[00:04:09] So I'm really thrilled that every developer out there can count on building for local onboard AI and then have it run across all of the install base.
[00:04:19] Now, we are also announcing two very cool new models that are all going to run on Windows Inbox.
[00:04:29] The first is a new SLM.
[00:04:31] It's sort of a more efficient model, ION instruct.
[00:04:36] And it's a great reasoning model.
[00:04:40] And then we have the planning model, ION plan, which is a local agentic loop.
[00:04:45] I mean, think about it, right?
[00:04:46] You now have a full local agentic loop.
[00:04:49] You can give it tools access and build a fully onboard agent agentic applications without having to round trip to the cloud.
[00:04:59] And of course, we said in order to really push the limits, realize this unmetered intelligence, there's also a lot of hardware that's evolving too.
[00:05:10] And so we're thrilled to see the innovation across the Windows ecosystem, right?
[00:05:14] So we've got to see what we're doing.
[00:05:15] We've got to see what we're doing.
[00:05:16] We've got to see what we're doing.
[00:05:17] We've got to see what we're doing.
[00:05:18] We've got to see what we're doing.
[00:05:19] We've got to see what we're doing.
[00:05:20] We've got to see what we're doing.
[00:05:21] We've got to see what we're doing.
[00:05:22] We've got to see what we're doing.
[00:05:23] We've got to see what we're doing.
[00:05:24] We've got to see what we're doing.
[00:05:25] We've got to see what we're doing.
[00:05:26] We've got to see what we're doing.
[00:05:27] We've got to see what we're doing.
[00:05:28] We've got to see what we're doing.
[00:05:29] We've got to see what we're doing.
[00:05:30] We've got to see what we're doing.
[00:05:31] We've got to see what we're doing.
[00:05:32] We've got to see what we're doing with Snapdragon C.
[00:05:33] So it's great to see all of this.
[00:05:35] And, of course, that brings us to NVIDIA and RTX Spark.
[00:05:40] This is a next-generation SoC for PCs.
[00:05:43] It brings together the CPU, the GPU, as well as the AI capabilities into a single SoC.
[00:05:50] And that's the exciting part.
[00:05:51] And it includes unified memory architecture and also the integrated DRTM.
[00:05:58] So you now have all of the systems innovation and the SoC coming together.
[00:06:04] And we're really thrilled that one of the first devices that we built was the Surface Ultra.
[00:06:11] It's a beautiful device.
[00:06:14] And it really brings the power of NVIDIA together with the design and the craftsmanship of Surface.
[00:06:22] It's 128 gigabytes of unified memory.
[00:06:26] It's beautiful 2,000 nit display.
[00:06:29] And it has an all-day battery life.
[00:06:33] And so we are very excited to see this later this fall.
[00:06:37] And it's also wonderful to see all the designs from all the OEM partners
[00:06:43] who are building, taking advantage of this new SoC and the new platform
[00:06:48] and bringing out some pretty exciting machines that are going to be all available come this fall.
[00:06:55] And so, of course, we said, okay, this is fantastic.
[00:06:59] What can we do next?
[00:07:01] So we said, let's try and push this and push the architecture to its limit for developers.
[00:07:09] Right?
[00:07:10] What if we could just max the compute, max the memory, build out that developer machine that's the dream machine?
[00:07:19] And that's what we're announcing today.
[00:07:21] Surface, RTX, Spark, DevBox.
[00:07:24] Let's roll the video.
[00:07:25] I was pulled to you like water to the moon.
[00:07:41] I was pulled to you like water to the moon.
[00:07:42] I was pulled to you like water to the moon.
[00:07:43] I was pulled to you like water to the moon.
[00:07:44] I was pulled to you like water to the moon.
[00:07:45] I was pulled to you like water to the moon.
[00:07:46] I was pulled to you like water to the moon.
[00:07:47] I was pulled to you like water to the moon.
[00:07:48] I was pulled to you like water to the moon.
[00:07:49] I was pulled to you like water to the moon.
[00:07:50] I was pulled to you like water to the moon.
[00:07:51] I was pulled to you like water to the moon.
[00:07:52] I was pulled to you like water to the moon.
[00:07:53] I was pulled to you like water to the moon.
[00:07:54] I was pulled to you like water to the moon.
[00:07:55] I was pulled to you like water to the moon.
[00:07:56] I was pulled to you like water to the moon.
[00:07:57] I was pulled to you like water to the moon.
[00:07:58] I was pulled to you like water to the moon.
[00:07:59] I was pulled to you like water to the moon.
[00:08:00] I was pulled to you like water to the moon.
[00:08:01] I was pulled to you like water to the moon.
[00:08:02] I was pulled to you like water to the moon.
[00:08:03] I was pulled to you like water to the moon.
[00:08:04] I was pulled to you like water to the moon.
[00:08:06] I was pulled to you like water to the moon.
[00:08:07] I was pulled to you like water to the moon.
[00:08:08] I was pulled to you like water to the moon.
[00:08:09] I was pulled to you like water to the moon.
[00:08:10] I was pulled to you like water to the moon.
[00:08:11] I was pulled to you like water to the moon.
[00:08:12] I was pulled to you like water to the moon.
[00:08:13] I was pulled to you like water to the moon.
[00:08:14] I was pulled to you like water to the moon.
[00:08:16] I was pulled to you like water to the moon.
[00:08:17] I was pulled to you like water to the moon.
[00:08:18] I was pulled to you like water to the moon.
[00:08:19] I was pulled to you like water to the moon.
[00:08:20] I was pulled to you like water to the moon.
[00:08:21] I was pulled to you like water to the moon.
[00:08:22] I was pulled to you like water to the moon.
[00:08:23] I was pulled to you like water to the moon.
[00:08:24] I was pulled to you like water to the moon.
[00:08:25] I was pulled to you like water to the moon.
[00:08:26] I was pulled to you like water to the moon.
[00:08:27] I was pulled to you like water to the moon.
[00:08:28] I was pulled to you like water to the moon.
[00:08:30] I was pulled to you like water to the moon.
[00:08:31] I was pulled to you like water to the moon.
[00:08:32] And it's got a one petaflop of AI compute, 20 CPU cores.
[00:08:39] All of those things have 128 gigabytes of unified memory access.
[00:08:45] So super excited about this coming in the fall.
[00:08:49] And you can join the wait list.
[00:08:51] I'm on the wait list as well.
[00:08:52] So we'll get there.
[00:08:55] You know, so then we said, why stop there?
[00:09:00] We said, what if we did one more thing?
[00:09:04] And in fact, I think Jensen did that already.
[00:09:07] And he said, Windows is coming to the DGX station.
[00:09:11] So a new, I describe it as the desktop data center,
[00:09:15] which you can have right there and running a 1 trillion parameter model locally.
[00:09:22] I mean, just to sort of put it in perspective,
[00:09:24] it's pretty close to what perhaps we had when we built GPT 2.5 or 3, right?
[00:09:29] One of the first supercomputers.
[00:09:32] So it's pretty crazy to think that we've come this far where you can now have a data center on your desktop.
[00:09:39] And of course, we're extending the developer endpoints even to the cloud.
[00:09:43] Windows 365 now has the developer distribution that's optimized for developer productivity in the cloud.
[00:09:51] So that's the Windows 365 that I use every day.
[00:09:54] And it's simply about making Windows, whether it's on the laptop, the desktop, the cloud, wherever, the best place to build.
[00:10:04] And so to that end, we have tons and tons of updates, right?
[00:10:09] We're starting, by the way, with one of the favorite things for all of us, which is distraction-free dev environment.
[00:10:17] We're also introducing an intelligent terminal, which has got built-in GitHub co-pilot, right?
[00:10:22] Terminal with sort of the co-pilot intelligence.
[00:10:25] And of course, there's lots and lots of Linux love across Windows now.
[00:10:31] So you have 70-plus utilities.
[00:10:34] So grep in full glory is now available for regular Windows access.
[00:10:40] 70-plus utilities, as I said, all of that is coming to Windows.
[00:10:45] We're also bringing things from the Mac that you love, things like Starship, Z-Shell.
[00:10:53] Homebrew is going to be native on Windows as well.
[00:10:56] So you'll be able to switch to Windows.
[00:10:59] And today, we're announcing WSL containers.
[00:11:04] This has, I know, been a point of pain when you're managing all these environments.
[00:11:09] Switching all the mental gymnastics of trying to keep this straight.
[00:11:13] So having first-class support for containers will really help us be in the flow
[00:11:20] when you are building and deploying locally.
[00:11:23] And to show you all of this on our new Surface RTX Spark,
[00:11:29] I wanted to invite up on stage Kayla to just take a spin through all the dev tools.
[00:11:36] Kayla, go ahead.
[00:11:38] Kayla, go ahead.
[00:11:39] Our team has been working hard to make Windows a great place for development.
[00:11:48] And today, I'm going to show you some major improvements and delighters you can try out today.
[00:11:53] So what we have here is the default experience on the Surface RTX Spark dev box.
[00:11:59] Right away, it feels calm.
[00:12:01] There's no news feed, no widgets popping up, no notifications, and of course, we're in dark mode.
[00:12:06] I'm immediately ready to start development.
[00:12:08] Well, there's one thing I'd like to change.
[00:12:11] I like to put my taskbar on the left.
[00:12:13] So let me jump into my taskbar settings using the new run, which is built leveraging the architecture of PowerToys command palette.
[00:12:22] No, no, no.
[00:12:23] I want it all the way on the left.
[00:12:25] There we go.
[00:12:26] After popular demand, we're excited to announce that vertical taskbar is now available in Windows Insider builds.
[00:12:36] The new Surface RTX Spark also has a bunch of key dev tools already installed, like Python, Node, and many more of your favorites.
[00:12:44] All of the developer goodness, all in one file.
[00:12:47] And if you want to get the same experience today on your device, we're making this file available to everyone right now.
[00:12:53] We have a public repo set up with the configuration file and instructions for how to apply it using Winget Configure,
[00:12:59] which will make the adjustments to Windows and install all the tools.
[00:13:03] Now, one cool thing that I have running here is PowerToys' new utility called Grab and Move,
[00:13:08] which lets you hold Alt and move the window around from anywhere.
[00:13:12] Another tip is that you can enable end task, which lets you end the process without having to open Task Manager.
[00:13:19] So let me jump into my dev drive.
[00:13:21] Dev drives run on REFS with Defender running in async and are optimized for performance when it comes to development scenarios.
[00:13:29] Also, File Explorer is Git aware.
[00:13:32] We've got stuff like last change author name, last change message, the status of each file.
[00:13:38] Plus, my favorite is that the branch name is on the bottom left.
[00:13:41] So now let's get started building and open our terminal.
[00:13:45] This is an experimental experience called Intelligent Terminal that makes working with agents even more seamless.
[00:13:52] When you first install Intelligent Terminal, you're greeted with the option to pick your favorite agent.
[00:13:57] I'm going to use GitHub Copilot for today, but you can use whichever agent speaks to you.
[00:14:01] Now here in Intelligent Terminal, I have a regular terminal pane at the top and an agent that's listening on the bottom, and I can work between them while the agent helps along the way.
[00:14:11] So for example, here's an error being generated.
[00:14:14] My agent pane is able to detect it and provide a fix, which is great when I don't remember the syntax, especially for something like Regex.
[00:14:21] So I'm going to work on OpenClaw, and I've already built it using WSL Container.
[00:14:27] WSL Container is a native container experience on Windows, plus it can leverage the GPU, which is perfect for the Surface RTX Spark.
[00:14:34] It can also reference your existing container files, just like the one in the OpenClaw project.
[00:14:41] So here's one of the files open in Microsoft Edit, which ships in Windows by default and just got syntax highlighting in its latest version.
[00:14:49] And then you can also just see your containers running with a simple container images command.
[00:14:54] And since we're on the topic of WSL, we're providing a WSL profile that's designed to feel comfortable for those of you who use tools like Starship,
[00:15:03] ZSH, and Homebrew.
[00:15:05] So it comes preconfigured with all of your favorite utilities, and it's available in the repo that I showed earlier.
[00:15:11] And it also includes one of my favorites, BTOP.
[00:15:15] So the Surface RTX Spark is designed for developer heavy workloads, including serving large local models for coding.
[00:15:23] I've already done some development with a 120 billion parameter model that most machines can't even load.
[00:15:29] So here's a quick view of my usage, and we can see how many tokens I've used locally.
[00:15:33] So we're looking at about 3.4 million tokens leveraged on the device itself.
[00:15:38] Now, we can kick off multiple sub-agents using Fleet.
[00:15:41] And just so we don't have to watch me type, I'm going to use Copilot's voice feature, which is also leveraging its own local model.
[00:15:48] So I'll just hold Spacebar and tell it what I want it to do.
[00:15:52] Find any console.writeline or debug.writeline calls in the tray and node projects and convert them to the standard logger used elsewhere in the code base.
[00:16:04] There we go.
[00:16:05] Now, the main agent will delegate sub-agent tasks of appropriate complexity to the local model, utilizing my GPU and making it more cost efficient.
[00:16:14] Now, as developers, while we're debugging, we're often looking through log files to diagnose any issues.
[00:16:20] Sometimes finding the location of the log files is a challenge.
[00:16:24] I'd love to be able to just type something like grep log and find them all.
[00:16:29] Ah, sweet.
[00:16:30] So on top of already adding curl, tar, and sudo to Windows, now we're adding over 75 command line utilities like env, head, tail, and touch for those of us who love to live in the terminal.
[00:16:44] So I found all my log files, but now having to parse them is the second challenge.
[00:16:51] Well, I've actually had Ion instruct proactively performing analyses on my log files this whole time.
[00:16:58] Now I can quickly diagnose anything that's gone wrong in my development, plus I don't have to worry about token usage because it's all local.
[00:17:05] We can even take a look at our machine's resources.
[00:17:08] The models are loaded, and you can see 90 gigs of RAM being utilized by the GPU, truly showcasing the full power of the Surface RTX Spark.
[00:17:17] We were able to use three local models simultaneously, unmetered, while going about our regular dev flow without a hitch.
[00:17:24] That's huge.
[00:17:26] So I know you're going to love what the team's been working on.
[00:17:37] We hope it gives you a glimpse of what's possible on Windows today and where we're headed next.
[00:17:42] Thank you.
[00:17:43] Back to you, Satya.
[00:17:44] Yeah, that's really the beginning of this idea of unmetered intelligence, having those models and having the agents using the models work in parallel to what you may be doing along with the cloud as well.
[00:18:02] And that, I think, is what the platform enables as first class.
[00:18:07] Now let's move to the cloud.
[00:18:09] The driving equation for us remains the same, which is tokens per dollar per watt.
[00:18:15] How do we optimize around this?
[00:18:17] So when we think about the system's problem, we think about electrons coming on one end and tokens on the other end.
[00:18:24] And how do we think about the system's optimization end to end?
[00:18:28] It starts with the data center design itself, right?
[00:18:31] The core compute, storage, network, all the accelerators that go into accelerating each of those components.
[00:18:38] How do you even think about the DC to DC connectivity and the networking, as well as the offload to something like the local compute, right?
[00:18:48] That's the sort of systems challenge.
[00:18:51] But before we even get into all of the systems and the technology and the innovation, perhaps the most important design criteria for us is how do we earn the permission from the communities in which we're building these data centers?
[00:19:06] And that's where these principles ground us and focus us.
[00:19:12] How do we ensure that the DCs do not increase the electricity prices?
[00:19:17] Making sure that we are replenishing all our water use.
[00:19:22] Creating jobs in the local communities for the local residents.
[00:19:26] Adding to the tax base.
[00:19:28] Making sure we are strengthening the communities by investing in local training and the nonprofits in the area.
[00:19:35] Only when we live up to these principles, do the hard work around it, is when we earn the permission to go ahead and innovate and build.
[00:19:43] And we've been doing a lot of data center build out.
[00:19:47] Today, Azure spans more than 500 data centers in 80 regions.
[00:19:53] It's the most expansive, you know, we have the most expansive hyperscaler footprint out there.
[00:19:58] And we have added more data center capacity in the last 18 months than the first decade of Azure, just to put that in perspective.
[00:20:08] But more importantly, what we are building is also very different.
[00:20:13] In fact, the first 15 years or so, you know, we built out the commercial cloud infrastructure for a set of heterogeneous, you know, workloads that spanned the enterprise.
[00:20:25] But now when you look at what we are building, right, when you think about all the gigawatts that were going to come online, really, they have three dominant workloads.
[00:20:35] There is training, there is inference, and then there is the agent runtime, right?
[00:20:40] These are three dominant workloads.
[00:20:42] And in fact, when you look at Fairwater, it was sort of our first AI super factory.
[00:20:48] You know, it spanned two regions, Georgia and Wisconsin.
[00:20:53] The entire system was designed from the ground up for AI, right?
[00:20:57] We worked, in fact, very closely with even NVIDIA on this.
[00:21:00] And it's a two-story architecture that lets us essentially place racks, obviously, in three dimensions and pack the maximum number of GPUs densely with network access.
[00:21:13] That means you've really got fantastic higher performance networking, lower latency, and more effective bandwidth across the entire cluster.
[00:21:22] And we're rethinking even the power delivery, right?
[00:21:26] So we started, okay, how do we deliver hundreds of kilowatts per row while minimizing all the loss, right?
[00:21:33] Which is the conversion loss that happens from the grid to the silicon.
[00:21:37] So we basically sort of even took a new approach to it.
[00:21:41] And all of this also changes with the cooling system, right?
[00:21:45] And water.
[00:21:46] So in fact, the cooling loop is filled once and the data center can operate effectively with zero water consumption.
[00:21:54] In fact, the daily water usage over the course of an entire year is roughly equivalent to what a single restaurant would use, right?
[00:22:03] And when it comes to the systems and the silicon, again, we have a lot of choice.
[00:22:15] We have first-party silicon.
[00:22:16] We have partner systems.
[00:22:18] We were the very first cloud to bring up, in fact, NVIDIA's Vera Rubin system for validation.
[00:22:25] Very exciting to see that.
[00:22:27] We're working closely with AMD.
[00:22:29] We worked with them on MI300.
[00:22:31] Now we're working with them on their next-generation AMD GPUs.
[00:22:36] Maya 200 is continuing to scale.
[00:22:39] In fact, it's live in Iowa and Arizona.
[00:22:42] We'll deploy it internationally later this year.
[00:22:44] It delivers 30% improved tokens per dollar compared to sort of what's the leading GPU today.
[00:22:51] And we have validated it with 5.5, GPT 5.5.
[00:22:55] And we are going to use that to power Microsoft 365 co-pilot.
[00:22:59] And so when it comes to running these agents, the interesting thing now is it's no longer just about having an AI accelerator or a GPU.
[00:23:11] The CPU is really critical.
[00:23:14] In fact, the ratios may be even coming to one is to one.
[00:23:18] And that's why we are innovating with Cobalt.
[00:23:21] We're announcing the preview of Cobalt 200 VMs, our next-generation ARM-based CPU designed for both cloud-native and agent workloads today.
[00:23:32] So it's exciting to see Cobalt make progress as well.
[00:23:40] You know, one thing that we've been trying to make sure is that it's being optimized for these new workloads I talked about, right?
[00:23:46] These new agent workloads.
[00:23:47] So, in fact, Cobalt delivers 50-plus percent better performance than Cobalt 100 on cloud-native.
[00:23:53] But we started benchmarking them using the GitHub co-pilot traces, these agentic traces, to see because the call patterns are so different.
[00:24:01] And we're now seeing 33% lower latency for the agent calls, 14% faster speed, 23% higher throughput.
[00:24:10] So this is about the core design of both the AI accelerator and the CPU for the agent.
[00:24:17] And, of course, when you talk about AI workloads, you need scale, you need reliability, and that's why the network becomes super critical.
[00:24:26] And, of course, we have been able to focus on it.
[00:24:27] We have innovated with the MRC architecture and rebuilt how traffic across Azure moves to support AI workloads, which are fundamentally synchronous data-parallel workloads, right?
[00:24:40] And so, therefore, they span tens and thousands of GPUs.
[00:24:44] And so you need to have them coherent.
[00:24:47] That's what the next frontier is for us to make sure we're able to keep scaling the network architecture.
[00:24:55] And, of course, it's not about just inside the data center.
[00:24:58] It's also about connecting across the data centers.
[00:25:01] And every Fairwater data center, for example, is connected through our continent-spanning AI-WAN forming this truly fungible compute fabric.
[00:25:14] And, you know, all this innovation is exciting, but when you think about innovation, designing for AI and, I would say, deep understanding of systems,
[00:25:28] you know, there's no better company than NVIDIA and there's no better person than Jensen to talk about it.
[00:25:36] So I wanted to invite, live from Taipei, Jensen Wang, CEO and founder of NVIDIA.
[00:25:43] Thank you so much for being at Build again.
[00:25:53] I know it's late for you in Taipei.
[00:25:56] I really appreciate you staying up.
[00:25:58] You know, I've been looking at social and, you know, and everything people have been talking about since your keynote over the weekend.
[00:26:09] And suddenly, you know, this concept of unmetered intelligence right at the edge is so hot again.
[00:26:18] So maybe you want to talk a little bit.
[00:26:20] You've thought about this, talked about this.
[00:26:22] And now, of course, with RTX Spark really delivered, I think, what's a breakthrough system for AI to be much more ubiquitous.
[00:26:31] But maybe, Jensen, you can just share a little bit your vision around where you see this going.
[00:26:37] Well, this all started about three years ago between a conversation between you and I.
[00:26:41] And we were talking about how we could build a new class of PCs that's incredible for designers and creators.
[00:26:50] And it would be incredible for artificial intelligence.
[00:26:53] And it would be one of these systems that has the processing capability, but also the software stack that's integrated into the world's design packages and creator packages.
[00:27:04] And, of course, all the things that we're doing with AI.
[00:27:07] And here we are.
[00:27:09] Three years later, we built an incredible new chip.
[00:27:13] And this system is supported by all of this new software that you created for Windows.
[00:27:20] And we now have the ability to have essentially an autonomous agent running on the PC.
[00:27:27] Now, when you take a step back and you think about what does that mean?
[00:27:30] For the 40 years or some 30 years we've been working together, we went from inventing DirectX together to creating now this incredible computer that has autonomous systems running.
[00:27:47] The PC evolved from being an incredible tool to now being a tool that's used autonomously by an AI assistant.
[00:27:56] And so the idea that I could be traveling and I'm on the phone and I could text my PC and ask my PC to get some coding done or some idea that I have and it would fire up the tools on the PC.
[00:28:12] And it would make the modifications or the changes or the design that I told it to do and it would iterate with me while I'm away from the PC.
[00:28:21] My PC became an assistant while I'm sitting there, of course, this PC would be my great assistant as well.
[00:28:27] And so this idea that the PC evolved from a personal computer to a personal AI, it's just really exciting.
[00:28:37] And to see it come to life, Satya, to see it come to life and actually doing that, you know, so I'm super excited about it.
[00:28:44] Spark, as you mentioned earlier, has all this incredible capabilities, a petaflop of AI performance.
[00:28:50] It has a petaflop of NVFP4, this numerical format that our two companies worked on together that allows us to take advantage of this 128 gigabytes of memory and fit maybe a couple of hundred billion parameter model.
[00:29:07] A couple of hundred billion parameter model is state of the art.
[00:29:10] And so I think the days of having a really smart assistant running autonomously on the PC is here.
[00:29:16] Yeah, no, it's so awesome.
[00:29:17] And in fact, I'm also excited about Windows coming to the GB300.
[00:29:21] And so that's another thing that it's kind of like data center right on your desktop.
[00:29:26] And it's so exciting.
[00:29:27] But talking about that data center side, obviously, you know, this entire thing got started when we built the first supercomputer together to train the GPT models.
[00:29:40] And we have come a long way.
[00:29:41] In fact, even I was talking about the Fairwater design.
[00:29:44] It is custom built essentially for the Grace Blackwell era to be able to max the data center design with the system design you had.
[00:29:54] And now, of course, we're validating Veera Rubin.
[00:29:56] We're very excited about it.
[00:29:57] Maybe you want to share a little bit about sort of what happens even on the cloud side with how you're pushing on the systems innovation.
[00:30:05] Well, our journey is incredible.
[00:30:09] We built the first AI supercomputer together.
[00:30:12] That was based on Ampere.
[00:30:13] Of course, Hopper was an incredible success.
[00:30:16] The first two generations were focused on pre-training.
[00:30:19] Grace Blackwell came along and all of the focus moved to post-training, reinforcement learning, which allowed us to have reasoning models.
[00:30:30] And these reasoning models, based on mixture of experts, were incredibly intelligent, energy efficient.
[00:30:36] But it requires giant systems.
[00:30:39] And so we created NVLink 72, and the entire rack became one computer.
[00:30:45] We had evolved from one node to now one rack.
[00:30:49] Well, Microsoft deployed the largest number of Grace Blackwells in the world today.
[00:30:56] The fastest and the largest number of Grace Blackwells in the world.
[00:31:00] Fairwater is just a magnificent system to look at.
[00:31:05] It's just a miracle of engineering.
[00:31:07] It's just an incredible feat.
[00:31:09] It's completely liquid cooled.
[00:31:11] You mentioned something earlier that I'm very proud of as well, that it's closed looped, basically uses almost no water.
[00:31:18] And it's incredibly environmentally friendly.
[00:31:21] It's energy efficient.
[00:31:22] We're able to increase the token generation rate and reduce the cost of token generation by an order of magnitude, some 30 times over Hopper.
[00:31:33] So that was a huge achievement.
[00:31:34] Well, Vera Rubin was created for a world where these AIs are now agentic.
[00:31:41] And so where, as Hopper was created for pre-training, Grace Blackwell for training, post-training, and also inference, Vera Rubin is designed to run agents.
[00:31:52] It's agents, as you know.
[00:31:54] This computing pattern is exactly the same computing pattern we're going to run on the RTX Spark.
[00:32:00] It's exactly the same agentic system, except, of course, it's going to be much, much larger.
[00:32:05] We're going to process an enormous number of them simultaneously.
[00:32:09] Many of them are going to be from different customers and different partners.
[00:32:13] And so the entire path, the entire coding path from storage, which is the long-term memory, the working memory, is encrypted.
[00:32:23] The data is encrypted in transit.
[00:32:27] The data is also encrypted in use.
[00:32:29] And so we're going to really innovate in the area of confidential computing.
[00:32:34] And so this entire disaggregated, distributed computing system, you mentioned CPUs.
[00:32:39] Vera is a revolutionary CPU designed for agents.
[00:32:43] You know, the past CPUs were designed for humans.
[00:32:45] And, you know, we're just more patient than agents are.
[00:32:49] And agents want low latency, just as you've been working on as well.
[00:32:54] Vera was designed for extremely low latency.
[00:32:56] And so Vera Rubin is just completely revolutionary.
[00:32:59] I can't wait to show it to everybody.
[00:33:02] You've already stood it up.
[00:33:03] Yep.
[00:33:04] Our two teams, our two teams have been working very closely.
[00:33:07] You know, almost long before the chips taped out, long before the systems were brought up.
[00:33:14] Our two teams were already completely aligned.
[00:33:16] And so the design, the data centers were created for Vera Rubin.
[00:33:20] Vera Rubin is designed and integrated into your complete stack, into your networking, into your security.
[00:33:25] And so the moment that our systems were rolling off the lines, they were being stood up at Microsoft.
[00:33:31] So I'm incredibly excited about the collaboration.
[00:33:34] Yeah, I know this speed of light execution between the teams is fantastic to see.
[00:33:38] And, of course, all this is to power the ecosystem around us, right?
[00:33:42] I mean, you and I, having grown up with the PC, the server, and now with AI, have always thought about, ultimately, it's about creating the opportunity for every developer, every organization to build on the work that we do and the platforms we create.
[00:33:56] And speaking of that, there's a lot of software that NVIDIA builds that's all also coming.
[00:34:02] For example, we're going to have your models in Foundry, your tooling in Foundry.
[00:34:06] We're going to have, in fact, your software even help us with accelerating our workloads when it comes to even the data warehouse.
[00:34:14] We're going to obviously have stuff in Windows.
[00:34:17] Just talk a little bit about that broader vision of what does it mean for us an opportunity, right?
[00:34:23] Because everybody talks about this one model or one piece of tech, but it's about the broadest, biggest opportunity for people to create value.
[00:34:32] Maybe you want to share a little bit about that.
[00:34:35] Well, we've been preparing for this moment.
[00:34:38] What happened in the last several months, we've been working for a decade and a half together, getting ready for really what happened in the last several months.
[00:34:47] All of a sudden, because of agentic systems, the convergence of these really great models, AI is now useful.
[00:34:54] If you just look at GitHub, the commits into GitHub has gone completely parabolic.
[00:35:03] In the last several months, the number of commits increased by a factor of three.
[00:35:08] It's clear that agentic systems are useful, that it's doing productive work, and also tokens are now profitable as a result.
[00:35:18] And so the amount of demand for compute between the usage of the AI and the computation that's necessary for agents, the compute demand has really gone through the roof.
[00:35:29] Well, one of the things that we've been doing together is making sure that all of the tools that the agents are going to use are fully accelerated.
[00:35:38] Fabric, for example, is now fully accelerated.
[00:35:41] We're accelerating data processing, SQL, Spark, semantic-based, vector-based, graph-based.
[00:35:51] We're going to make sure that all of the tools that are available on Azure are going to be fully GPU accelerated because the agents are going to be impatient.
[00:36:00] The faster we can get the answers back to the agents, the faster they can iterate, the faster we can generate tokens, which are ultimately what the developers, both of our customers would like to do,
[00:36:09] is generate a lot of tokens that are really profitable, that are highly intelligent.
[00:36:13] Thank you so much, Jensen, for the partnership and the leadership and the innovation that you bring to this entire ecosystem,
[00:36:22] and really thrilled to be working closely with you and the team and bring all this to the developers here and beyond,
[00:36:28] and look forward to seeing what the next sort of few months and the next year bring in terms of the innovation that gets built on top of the platform.
[00:36:37] So thank you again for joining this late in the night from Taipei.
[00:36:41] Thank you so much, Satya, for your partnership and friendship.
[00:36:44] Thank you.
[00:36:45] Thank you.
[00:36:50] So, so far, we've talked about the edge and the cloud.
[00:36:55] The current form factors, right?
[00:36:57] I mean, when I saw that Jensen picture from the weekend where he had all the desktops, I felt like, man, I'm back in the '90s, right?
[00:37:03] Because it was so cool to see the lineup of all the machines that I loved and I grew up with back yet again,
[00:37:12] with new functionality, right?
[00:37:14] The same form factor, but unbelievable new functionality because of the onboard AI capability, right?
[00:37:21] So that's sort of what we've seen with the laptop, the desktop, and of course, with the cloud.
[00:37:27] But it also, you know, sets up that next question.
[00:37:32] If you have that capability, which is new function and you can put it into existing form factors,
[00:37:39] can you even purpose build new form factors for the new function?
[00:37:46] Can you build a new platform even for the agent era?
[00:37:51] And that is the motivation behind Project Solara, which we're introducing today.
[00:37:57] And to talk about this, I wanted to invite Stevie on stage.
[00:38:00] But first, let's roll the video.
[00:38:02] First, there's you.
[00:38:05] Then, everything in front of you.
[00:38:11] We get it.
[00:38:12] The noise.
[00:38:14] The weight of it all.
[00:38:17] And so we've been building something.
[00:38:21] A new chip-to-cloud platform for an agent-first world and for new agent-first devices.
[00:38:28] You come close.
[00:38:31] It's ready when you are.
[00:38:34] You speak.
[00:38:35] It understands you.
[00:38:37] What matters comes forward.
[00:38:39] Everything else falls away.
[00:38:43] And when you want it to keep going, it keeps going.
[00:38:48] Your call.
[00:38:50] Sometimes it sits with you.
[00:38:53] Sometimes it goes with you.
[00:38:56] Sometimes it sees what you want it to see.
[00:38:59] It doesn't decide for you.
[00:39:02] It just lights the path.
[00:39:04] It tries to clear the way when you need it to.
[00:39:08] That's the idea.
[00:39:10] A whole ecosystem built to clear the way.
[00:39:14] So all that's left is where you need to go.
[00:39:17] Wherever you are.
[00:39:19] Whenever it matters.
[00:39:25] I am so excited to be here.
[00:39:29] It really is great to be back on this stage.
[00:39:32] Now, before I talk about those awesome new devices you just saw.
[00:39:36] Let me start with the why.
[00:39:39] Back at Build 2023, I talked about the outside AI application structure.
[00:39:45] Where AI moves from operating within the application frame to operating globally, working across multiple apps and services to connect, coordinate, and maintain context across entire workflows, devices, and time scales.
[00:40:03] What if there were an ecosystem of devices specifically designed for that new type of application structure, for those types of agents, for that transformational interaction technology?
[00:40:17] That is the impetus behind Project Solera.
[00:40:21] But with so many possible forms, which one do you pick?
[00:40:24] What is the next device?
[00:40:26] You see, the big aha for us is that it's not about choosing one specific form factor.
[00:40:33] It is about creating a system that extends your agent across a constellation of devices.
[00:40:39] The next computer is not one device.
[00:40:42] It is all these devices working together as one system, with agents showing up closer to where and when you need them.
[00:40:51] To realize this vision, two challenges immediately show up.
[00:40:58] First, many specialized form factors already exist, but often rely on custom one-off apps and fragmented stacks that are difficult and expensive to build, deploy, and maintain.
[00:41:10] And second, across every industry, people and organizations are already building their own agents, deeply specialized and instrumented for their work.
[00:41:20] But the impact of those agents is constrained by how and where they can exist.
[00:41:26] Project Solera addresses both by giving organizations a way to extend their agents onto new, purpose-built, easy-to-manage form factors designed to reach the nooks and crannies
[00:41:38] where conventional computers either do not exist or are not optimal.
[00:41:43] It's a turnkey solution for building unique agent-first devices enabled by three pillars.
[00:41:51] First, it's enterprise-ready, enabled by the AOSP-based Microsoft device ecosystem platform.
[00:41:58] Second, it has an agent-driven interaction model with just-in-time UI that adapts to the form factor.
[00:42:07] Third, it has extensibility so you can bring your own agents.
[00:42:12] And tying it all together is Azure, unifying the system across cloud and device.
[00:42:17] Okay, that's enough of that.
[00:42:20] Now, let's talk about the devices.
[00:42:22] Today, we're previewing two very broad categories.
[00:42:26] The first is stationary and the second is portable.
[00:42:29] The first device is designed for your desk and it's built on MediaTek's silicon.
[00:42:34] With Hello for Business, just walking up to the device securely signs you in, giving you direct access to your agents, just like Nathan's about to show you here.
[00:42:45] For the information worker, this means frictionless, yet protected access to Microsoft 365 co-pilot grounded in WorkIQ.
[00:42:55] And with a simple glance, it surfaces what matters next in your workday, helping you think, plan, and even act by delegating tasks to your agents with a simple tap or just using your voice.
[00:43:07] Think of it as a dedicated, secure ambient device for work.
[00:43:11] It even supports experiences like handoff between devices, acting as a companion to your existing Windows PC.
[00:43:19] Or, it can even let you access your cloud PC through Windows 365 and a connective monitor.
[00:43:25] How cool is that?
[00:43:33] Now, the second device is portable.
[00:43:35] It's reimagining a wearable that millions of people use every day.
[00:43:40] The access badge.
[00:43:42] Built using Qualcomm's silicon for wearable, this digital badge is a lightweight form factor designed for agent information.
[00:43:48] It's designed for agent interactions on the go.
[00:43:51] And then it's adaptable across a variety of verticals and workflows.
[00:43:55] All right.
[00:43:56] I have here an early prototype of the badge.
[00:44:01] And so, thanks.
[00:44:02] And using my fingerprint, I tap to unlock the device.
[00:44:06] And I have access now to all my agents in a secured manner.
[00:44:10] And would you look at that?
[00:44:11] I already have a task.
[00:44:12] And it says, gather content for your social media posts for today.
[00:44:16] So, why not just do it right now, right?
[00:44:19] So, I'm going to hit record.
[00:44:21] And then now the device camera is recording.
[00:44:24] I'm going to pan across.
[00:44:25] I hope you don't mind.
[00:44:26] I'm going to take your shots.
[00:44:27] Yes, thank you.
[00:44:28] Co-pilot.
[00:44:29] Find some good shots from this.
[00:44:31] Clean them up.
[00:44:32] And then send them to me.
[00:44:33] So, for me and my team to review.
[00:44:36] All right.
[00:44:37] And then there you have it.
[00:44:39] Now my agent's off, running through multiple tasks.
[00:44:42] I actually clean this up and send them to me and the team.
[00:44:44] That's pretty cool.
[00:44:45] Thanks.
[00:44:50] Okay, I know it's a simple demo, but it's all agent-driven.
[00:44:54] And there are so many verticals, so many opportunities.
[00:44:57] I mean, for example, imagine in healthcare.
[00:44:59] From the moment you pick up the device, the right agent shapes the experience around the role and the workflow.
[00:45:06] Helping with check-ins, patient records, and critical insights.
[00:45:10] All through enterprise-grade secure access.
[00:45:14] And with the built-in microphones, the nurse can start a hands-free voice-based documentation, including diarization and annotation.
[00:45:22] And the side-facing camera can be used to verify and document patient vitals or even scan in medications and help verify workflows.
[00:45:30] You know, these are just a few examples of how the small purpose-built wearable can bring intelligence directly into the flow of patient care,
[00:45:39] helping nurses access and gather and even act on information while staying present with the patient.
[00:45:47] And while both the stationary and portable concept devices represent a specific expression of agent-first devices,
[00:45:54] their core reference hardware and software are designed to be highly flexible.
[00:45:59] I mean, just with a few changes, loading a different agent, adjusting the shape, the screen size, the sensors, or even input methods,
[00:46:06] the same foundation, the same software can be adapted for many verticals and workflows,
[00:46:12] such as retail, industrial, hospitality, financial service, legal, and so forth.
[00:46:16] I mean, that is the power of the platform. That's flexibility.
[00:46:21] So whatever your scenario, there are thousands of untapped opportunities to bring agent workflows into places where computing has not naturally fit before.
[00:46:30] And while this is an early look, we're really excited that AccuWeather, Best Buy, CBS Health, Levi's, Target, and others are working towards exploring how specialized agents and devices can improve their workflows.
[00:46:45] This is the broader opportunity for the ecosystem. Agents moving outside the app and taking shape in devices designed for a specific scenario, a specific customer, and a specific place.
[00:47:00] And for all of you, this is the moment to imagine where your agents should live, what form they should take, and what new work they can unlock.
[00:47:15] Last week, Saito sat down with Cristiano Eman from Qualcomm to discuss this future. Let's roll the video.
[00:47:22] Cristiano, it's so wonderful having you here at Device Lab and talk about these new reference designs.
[00:47:29] Very happy to be here. I have been to this Device Lab a lot recently.
[00:47:34] You know, one of the things that you and I have chatted for a while is how there's a real platform shift.
[00:47:40] We're moving from building operating systems, devices for apps to agents.
[00:47:47] You want to talk a little bit about how you see this?
[00:47:49] Yeah, absolutely.
[00:47:50] And every generation of technology transition within the shift, and I think this one seems to be a very big one.
[00:47:56] Agents really changes the whole nature of the device in itself.
[00:48:01] Starting with the fact that, you know, if the AI understand the world the way we understand it, it's going to be closer to our senses.
[00:48:09] It's going to be closer to our eyes, to our mouth, to our ears.
[00:48:12] It's going to be things that we wear, and it's changing the nature of the computing.
[00:48:17] I think you need a computing that is now geared towards real-time context and from silicon to cloud.
[00:48:24] I mean, this is sort of one of the reference designs we have, which I love.
[00:48:27] There's stuff happening at the edge here on this device.
[00:48:31] There's things on the cloud.
[00:48:32] You want to talk a little bit about the core systems and silicon implications for such a new ecosystem?
[00:48:39] You need a very power-efficient CPU.
[00:48:42] The whole silicon is designed for you to have a cloud-native experience.
[00:48:46] And then you have a lot of sensors.
[00:48:48] Right.
[00:48:49] A lot of sensors for context.
[00:48:50] It's a much more personalized and bespoke experience than an app in itself.
[00:48:55] And I think that's changing the nature of devices.
[00:48:58] And even the definition of a wearable platform is changing in itself.
[00:49:03] And you started to see those incredible new form factors.
[00:49:06] That computing platform, one of the things that you and I have talked a lot about is,
[00:49:11] how do you build it such that there is an open ecosystem?
[00:49:15] Right?
[00:49:16] Because it's not about one agent.
[00:49:17] It's about any agent.
[00:49:19] This is what makes it very, very interesting and exciting.
[00:49:23] Because the smartphone today is at the center of your digital life,
[00:49:29] the job of those devices today is to be extending the functionality of the smartphone.
[00:49:35] And because of that, you actually saw that many of those platforms became vertical.
[00:49:40] It was a natural thing to have a vertical platform from the same company because the phone was at the center.
[00:49:46] That changes dramatically.
[00:49:48] Right.
[00:49:49] When you think about agents.
[00:49:51] And agents becomes the center of your digital experience.
[00:49:56] And I think the industry is going to be looking for an open horizontal platform that actually enable the agents to be interacting with the best possible device for different applications.
[00:50:09] And I think that's what's exciting.
[00:50:11] And we want to make it possible for anybody who's thinking of an agentic system to not just be bound to current devices,
[00:50:18] but to imagine that there can be many, many devices that carry that intelligence in different contexts.
[00:50:24] And so that's the open ecosystem we want to build together.
[00:50:27] And again, it's so great to be partnered with you to get this started.
[00:50:31] That's awesome.
[00:50:32] We're very proud of this partnership and it's just the beginning.
[00:50:34] Thank you very much, Stevie and Cristiano.
[00:50:43] We are very excited about Project Solara.
[00:50:45] It's sort of a new platform, but perhaps more importantly, it's a set of new platform rules that don't, you know, in some sense hem in what you can imagine.
[00:50:57] The type of form factors where your agents live, I think always whenever these new platforms have come, you get to rewrite even the rules of how new platforms operate.
[00:51:11] And that's what we are trying to get done with Project Solara so that you as developers and enterprises have the flexibility to imagine the form factors that you want and have your agents be ubiquitous.
[00:51:24] Now let's go up the stack to the next layer.
[00:51:29] We are building a new intelligence layer, bringing together the models, context, as well as the tools.
[00:51:38] It, of course, starts with model choice.
[00:51:41] Every customer, every developer is going to choose the right model for the right task and their eval mix, latency budget, even COGS budgets.
[00:51:53] And Foundry today has over 11,000 models.
[00:51:58] It's the largest model catalog out there from OpenAI to Anthropic and, of course, even MAI models.
[00:52:06] And just last week, we brought the OpenAI real-time voice models along with Claude Opus 4.8 to Foundry.
[00:52:14] So we're continuing to bring all these frontier models.
[00:52:19] The consideration, though, about the models is becoming now increasingly key.
[00:52:24] In fact, from the last developer conference to now, when you're building any agentic system, having this context really shaped right is becoming super important.
[00:52:36] And, in fact, it starts right at the data tier, right?
[00:52:39] The data state to date has been built for applications that supported these user-facing applications, right?
[00:52:46] Now you have to change and build them for agents, which, again, have very different call patterns even to the data tier.
[00:52:53] Agents are continuously storing, retrieving, reasoning, acting, and learning, right?
[00:53:02] That's sort of what's happening in a continuous loop.
[00:53:05] And you see that today.
[00:53:06] Agents are using things like Cosmos DB for their memory.
[00:53:10] In fact, ChatGPT does.
[00:53:11] Azure Search is used for retrieval of indices and embeddings.
[00:53:17] Fabric IQ for the semantic models and the ontologies instead of the business logic tier effectively for agents.
[00:53:25] And also, you have Fabric real-time intelligence for all the observability traces, which now brings me to a very exciting new service, Horizon DB, which is our fully managed PostgreSQL service on Azure.
[00:53:42] Really thrilled to have this.
[00:53:47] I mean, one of the things we wanted to make sure is we built, ground up, a PostgreSQL managed service which was for high availability,
[00:53:54] scale out.
[00:53:57] It's zoned redundant with automated failover, 128 terabytes of storage per cluster, 15 read replicas.
[00:54:06] I mean, the read-heavy workloads you can scale with this managed service.
[00:54:12] And in our internal testing, we're seeing something like, you know, Horizon is delivering 3x throughput compared to any self-managed PostgreSQL setup.
[00:54:23] So, I think it's more critical than you think about the scale you need.
[00:54:26] The other data workload we're also changing pretty dramatically is the data warehouse.
[00:54:32] In a world where agents are constantly querying data, the data warehouse effectively becomes a pretty mission critical, right?
[00:54:41] I mean, it's one thing to have it be mission critical for users, but when agents need the analysis done on the fly, you know, bringing GPU acceleration to fabric is super key.
[00:54:54] And we are seeing 7x performance gains.
[00:54:58] So, it's really thrilling to see that AI acceleration.
[00:55:01] So, if you sort of have the data tier, the layer about this is the IQ layer that we are building that brings together essentially the model capabilities to deal-- along with the data, right?
[00:55:21] So, you kind of mix the data and the model capabilities so that you can deliver that right context to unlock intelligence, right?
[00:55:30] That's where-- in fact, if you want-- when people talk about token efficiency, this is perhaps the most important consideration, right?
[00:55:36] If you structure the context right and feed the models, then you will, by definition, going to be so much more token efficient.
[00:55:45] The first domain is the web.
[00:55:49] Web grounding is so important.
[00:55:52] You need that fresh, high quality, and fast web data.
[00:55:59] And that's why we are really, really excited to announce today WebIQ.
[00:56:09] And WebIQ is built on our global infrastructure that's already serving over a billion users, but fundamentally re-architected for the LLM and the agentic workflow.
[00:56:22] It's model agnostic, it's MCP native, plugs right into any agent runtime.
[00:56:29] It has web, it has news, images, video.
[00:56:32] So, agents can ground responses in fresh, verifiable content.
[00:56:38] And WebIQ leads across all of the three key criteria, right?
[00:56:44] It's best in class in quality, it's best in class in speed, as well as in cost.
[00:56:49] So, we're very, very thrilled about WebIQ being in the developers' hands as you build out agentic systems.
[00:56:55] Of course, we're not stopping there.
[00:56:57] Beyond the web, every developer wants to be able to ground their agents on what's the most valuable data across the enterprise.
[00:57:05] And so, we are bringing together Foundry, Fabric, and Microsoft 365 as this unified IQ layer, right?
[00:57:15] It's a continuously updated understanding of your organization to show you all of this rich IQ in action, building real life agents.
[00:57:27] Here is Elijah.
[00:57:28] Elijah, take it away.
[00:57:29] Elijah, take it away.
[00:57:34] Agents are only as good as the context we give them.
[00:57:38] Microsoft IQ unifies enterprise intelligence for every organization.
[00:57:44] I'm here at a Power Utilities Control Center, and we'll start by running a long-running agent.
[00:57:50] This agent is going to help us assess the current grid operations incident and produce a brief for us so we can respond accordingly.
[00:58:01] Now, I'm going to go ahead and kick that off.
[00:58:03] And while that runs, let me show you how we got here with context from Microsoft IQ.
[00:58:08] We built our agent in Microsoft Foundry.
[00:58:11] It's connected to various tools, and it's also wired to a Foundry IQ knowledge base.
[00:58:17] A single grounded source that packages our documents, operational data, and people into context the agent can reason over.
[00:58:26] After building the agent in Foundry, we published it to Microsoft 365 for the whole team.
[00:58:32] Let's go see it in action.
[00:58:35] Now, here in M365, I will start with a question about current events that an ungrounded LLM wouldn't be able to answer.
[00:58:42] I'm going to ask about current electricity prices in SF.
[00:58:47] For this, our agent pulls in the first IQ in our toolbox, WebIQ, search built for the AI era.
[00:58:54] WebIQ delivers industry-leading quality, velocity, and efficiency.
[00:59:00] WebIQ constantly indexes fresh, official sources from across the web.
[00:59:05] And additionally, WebIQ does a great job with semantic documents.
[00:59:09] And here we can see WebIQ gave us our answer grounded in reality.
[00:59:15] Now, not only can we deploy our agent here in M365, but we can also embed it in our custom apps.
[00:59:21] Let's head back to the control center.
[00:59:23] Let's see how we handled a previous incident.
[00:59:26] So after using WebIQ to gather external info, we ask for details about our potentially at-risk substations.
[00:59:34] The real power of Microsoft IQ comes when we combine that external knowledge with our own internal enterprise context.
[00:59:42] For this information, our agent pulls in the next layer of Microsoft IQ, Fabric IQ.
[00:59:49] Here is Brightline's grid represented as a Fabric ontology, an operational model of the live grid.
[00:59:57] And critically, we didn't build this from scratch.
[01:00:00] Fabric takes the Power BI semantic models used by millions of customers today and lets teams extend them into rich ontologies that help run the business.
[01:00:10] This model, yes, this model is coupled with live telemetry.
[01:00:15] So it reflects the real operational state of the grid minute by minute.
[01:00:20] This is what Microsoft IQ means by enterprise intelligence.
[01:00:24] Not data scattered across disparate systems, but a single living model of the business that an agent can reason over.
[01:00:35] Now, turning back to our agent, we can see that it gave us a table of our most exposed substations.
[01:00:43] That's the agent querying the ontology you just saw.
[01:00:46] Now, the agent has access to both the outside world and our grid.
[01:00:50] The last piece is the one that turns a situation into a response.
[01:00:54] Our policies and our people.
[01:00:57] By asking what are the steps to respond to a substation trip, we activate the final layer of Microsoft IQ, Work IQ.
[01:01:06] This is Brightline's response procedure in SharePoint.
[01:01:10] It's the playbook the team actually reaches for when something goes wrong.
[01:01:14] And the important thing is, the agent isn't working from a stale upload or copied snapshot.
[01:01:20] It's answering from the same source the team maintains day to day.
[01:01:25] When the procedure changes, the answer changes with it.
[01:01:28] No re-uploads, no prompt rebuilds, and no stale versions.
[01:01:32] And critically, this is your knowledge.
[01:01:35] The assets you create stay with you no matter what model or agent is reasoning over them.
[01:01:41] Now, if we return to our agent, we can see the proper way to respond to this issue.
[01:01:47] Not a generic recommendation, but our playbook applied to this incident.
[01:01:52] Now, we just saw one situation, three questions, and one connected answer.
[01:01:57] Now, let's go check back in on our long-running agent.
[01:02:00] And, oh, let's check the backup really quick.
[01:02:04] Moment of truth.
[01:02:06] Moment of truth.
[01:02:09] And, boom.
[01:02:13] Our task finished.
[01:02:18] Here we can see every step the agent took.
[01:02:22] First, beginning with WebIQ, connecting it to the outside world.
[01:02:26] Second, FabricIQ through Foundry, anchoring it in the real state of our operations.
[01:02:31] And, third, WorkIQ, grounding it in our people and procedures.
[01:02:36] I triggered this manually, but with Foundry routines, this can run on a schedule,
[01:02:40] turning a one-off response into continuous proactive execution.
[01:02:44] And, if we look closely, we can see it even use the power of WorkIQ to alert me directly about the incident.
[01:02:52] And, if we go ahead and check Teams, it sent me an incident brief notifying me of the situation.
[01:03:04] That's the power of Microsoft IQ.
[01:03:06] When a crisis hits, the team doesn't chase answers across a dozen systems.
[01:03:10] They ask a question and get a response grounded in the world, their operations, and their people,
[01:03:15] all in one place they can trust.
[01:03:17] Back to you, Satya.
[01:03:23] All right.
[01:03:24] Thank you, Elijah.
[01:03:26] And now, you know, let's sort of move up from this context model layer
[01:03:32] to deploying these agents and thinking about the runtime.
[01:03:36] And, you know, when you're building a first-class agentic system, you need a first-class agent runtime
[01:03:45] and a platform that we are going to ship both with Windows as well as part of Foundry in Azure.
[01:03:51] We want Windows to be a fantastic place to run and scale agents.
[01:03:57] And agents, effectively, are a new execution environment, right?
[01:04:02] It's a new paradigm even.
[01:04:03] They reason continuously.
[01:04:05] They generate and run code dynamically.
[01:04:08] They take actions across files and devices as well as across the network.
[01:04:15] Obviously, there's a lot of power in it, right?
[01:04:19] The fact that it can generate code and act on it on a long-running agent that's autonomous.
[01:04:24] But, obviously, it creates new risk.
[01:04:27] And that's why, today, we're introducing Microsoft Execution Containers, or MXC.
[01:04:33] MXC is a new policy layer that lets Windows apply isolation and containment using AI-native,
[01:04:43] or rather, sorry, OS-native primitives, right?
[01:04:46] You need to bake this into the operating system so that the containment is enforced by policy.
[01:04:52] You can have process-level isolation for lightweight agent actions.
[01:04:58] You can have session-level isolation for user separation.
[01:05:02] So, Windows and Linux virtual machines, in fact, are also great, including WSL, for much stronger boundaries.
[01:05:09] And, in fact, if you want full isolation and containment, Windows 365 for agents for maximum isolation in a separate managed environment effectively.
[01:05:19] You can pick the right containment option for the workload, and Windows will enforce it via MXC.
[01:05:26] And so, I think this becomes pretty critical as you think about deploying agents at scale on your Windows desktop.
[01:05:34] We want to ensure containment is enforced, of course, regardless of who builds the agent, right?
[01:05:40] So, this is why you want to bake it into the operating system.
[01:05:43] To that end, we are working with many partners to ensure that containment, we are building supports real developer workloads out there and also addresses the needs.
[01:05:53] In fact, NVIDIA is bringing OpenShell to Windows to securely execute autonomous AI agents.
[01:06:00] And today, we are really thrilled to announce that OpenClaw runs on Windows leveraging MXC.
[01:06:14] Yeah, we are very deeply engaged with the team to make OpenClaw run super well on Windows.
[01:06:20] And so, let me hand it over to my colleagues, Scott and Samantha, to show you OpenClaw on Windows.
[01:06:27] Scott, Samantha.
[01:06:30] Hey, friends.
[01:06:35] You know, OpenClaw came out in November of last year, and it took the world by storm.
[01:06:40] And for the last several months, I've been reusing my OpenClaw to stay on top of my health.
[01:06:44] My claw can help me manage my blood sugar, and it gives me even proactive notifications via heartbeat.
[01:06:51] I've got mine triaging my personal email, it's doing my GitHub issues and tracking packages, and it even buys me movie tickets.
[01:06:58] So, Samantha, what are you using your claw for?
[01:07:00] That's cool, Scott.
[01:07:01] I turned my OpenClaw agent into my triathlon coach.
[01:07:04] I'm nowhere near ready for my race in September, but Coach Claw developed a work back plan for me and is using my Strava data to notify me on how I'm progressing and to keep me accountable when I'm slacking.
[01:07:16] Now, we've both found our claws to be super useful, and that's why we're working closely with OpenClaw to make them successful and even more successful on Windows.
[01:07:25] We've been collaborating in the open on GitHub to bring you all an OpenClaw Windows companion app.
[01:07:31] It's going to help you set up your own claws or connect to existing ones, whether they're hosted in Windows or in WSL.
[01:07:37] And the Windows companion, we're going to sandbox the OpenClaw tool calls to keep you and your system safe.
[01:07:44] Yeah, you'll see the OpenClaw Windows companion app running right now in the background.
[01:07:48] Go ahead and right click on it, Scott.
[01:07:50] All right.
[01:07:51] That looks awesome.
[01:07:55] You'll notice immediately it looks like a native Windows app because it is.
[01:07:59] It's written in WinUI 3.
[01:08:00] It's got all kinds of information about my gateway, other machines that are participating in my claw, my sessions, and my usage.
[01:08:08] I've also got quick access to things like chat, Canvas, the main dashboard, and more.
[01:08:14] Let's jump into companion settings.
[01:08:16] Within the app, we've got full chat support with tool calling.
[01:08:21] And you'll notice down here in the corner, we've got lots of permissions options along with our sandbox configuration.
[01:08:28] Now, this sandbox is really interesting because this is using MXC, the Microsoft Execution Containers.
[01:08:34] And for this, we're going to be using process isolation.
[01:08:36] Now, newer versions of Windows are going to have even more containment options.
[01:08:40] So you're going to want to keep an eye out for more news with MXC in the future.
[01:08:43] Now, I can see that I've got one-click security option settings, but Samantha, talk to me about custom folders.
[01:08:50] Yeah.
[01:08:51] You've got full support about what files and folders you want OpenClaw to have access to.
[01:08:56] And really granular security features like clipboard access or talking to the internet itself.
[01:09:01] Now, I've given it read access to your desktop folder.
[01:09:07] OpenClaw already has a rich safety layer and that layer is only augmented more by appropriate containment that can be managed by me or policies applied by IT.
[01:09:17] Now, for the purposes of this demo, I'm going to do something really scary and ask OpenClaw to delete all the files on your desktop.
[01:09:25] That's cool because I have a nice clean desktop.
[01:09:28] I think you're keeping a secret.
[01:09:30] You hid all your icons from the audience while we are on stage.
[01:09:33] I think we need to show the audience who you really are.
[01:09:37] So disrespectful.
[01:09:38] I know where everything is.
[01:09:40] Just don't touch my stuff.
[01:09:43] I know where they are.
[01:09:45] You know, Sam, I need to make sure that you're clear that a messy desktop is an organized mind.
[01:09:49] I'm pretty sure that that's the that's the quote, right?
[01:09:53] So what we've done is we've asked OpenClaw to delete those files from the Windows node.
[01:09:57] And the only thing that is going to keep from happening is MXC because we've turned off all of the many layers that OpenClaw offers.
[01:10:03] But our IT, in this case, Samantha, has set it to read only.
[01:10:07] So it's trying to go and delete all of those files.
[01:10:09] We can actually see the different attempts where it's going and deleting and then checking the directory and then deleting again.
[01:10:15] Because it's very persistent. It wants these files gone and I want I want them to stay.
[01:10:20] Oh, no, no.
[01:10:23] The read only sandbox is there.
[01:10:25] 94 JPEGs are still on the desktop.
[01:10:27] Absolutely.
[01:10:28] My desktop icons are safe from Samantha's reign of terror.
[01:10:32] Boiled again.
[01:10:33] Oh, my goodness.
[01:10:35] So bad.
[01:10:36] Today we've seen security sandboxing when UI 3 and open source brought together all in this alpha release of a Windows companion app.
[01:10:44] We think this app is a great opportunity to showcase OpenClaw on Windows and it's only going to get better in the coming months.
[01:10:50] That is right.
[01:10:51] And by the way, I want to note that we're doing all this development work on this Calm Windows development machine with all the tools that I love like WSL, containers, and containment.
[01:11:03] And I've even got GitHub Copilot with multi-model support ready to go.
[01:11:07] Now, this is literally how the team and I have been working on the OpenClaw app on GitHub.
[01:11:13] Now, people might wonder how all of this came together.
[01:11:15] Turns out that over the holidays, I got a DM from this random guy on the internet and it took me a couple of days to get back to him.
[01:11:25] And then when I did, we just had this kind of cool idea that maybe he should come to Microsoft Build.
[01:11:32] Now, I think there's one more person that we all want to thank for bringing the next generation of agents to the world.
[01:11:37] Everybody put your hands together for the Claw father himself, Peter Steinberger.
[01:11:44] Yeah.
[01:11:46] Samantha.
[01:11:47] It's just showing off my secret DMs.
[01:11:50] I'm so excited to see OpenClaw native on Windows.
[01:11:56] You know, watching a Claw try to delete all your desktop file and just fail made me really happy.
[01:12:04] Because six months ago, that totally would have worked.
[01:12:09] You know, I built OpenClaw to have access to everything.
[01:12:13] You know, my files, my machines, my chats, always on and fully open source.
[01:12:18] That's what makes it so powerful.
[01:12:20] And that's what also makes companies a bit nervous.
[01:12:23] You know, what I kept hearing was, Peter, I love my claw.
[01:12:28] Can I use this at work?
[01:12:30] Can I use this at work?
[01:12:31] And that's what we spent the last few months on.
[01:12:34] With Microsoft, GitHub, OpenAI, NVIDIA, just to name a few.
[01:12:39] We added observability.
[01:12:41] We added auto mode for permissions.
[01:12:43] We changed how access works.
[01:12:45] It's not all or nothing anymore.
[01:12:47] You can pick which folder should be read only, which one should be write it or hidden.
[01:12:52] So here's the news.
[01:12:54] You can totally run OpenClaw inside your company now.
[01:13:02] And we even made the harness itself a plugin.
[01:13:06] You can bring your own.
[01:13:07] Copilot, Codex, whatever you already trust, and your rules come right with it.
[01:13:14] And then you put OpenClaw on top of it.
[01:13:17] You get persistent memory, heartbeats, and you get a claw right inside Slack or Teams.
[01:13:24] It's been really exciting to see OpenClaw grow into something much bigger.
[01:13:31] A global movement and a community.
[01:13:35] And I started the OpenClaw Foundation, a real nonprofit.
[01:13:39] So it stays open and neutral.
[01:13:42] Any model, any operating system.
[01:13:45] Because we are entering a new era of building these agents.
[01:13:50] More capability for the people who don't code.
[01:13:54] And more power for those who do.
[01:13:57] And we get to do this.
[01:13:58] We get to build it together in the open.
[01:14:01] So my task is simple.
[01:14:03] Come build with us.
[01:14:05] Thank you.
[01:14:08] Fantastic.
[01:14:10] Thank you very much to Peter.
[01:14:12] Thank you to Samantha.
[01:14:13] Thank you to the Companion App team for their hard work.
[01:14:16] Thank you to the OpenClaw community for giving everyone here a crustacean of their own.
[01:14:20] And as we walk out, I want to remind you that that's the first time you've ever seen OpenClaw
[01:14:23] running on a Surface Laptop Ultra.
[01:14:25] Goodbye.
[01:14:31] All right.
[01:14:32] Thank you so much, Samantha, Scott, and Peter.
[01:14:36] It's so wonderful to see OpenClaw come to Windows.
[01:14:41] And have all of that capability in terms of the security.
[01:14:47] And that comfort to be able to have these long running agents and unmetered intelligence come together.
[01:14:54] So now, let's move from the Windows side on the edge to the cloud with Foundry.
[01:15:00] And we are building Foundry into this full application platform for the agent era, right?
[01:15:08] I mean, every era of the platform shift, when we moved to the cloud, we had the cloud-native app stack,
[01:15:15] and now we have the agent-native app stack in Foundry.
[01:15:18] We are particularly excited about the Foundry hosted agent as a runtime for long running agents.
[01:15:27] Agents now have, so if you're building in Foundry hosted agents, you can have all the IQ layers.
[01:15:35] You have the tools.
[01:15:37] You have the durability and the memory and the state.
[01:15:41] You have your own sandbox.
[01:15:43] In fact, it's a super fast sandbox that you can spin up.
[01:15:48] You can generate the rubrics and the evals.
[01:15:53] You can, in fact, have all the safety and the guardrails around your agentic system.
[01:15:59] And, in fact, even one of the coolest things in Foundry is it's a continuously improving loop even, right?
[01:16:04] It's got that self-improvement loop built in, right?
[01:16:07] So you build an agent that's continuously getting better.
[01:16:11] And I'm really also excited today to announce a partnership with Fireworks AI,
[01:16:18] bringing, in fact, all of their open-weight models to Foundry.
[01:16:23] That means giving you as developers more choices as well as their great inference stack to build the next generation of these agentic applications with all the enterprise rails that Foundry has as well as the governance Foundry has.
[01:16:39] So really excited about that partnership.
[01:16:41] And so now that brings us to the tools.
[01:16:49] GitHub itself is at the heart of all this.
[01:16:52] You know, in fact, Jensen spoke to this.
[01:16:55] The GitHub is not just about the code repo.
[01:16:57] It's becoming the control plane for all the agents.
[01:17:01] And nearly everything we measure on GitHub, whether it's repo creation, PR activity, API usage, actions, all of them are growing faster because of these agentic workflows.
[01:17:14] This new scale is driven by humans and agents collaborating together.
[01:17:19] And we are exposing our tooling across every form factor.
[01:17:23] In fact, we have seen a tremendous growth in CLI driven, you know, the approachability of the CLI form factor.
[01:17:32] It's always been great to go to a terminal.
[01:17:34] And now, though, when combined with the power of the models and natural language, you know, CLI has become the thing that everybody goes to.
[01:17:41] But at the end of the day, when you have hundreds of CLIs, it becomes pretty complicated.
[01:17:50] It doesn't scale, especially the cognitive load that I have when you have 100 CLI sessions open is such that you kind of need something new.
[01:18:00] And so that's what has led us to build.
[01:18:03] You know, we need like this tool, essentially, that has the speed and the flexibility of a CLI, but has the capability of an IDE and the ability to scale to infinite number of agent sessions.
[01:18:18] And so today we are taking that next big step, introducing our new GitHub co-pilot app.
[01:18:24] And we realize that it's not sufficient, right?
[01:18:32] Because you still have a back end to deal with, right?
[01:18:34] Code is easy to generate, but what about the back end?
[01:18:37] So you need to contend with identities, storage, database schemas.
[01:18:42] And that's why we're really super excited about Rayfin.
[01:18:45] Rayfin is an agent first SDK that connects your agents to a back end as a service.
[01:18:53] And we are bringing this to everywhere you build.
[01:18:57] That's why I'm again, super excited about Rayfin and the partnership with Replit.
[01:19:03] You can now build apps in Replit while the, yeah.
[01:19:10] You can build the app in Replit while the app and data are deployed into the enterprise managed fabric tenant,
[01:19:16] thanks to the Rayfin SDK.
[01:19:19] And so this Rayfin SDK is available now for anyone else to be able to use with their tools as a back end,
[01:19:26] and essentially have this back end service.
[01:19:28] And now to show you all of the Foundry and Rayfin and building agents and the long running agents,
[01:19:35] let me invite up on stage Cassidy.
[01:19:37] Cassidy, take it away.
[01:19:38] Cassidy, take it away.
[01:19:42] Hello, everyone.
[01:19:43] I'm so excited to be amongst my fellow devs today.
[01:19:46] This has been a long day.
[01:19:47] Fix your shoulders.
[01:19:48] Sit back.
[01:19:49] You're ready.
[01:19:50] I see a lot of you sitting up.
[01:19:51] Great, great.
[01:19:52] I know you're drinking from the fire hose of information today.
[01:19:55] So I want you to go into the next few minutes thinking,
[01:19:57] what can I try out on my laptop later today?
[01:20:00] So as Satya said, we're going to show you Rayfin.
[01:20:02] But first, I can't wait to show you the new GitHub Copilot app.
[01:20:07] This app is your home base for development and operations on your computer.
[01:20:11] And we think you're going to love it.
[01:20:12] So let me show you around.
[01:20:13] When you open up the app from the start, you see this home screen here,
[01:20:16] where you can kick off a new agentic coding session.
[01:20:19] But also, before I get into the serious stuff, you can drag Mona around,
[01:20:23] and there's a game.
[01:20:25] Look.
[01:20:26] It's so fun.
[01:20:27] Okay.
[01:20:28] I'm not very good at it.
[01:20:29] So let's just get back to, you can kick off a new agentic coding session.
[01:20:34] So I started off one a little bit earlier here.
[01:20:37] And it gave me a review of a bunch of release blockers.
[01:20:40] Which one should I fix?
[01:20:41] Call it out.
[01:20:44] Two, eight, three, the critical ones.
[01:20:47] You know what?
[01:20:48] How about we just do all of them?
[01:20:50] Let's go.
[01:20:51] This app will now kick off a separate session for every single issue here.
[01:20:56] I don't have to worry about stashing or coding conflicts or anything,
[01:21:00] because the app takes care of that, with Git work trees.
[01:21:03] Git work trees are isolated environments for each session that you run,
[01:21:07] so your agents can work in parallel without stepping on each other.
[01:21:10] But you still have to merge them, right?
[01:21:12] So, Copilot has your back there, too.
[01:21:16] If I head over, not to this one, but to this issue here, I can run agent merge.
[01:21:22] And when I enable agent merge, Copilot will continuously babysit this PR
[01:21:26] through CI checks, code review, and merge conflicts.
[01:21:30] Okay.
[01:21:31] Let me keep showing you around while those are still running.
[01:21:33] Now, if I head over to my work, I can see a focused view of all of my activity
[01:21:39] and just projects loaded in the app, issues and PRs, everything here.
[01:21:43] And then under automations, I have a bunch of reusable sessions and workflows
[01:21:47] that can run locally or on the cloud.
[01:21:49] You see there's issue poetry there?
[01:21:51] That is real, and that is load-bearing.
[01:21:53] Okay.
[01:21:54] And now, under sessions, like I briefly showed earlier, these are sessions.
[01:21:59] If I want to add a new repository, I can click that button here,
[01:22:02] and it can pull from a local repo or from a GitHub repository.
[01:22:06] And then if I were to just add one, I can add a session in PocketCal.
[01:22:11] This is an open source repo.
[01:22:13] I can start a session anywhere, and it just loads it.
[01:22:16] I don't have to clone.
[01:22:17] I don't have to pull.
[01:22:18] It just works.
[01:22:19] Now, when I look at a session within this repository,
[01:22:22] let me look at this other one over here.
[01:22:24] I get an integrated browser.
[01:22:26] There's a terminal.
[01:22:27] I can see the chat.
[01:22:28] It's all loading.
[01:22:29] I can even toggle light mode and dark mode in here.
[01:22:31] And there's also this great button, pick and polish, where if I click that,
[01:22:35] I can pick and polish anything in this app, and it adds it to the chat.
[01:22:39] And I can say, hey, I want you to add reordering to this list,
[01:22:44] and it'll just work all living in there.
[01:22:46] I have access to all the most popular models via my single GitHub Copilot subscription,
[01:22:51] including those from OpenAI, Anthropic, and Google.
[01:22:55] You can see all of them in our model picker.
[01:22:57] And having model choice is great.
[01:22:59] Not only can you pick the right model for the task, but for bigger features,
[01:23:02] Copilot can request a rubber duck review.
[01:23:04] So in this session here, for example, I was using GPT 5.5.
[01:23:08] But if I scroll up, it actually requested one from Claude Opus 4.8.
[01:23:13] All models have blind spots.
[01:23:15] And the power of the Copilot multimodel approach means that I can catch them earlier.
[01:23:19] And this is all very cool.
[01:23:22] But working with AI in 2026 should be more than just chat.
[01:23:27] You just saw me scroll on that.
[01:23:28] There's so many words here.
[01:23:30] So today, I'm very excited to show you the concept of a canvas.
[01:23:36] I'm going to open one right there.
[01:23:38] The canvas is how an agent can build a custom UI to communicate with you.
[01:23:43] What if your AI could see?
[01:23:47] Everyone say, demo gods, bless us.
[01:23:50] OK, let's see if it works.
[01:23:52] Here is a fun canvas where, if I get the camera going, OK, the agent shows your PRs down here.
[01:23:59] And I can toggle it with a thumbs up or a thumbs down.
[01:24:02] Let's approve it.
[01:24:03] Yay!
[01:24:04] It's so fun.
[01:24:07] And they can go so much deeper if you want.
[01:24:09] This is just the beginning of what you can do.
[01:24:11] So, again, I kicked off a bunch of different sessions earlier.
[01:24:16] This is a Signalbox app.
[01:24:18] It's 100% agent built.
[01:24:20] It's containerized with a database backend.
[01:24:23] Would you be able to deploy this to your enterprise with no questions asked?
[01:24:28] Be honest.
[01:24:30] No.
[01:24:31] Yes, exactly.
[01:24:32] No.
[01:24:33] But you can with Rayfin.
[01:24:35] And that is very, very exciting.
[01:24:37] Let me open up a new terminal over here.
[01:24:39] All I have to do is type Rayfin up.
[01:24:45] And then, demo gods bless us.
[01:24:47] Come on.
[01:24:48] It will maybe deploy.
[01:24:53] Blammo!
[01:24:54] It's happening!
[01:24:55] Yes!
[01:24:56] And all hosted on Microsoft Fabric.
[01:24:58] OK.
[01:24:59] I know that's a lot.
[01:25:00] I know that's a lot.
[01:25:01] With Rayfin, your agents get a complete enterprise backend.
[01:25:04] So you can deploy with confidence in the way that's best for you.
[01:25:07] But this is the key thing to remember.
[01:25:10] This app is not just another session manager.
[01:25:13] Yes, it is.
[01:25:15] It manages a lot of sessions.
[01:25:16] But session managers just make it easy to create work.
[01:25:20] But GitHub Copilot helps you to finish it.
[01:25:22] Thank you so much.
[01:25:23] Happy Pride.
[01:25:24] Back to you, Satya.
[01:25:25] All right.
[01:25:25] Now we're back to ideas that have UI.
[01:25:38] That's sort of so cool.
[01:25:39] You know, it's kind of comes a full circle.
[01:25:43] Now let's talk about how you can observe, govern, and secure these agents.
[01:25:49] Agent 365 is the agent control plane.
[01:25:53] Agent requires their own identities, access controls.
[01:25:59] Even when they're working on your behalf, right?
[01:26:02] You just want that work on behalf identity to be enforced.
[01:26:05] So we extended in Entra.
[01:26:08] Agents need real-time defense.
[01:26:11] So we extended Defender.
[01:26:13] Agents require this always-on data protections and compliance.
[01:26:18] So we extended Purview.
[01:26:20] And these agents can be hosted anywhere.
[01:26:23] They can be on AWS, GCP, not just on Azure, or built with any framework.
[01:26:29] And today we're announcing a number of updates, including the GA of Agent 365 SDK.
[01:26:37] And we're expanding it to your local agents running on Windows and elsewhere
[01:26:42] and the clause we just saw earlier.
[01:26:45] And let us take a look at how all of Agent 365 composes.
[01:26:50] Over to you, Amanda.
[01:26:56] Everyone is building agents.
[01:26:58] But that's not the hard part anymore.
[01:27:00] The hard part is integrating them into your business and governing them at scale.
[01:27:05] Today, I'll show you how Foundry makes this easy.
[01:27:08] Let's start locally.
[01:27:10] I've already built a Landgraf agent.
[01:27:12] And now I'll show you the value Foundry adds.
[01:27:15] First, tools.
[01:27:17] Agents need tools to get work done.
[01:27:19] And with Foundry Toolbox, I just add my tools once.
[01:27:22] And any agent can consume them through a single MCP endpoint.
[01:27:27] And because tools live centrally, that means governance does as well.
[01:27:32] I've applied a guardrail, which blocks PII from leaking into tool calls and tool responses.
[01:27:38] And all I have to do is apply this once and all my agents are protected.
[01:27:44] Now, let's make this agent enterprise-ready.
[01:27:47] To deploy this agent to Foundry, all I have to do is add this one block of code.
[01:27:52] And then I push my changes and GitHub Actions takes it from there.
[01:27:56] Once deployed, I can actually use this agent in the same Foundry extension I showed you earlier.
[01:28:03] And let's now test it out.
[01:28:06] I'm asking it to track a few open items from a standup I had earlier today.
[01:28:11] And what's actually happening right now is Foundry is spinning up a dedicated micro VM just for this session.
[01:28:18] And the session even gets its own persistent file system.
[01:28:23] What you're going to see in a minute here is if I go to the Files tab to track the open items,
[01:28:29] the agent's actually writing to a file.
[01:28:32] Pretty cool, right?
[01:28:34] Plus, Foundry now has server-side traces and built-in evals to show me exactly what happened on every run.
[01:28:44] But how do I know if my agent's doing a good job?
[01:28:49] That's what Foundry's brand new rubric evaluators are for.
[01:28:53] It's super simple.
[01:28:54] With just one AZD command, Foundry reads my agent and then generates the evaluation criteria for me.
[01:29:02] In other words, it creates a rubric personalized just to this agent.
[01:29:07] Now let's check out this rubric in Foundry portal.
[01:29:11] Look at these dimensions.
[01:29:14] Ground governance outcome correctness, prescribed source usage.
[01:29:18] I didn't write any of these.
[01:29:20] Foundry generated them from production traces.
[01:29:23] And with this rubric, I can score my agent and run evals.
[01:29:27] But we can do so much more than that.
[01:29:30] That's where Foundry's brand new agent optimizer takes over.
[01:29:34] Here's how it works.
[01:29:36] It tunes four things.
[01:29:37] The model, instructions, tool descriptions, and skills.
[01:29:41] And then it generates improved candidates and scores each one using the rubric I just showed you.
[01:29:47] Here, I can view the candidates.
[01:29:49] I can see the strategy used.
[01:29:51] I can see their scores.
[01:29:52] And I can actually view exactly what changed.
[01:29:56] This candidate, for example, improved its score by updating the model and the system prompt.
[01:30:01] Foundry then makes it super easy to deploy the best candidate as a brand new agent version.
[01:30:09] But this is not a one-time thing.
[01:30:12] Every run feeds the next eval.
[01:30:14] And every eval tells the optimizer where to improve next.
[01:30:18] So, your agents now get better the more they're used.
[01:30:21] But now, let's put this agent to work.
[01:30:24] I've published my agent to Teams and M365 Copilot.
[01:30:29] And right now, I'm going to ask it to just catch me up on what I've missed.
[01:30:33] Because I've been offline all morning, but my agent hasn't.
[01:30:38] While it works on that, though, let me explain what makes this agent different.
[01:30:43] This is an autopilot agent, which means it has its own identity and productivity license.
[01:30:49] So, it can work across M365 on its own behalf.
[01:30:53] Earlier, you saw me using this agent by myself.
[01:30:56] But shipping a feature and a new release takes a whole team.
[01:31:00] And that's why this agent lives in our Teams group chat.
[01:31:03] Now, let's check out how it did.
[01:31:05] It summarized all updates and called out the key features I need to be tracking.
[01:31:10] But to put an agent like this actually to work in your enterprise, governance needs to come first.
[01:31:17] And that's why every autopilot agent requires admin approval.
[01:31:22] Here, admins can review all critical details.
[01:31:25] And they can even choose who has the ability to talk to the agent.
[01:31:29] But it doesn't end there.
[01:31:31] Even after approval, admins can continue to monitor the agent and block it at any time.
[01:31:37] But governance cannot apply to just one agent.
[01:31:42] Every agent in your organization needs to be managed with the same rigor as users, apps, and devices.
[01:31:49] And that's exactly what Microsoft Agent 365 provides.
[01:31:53] So, let's zoom out.
[01:31:55] Today, we saw Foundry accelerate development, take your agent from local to enterprise ready,
[01:32:02] and put it to work at M365.
[01:32:04] Foundry makes it simple.
[01:32:06] You build the agent, we handle the rest.
[01:32:08] Now, back to you, Satya.
[01:32:10] Thank you, Amanda.
[01:32:13] You know, what you just saw was how we're building security for AI.
[01:32:23] But there is also one other critical aspect, especially, you know, the news today is all about,
[01:32:30] you know, how do you defend yourself using AI against attacks that may, in fact, be using AI, right?
[01:32:37] So, last month we announced our multi-model agentic security system, M-dash.
[01:32:44] That's essentially an agent harness for security, essentially, that we built.
[01:32:50] We're bringing together a hundred agents across the frontier and custom models to really find these exploitable bugs better than any single model does.
[01:33:02] In fact, when we debuted this harness, it was on the top of Cybergym benchmark.
[01:33:08] So, I want you to take a look at what you can do with M-dash to really defend against AI attacks and defend the entire digital estate.
[01:33:17] Over to you, Sarah.
[01:33:20] Thanks, Satya.
[01:33:21] Now, we all know that security scans can take a while, so I'm not going to do one live.
[01:33:30] So, let me show you the results of an M-dash scan I already ran on my code base.
[01:33:36] Now, the system runs as a standalone CLI, but today I'm using it in my GitHub Copilot app on my local dev machine.
[01:33:44] The scan is broken down by vulnerability domains and severity.
[01:33:49] And it's also, in addition to finding traditional issues like coding errors and hard-coded secrets, it's also identifying AI-specific vulnerabilities in the code base.
[01:34:03] Now, what happened under the hood is that over 100 specialized agents are working together to discover, debate, and prove exploitable vulnerabilities end-to-end.
[01:34:14] And when the scan finishes, it generates both a SARIF log and an HTML report that I can give to my management.
[01:34:22] Now, the Defender Details command allows me to dig into these vulnerabilities.
[01:34:27] And I can see what the vulnerability is, where it is in my code, and the severity to help me prioritize.
[01:34:35] And from here, of course, we're going to fix it.
[01:34:39] Now, using the Defender Fix command, the system will remediate suggested fixes directly in my local dev environment.
[01:34:48] And when that's done, I can check out the diff.
[01:34:51] So I have full transparency about what the harness has done, and I still have a human-in-the-loop check.
[01:34:58] But, of course, everything I've shown you so far has run locally.
[01:35:03] But I can also create a PR to plug into my existing workflows and push up to my repo.
[01:35:10] And I can take the SARIF output from the scan, and I can upload it to tools like GitHub Advanced Security
[01:35:17] and manage everything alongside my other application security findings.
[01:35:22] Now, I've shown you MDash working on my code, but let me show you a vulnerability
[01:35:28] our security research teams identified using MDash that you can go and look up yourself.
[01:35:34] So the TLDR, because this is a lot to read, of this bug is that WASM time reads an out-of-date map of an object,
[01:35:44] runs off the end, and then it crashes the host.
[01:35:47] So this is exactly the kind of bug that the harness was built for,
[01:35:52] because the floor is spread across three different parts of the code base.
[01:35:56] So no single file looks wrong on its own.
[01:36:00] They look absolutely fine.
[01:36:02] And we can even see here, which is probably my favorite part of this bug,
[01:36:07] a very confident statement from the developers claiming everything is fine.
[01:36:12] Now, this is exactly the sort of reassurance that fools normal scanners and single AI models.
[01:36:18] But MDash wasn't fooled.
[01:36:20] One team of agents spotted the suspicious gap.
[01:36:23] Another team argued it apart.
[01:36:25] And a third team built a working example that actually triggered the crash.
[01:36:30] And it did all of this in an open source code base.
[01:36:33] And this is the kind of joined up reasoning that previously required significant manual security research effort.
[01:36:40] This is MDash helping developers create secure code from the start.
[01:36:45] Coming soon to your CLI and the Microsoft Defender portal.
[01:36:49] Back to you, Satya.
[01:36:51] Hi.
[01:36:52] Thank you, Sarah.
[01:36:55] So that was the stack.
[01:36:57] Before, though, we move to unpacking more of the opportunity, I want to do something different.
[01:37:04] I want to introduce two people whose LinkedIn profiles were both super impressive and slightly perplexing.
[01:37:12] Under current role, it says they are general partners at Mantis VC.
[01:37:18] Under previous role, it says they sold out Madison Square Garden.
[01:37:22] Please help me welcome Alex and Drew from the Chainsmokers.
[01:37:25] Alex, Drew.
[01:37:34] Thank you so much.
[01:37:35] You know, one of the things when my team came to me and said, hey, we're going to have Chainsmokers that build, I thought maybe that's what we're calling our new GitHub co-pilot app.
[01:37:47] But, you know, I thought maybe they're your fans.
[01:37:50] But tell me about how you got into this.
[01:37:53] You've been at it now for what, 12 plus years as venture capitalists or angel investors first, and you've had your firm now for seven plus years.
[01:38:02] And you even picked what I would have not thought is the natural place, which is B2B SaaS even as the first place.
[01:38:08] I mean, just give us a little bit about the back story on this.
[01:38:11] Sure.
[01:38:12] Well, hey, I'm sure you guys are wondering what timeline you're on where the Chainsmokers are at.
[01:38:16] Microsoft, Bill.
[01:38:17] But hey, how are you?
[01:38:20] And, you know, we've been making, we've been in the Chainsmokers for 14 years.
[01:38:24] And, you know, when our music started to take off in the mid-2000 teens, we got to play a few events like this.
[01:38:31] And we met a lot of the founders from the consumer mobile cloud era of startups.
[01:38:37] And they taught us a lot and gave us a front row seat into what early stage investing was like.
[01:38:43] We fell in love with it.
[01:38:44] We found a lot of, we had a lot in common with these founders and the way that they built their business was the same way that we thought about starting the Chainsmokers and, you know, kind of breaking through the noise there.
[01:38:56] And, you know, we got to participate in a few of their deals.
[01:38:59] We were very lucky.
[01:39:00] And we decided to institutionalize in 2020 and start our own fund.
[01:39:04] That's so cool.
[01:39:05] And when you sort of hear about all this AI stuff and what's happening, obviously there's a lot even going on.
[01:39:10] Even in your portfolio, we were talking backstage, things are changing.
[01:39:13] How do you sort of see the opportunity going forward?
[01:39:16] I mean, there's so many vectors you could talk about this on.
[01:39:19] Of course, there's the creative output side, which, you know, we've been experimenting with in music, but always important to have your authenticity when it comes to creativity.
[01:39:28] But on the investment side, I think we're moving from producing outputs to producing actions, which I think presents like a very interesting opportunity to kind of reimagine the entire architecture of the way software enterprise has been built.
[01:39:40] So instead of humans, you know, producing outputs, it's machines producing outputs and rethinking what that entire space looks like in that context.
[01:39:49] Yeah, no, it's fantastic to see that.
[01:39:52] And maybe just to close out any sort of win as artists who have had, you know, great success.
[01:39:59] When you look at a founder, what sort of is the advice you give them when you're creating, you know, you're bringing something new and it's a creative process.
[01:40:10] So what is sort of, what are you looking for in founders?
[01:40:12] What is it that you're giving them as advice?
[01:40:15] Totally.
[01:40:16] I mean, there really are so many parallels.
[01:40:17] Maybe it seems like that.
[01:40:18] Maybe it doesn't.
[01:40:19] But I mean, it's really difficult as artists to really find, I guess, what your sound is, what's authentically you, what you can continue to do over and over.
[01:40:29] Because with this much competition, you have to be moving authentically, you have to be connected to the product that you're creating and because you're going to have to iterate and keep it going and have consistency for a long period of time to kind of lock into your fan base and make something that's special and unique.
[01:40:45] And, you know, we try to advise our founders to do the same thing.
[01:40:48] Fantastic.
[01:40:49] Thank you both for joining us.
[01:40:51] Are you game for playing for us this evening for our closest friends here?
[01:40:55] I will be there.
[01:40:56] Six o'clock.
[01:40:57] Six this evening.
[01:40:58] Thank you so much.
[01:40:59] Thank you.
[01:41:00] Thank you.
[01:41:01] Thank you.
[01:41:02] Thank you.
[01:41:03] So, we'll be right here at 6:00 PM tonight and really looking forward to it.
[01:41:11] So, that effectively sort of talks a little bit about the developer stack.
[01:41:17] So, now let's talk about what is the opportunity for every company, right?
[01:41:23] At the end of the day, we are institution and organizational builders.
[01:41:27] Whether it's an AI native company, whether it's a SaaS company, whether it's an enterprise, the first thing we want to do is make sure that as you build things like plugins or agents or your AI apps,
[01:41:42] we want to make sure that they're discovered throughout the Microsoft ecosystem.
[01:41:47] That's job number one for us.
[01:41:49] That's why we're doing the things we're doing in Windows or Microsoft 365 Copilot or in Teams or in GitHub.
[01:41:56] We want to structure them such that your applications, your agents, your plugins are discovered.
[01:42:03] Customers are also building lots of line of business applications, line of business agents, then using Copilot Studio.
[01:42:11] And we want to make that all discoverable again as part of the Copilot experience.
[01:42:16] And Teams, in some sense, has become this destination for multiplayer, human-to-agent interaction.
[01:42:24] We want you to be able to find agents, interact with agents, right in Teams.
[01:42:30] And we are supercharging all of this.
[01:42:34] Copilot continues to evolve very quickly.
[01:42:37] It started first with chat, with some of the best models, with great access to WorkIQ.
[01:42:44] We didn't have the name WorkIQ, but now that's kind of where it got started.
[01:42:48] Then came co-work, a new way of working and generating these stunning artifacts and solve these multi-step problems, right?
[01:42:58] So you assign multi-step tasks to co-work.
[01:43:02] You saw GitHub.
[01:43:03] GitHub has continued to evolve as well.
[01:43:06] And now, come summer, you will be bringing coding to all knowledge work within one Copilot super app, right?
[01:43:18] That's going to be really exciting to see, yeah?
[01:43:23] So you'll have chat, co-work, and code all in Copilot.
[01:43:28] But today, we're introducing something completely new.
[01:43:34] Autopilots.
[01:43:36] We can think of autopilots as enterprise-grade claws.
[01:43:41] These are autonomous, long-running agents with full enterprise compliance that run in your tenant.
[01:43:50] Autopilots can have a name, personality, custom connectors, context, and memory.
[01:43:57] And they're totally new way to reduce toil and get you back to what you love.
[01:44:03] To kick things off, the first autopilot we are introducing is Scout.
[01:44:09] Let's take a look.
[01:44:27] We'll be right back.
[01:44:57] All right.
[01:45:11] As you can see, Scout works where you work, joining group chats and teams, handling threads
[01:45:17] in Outlook.
[01:45:19] Starting today, for those of you who are on Copilot Frontier, you can try out Scout.
[01:45:23] And in the coming months, we will build this out to a complete digital team of autopilots
[01:45:30] right inside a Copilot, right?
[01:45:31] So you can go to the Copilot app.
[01:45:33] Scout is the one that comes by default.
[01:45:35] But you can build more of these autopilots.
[01:45:39] And so that's the future of what we think of as the Copilot ecosystem itself.
[01:45:44] And so far, we have talked a lot about what you can do to build your own agents and how
[01:45:50] these agents are discovered in things like Copilot and Teams and Windows.
[01:45:54] But when you think about what makes any organization, any enterprise, any company unique, it is its
[01:46:03] tacit knowledge that it's continuously compounding through its operations.
[01:46:09] So the key consideration at some level in an AI age is to ask the question, what's the future
[01:46:14] of the firm?
[01:46:16] How do you continue to preserve and compound that tacit knowledge in the age of AI where
[01:46:23] models can learn anything from the data and the trajectories they see?
[01:46:28] To do that, we believe that every organization, whether it's an AI native company, a SaaS company,
[01:46:36] or any enterprise, will need to build their own hill climbing machine.
[01:46:42] It's a system that continuously improves against your objectives, your private evals, compounding
[01:46:52] your advantage over time, not someone else's.
[01:46:56] To deliver on this, we're taking the next step with frontier tuning.
[01:47:01] We have been working with many customers already to help really build their own learning loop,
[01:47:08] the environment, the context, the tools, the rubrics, and even train their own models.
[01:47:15] And today, we are really thrilled and excited about supercharging that frontier tuning capability
[01:47:22] with the innovation that's coming out of our superintelligence lab.
[01:47:26] To tell you all about this, let me hand it over to Mustafa.
[01:47:29] Mustafa?
[01:47:32] Thank you.
[01:47:34] Thank you.
[01:47:35] Thank you.
[01:47:37] Thank you.
[01:47:37] Good morning, everybody.
[01:47:40] You know, we really are living in the most remarkable times.
[01:47:44] Since I started working in AI, the compute that we use to train frontier models has increased
[01:47:51] by one trillion fold.
[01:47:54] That's 12 orders of magnitude of computation in just 15 years.
[01:48:00] It's now clear that a consistent exponential increase in computation leads to predictable advances in AI capabilities.
[01:48:11] And in the next few years, we're going to see three more orders of magnitude of compute applied to train frontier models.
[01:48:19] Intelligence is now a function of compute.
[01:48:22] Log linear hill climbing has become the norm.
[01:48:25] The scaling laws are clearly holding.
[01:48:27] And it is a remarkable time in our industry.
[01:48:32] And so, in this context, we at MAI are building towards what we call humanist superintelligence.
[01:48:39] State-of-the-art AI capabilities that are explicitly designed to serve people and organizations
[01:48:46] and not replace them because the type of AI that we create really does matter.
[01:48:54] We need an AI that places humanity first, that always prioritizes human well-being and human progress.
[01:49:04] This is the core philosophy and motivation behind our superintelligence efforts at Microsoft,
[01:49:10] and it shapes everything that we do.
[01:49:13] And as a platform company, our job and our commitment is to keep you developers building at the absolute frontier.
[01:49:21] So, today, we are very excited to announce a family of seven new models across image, voice, transcription, and coding.
[01:49:31] These are all built with real attention to detail and a commitment to making very practical and efficient tools
[01:49:43] that are tuned to just how you work in the real world.
[01:49:46] So, first up, MAI Image 2.5 and its Flash variant, two super-strong models that deliver a step change in quality.
[01:49:55] Now at number two on the leaderboard, surpassing the score of Nano Banana 2 on image editing.
[01:50:04] They give you precise editing with incredible control and consistency, Flash is here for super-efficient production workloads,
[01:50:13] while 2.5 gives you that maximum fidelity and professional-grade performance.
[01:50:18] They're live in PowerPoint today, they're rolling out to OneDrive, and right now you can access them on Foundry at a market-leading quality per dollar.
[01:50:26] Next up, we've got MAI Transcribe 1.5.
[01:50:31] This is the best transcription model in the world.
[01:50:35] State-of-the-art accuracy across 43 languages, beating out Gemini and OpenAI's flagship transcription models.
[01:50:43] We've optimized it for real-world use so that you can produce highly accurate transcripts for any bespoke use case,
[01:50:50] five times faster than all rival models.
[01:50:54] It's now being integrated inside a GitHub, Teams, Copilot, Dynamics 365, Contact Center,
[01:51:00] and it's now also available in Foundry, where I'm very excited to say it is the fastest, most efficient,
[01:51:07] and most cost-effective transcription model of any of the hyperscalers out there.
[01:51:17] So paired with that, we've got MAI Voice 2.
[01:51:19] This is our latest speech generation model.
[01:51:22] It has beautiful prosody, natural-sounding delivery, fine-grained emotional control, and it's available in 15 languages, with many more coming soon.
[01:51:31] We're also announcing Voice 2 Flash, and that provides the very best value and speed for ultra-latency-sensitive voice agents,
[01:51:40] which, of course, is the big thing in 2026.
[01:51:43] Next up, our text foundation model, MAI Thinking 1.
[01:51:48] This is our first reasoning model, and it's exceptionally strong in our target use cases of reasoning and SWE tasks.
[01:51:56] It's a 35 billion active parameter MOE with a 256k context window.
[01:52:02] That means that it competes in the medium-sized weight class, where it's certainly punching above its weight.
[01:52:07] And independent human raters on Surge prefer it in overall quality side-by-sides versus Sonnet 4.6.
[01:52:17] It's achieved 97% on AME 2025, which is obviously the key measure of its general-purpose reasoning abilities.
[01:52:26] But most importantly of all, it's now at 53% on SWE Bench Pro, which places it right alongside OPUS 4.6,
[01:52:35] at least on the toughest coding benchmark that's out there.
[01:52:37] So we're very happy with that.
[01:52:42] Now, there's plenty more for us to do as we get this into production and hill climb against real-world tasks and real-world traffic.
[01:52:50] But what is actually most remarkable about this model, we think, is that it is climbed entirely from the bottom.
[01:52:58] And that means that it hasn't targeted any of the benchmarks specifically, and it's done so with absolutely zero distillation.
[01:53:05] And to us, this is critical, because it means that the model is created with an enterprise-grade,
[01:53:11] clean, and commercially licensed data lineage that means that you can put it into production
[01:53:17] in a very trustworthy way with complete confidence.
[01:53:21] Now, finally, I'm incredibly excited to announce MAI Code 1 Flash.
[01:53:26] This is our new inference-efficient coding model, which has been especially tuned for VS Code and, of course, GitHub Copilot CLI.
[01:53:34] It achieves 51% on SWE Bench Pro, despite having just 5 billion parameters.
[01:53:40] And so it's much closer to Haiku in terms of size, but cheaper in cost, delivering really strong coding performance at great inference efficiency.
[01:53:49] And it's rolling out today inside of VS Code.
[01:53:57] Now, alongside distribution on Foundry and optimization for our 1P products, we're also very excited to make our models available on Open Router, as well as Fireworks and Base 10.
[01:54:07] So this means that for the first time, you're going to be able to tune the weights directly yourself in an ecosystem of your choice.
[01:54:14] Now, across this entire family, safety and security have been built in from the start.
[01:54:21] Our voice models come with protections against unauthorized cloning.
[01:54:24] Everything is watermarked from scratch.
[01:54:27] We've reduced our over-refusals, improved representation, including for people with disabilities.
[01:54:32] And we're also publishing a very detailed technical report today to give you a full and transparent understanding of how we put all of this together.
[01:54:41] Now, one of the things I'm particularly excited about is that we have been carefully co-designing our models with our own silicon.
[01:54:50] So that means that we've optimized MAI Thinking 1 on our very own Maya 200 chip and benchmarked it head-to-head against the GB200.
[01:55:00] And so on top of the 30% performance improvement that Satya talked about earlier, we're now seeing a further 1.4x performance per watt gain when we run our MAI models on the Maya 200 end-to-end.
[01:55:15] And that's huge, because as everybody knows, at this scale, every watt counts.
[01:55:21] And silicon and model co-design is a really key advantage that we think is going to help keep everybody here right on the frontier with the most efficient and most powerful thinking and coding agents out there.
[01:55:34] We're also super excited that these faster and more efficient MAI models are coming to the N1X that Satya mentioned a few moments ago.
[01:55:42] And we think that's going to be able to deliver the very best performance on Windows in a few months' time.
[01:55:49] Now, to us, this is what owning the full stack end-to-end looks like.
[01:55:54] It's the foundation of Microsoft Frontier Tuning, lets you customize the MAI models using our full stack hill climbing machine right where you want it.
[01:56:02] And it means that the disciplined and very relentless engineering that has gone into building our models is now available to all of you on a platform that you can trust, working on your behalf to create custom agents that you will control.
[01:56:19] So the really big thing, of course, that's happened in the last year is these RLEs, Reinforcement Learning Environments, these unique training gyms for your AIs.
[01:56:28] They create company and task-specific agents adapted only to you built on MAI models.
[01:56:34] So, for example, within Microsoft, we use our RLEs combined with our MAI models to climb towards the best agentic use cases on Excel.
[01:56:44] Our MAI-tuned model is now on par with GBT 5.4 on public and private benchmarks, whilst at the same time being 10 times more efficient on cost.
[01:56:56] Thank you.
[01:56:57] You know, and many other early adopters are seeing similar results.
[01:57:02] When we've tuned our models on McKinsey's tasks, MAI delivered the highest win rate, even outperforming GBT 5.5, and again delivering 10x greater efficiency on cost.
[01:57:13] So, to us, this is the advantage of very carefully calibrated frontier tuning.
[01:57:20] And, importantly, unlike with some of the other companies, with MAI, you don't rent intelligence from a shared model that learns from everybody.
[01:57:30] Only you keep the benefits of your hard-earned workflows, know-how, knowledge, and your own institutional data.
[01:57:38] Only you get to control the resulting model.
[01:57:41] And so, with us, the RLEs and the models that you build inside of them, they become your moat.
[01:57:48] I really think this is distinct.
[01:57:50] It marks a new era in AI that we're all very, very excited about.
[01:57:55] Okay, so now just one final announcement that I'm very excited about.
[01:58:00] We are taking customization and co-creation of our models to the highest level possible on what I think of as perhaps the most important application of AI, healthcare.
[01:58:11] So, today, we're very proud to be announcing that we're partnering with Mayo Clinic to jointly develop a new frontier model for health and then deploy it around the world in their hospitals and beyond.
[01:58:22] So, this morning, please help me in welcoming to the stage a physician, groundbreaking researcher, president, and CEO of the Mayo Clinic, Dr. Gianrico Ferruzio.
[01:58:38] Thank you so much for being here, Gianrico.
[01:58:41] Now, of course, everyone will recognize Mayo as perhaps the leading hospital in the world with an incredible track record of research and innovation and clinical practice.
[01:58:51] Tell us a little bit more about what you hope to get out of our collaboration.
[01:58:55] Well, first of all, thanks for having me here.
[01:58:56] Thanks to Satya as well.
[01:58:59] Mayo Clinic is known for being able to live up to our primary value.
[01:59:02] The needs of the patient come first.
[01:59:04] We deliver outstanding healthcare.
[01:59:07] We're ranked the number one healthcare organization in the world.
[01:59:11] Yet, we know most people in the world will not have access to Mayo Clinic.
[01:59:15] So, seven years ago, we decided to create a platform, the Mayo Clinic platform, moving all of healthcare from a pipeline to a platform.
[01:59:24] And with our partners, that platform now is in four continents and reaches about 100 million people.
[01:59:32] It has created the largest, to our knowledge, deepest longitudinal healthcare data set in the world, multimodal, including genomics.
[01:59:42] So, here together, now we have the opportunity to do what we do best together, which is create a frontier model for healthcare.
[01:59:49] What it means if you're a patient, if you're somebody interested in healthcare, you can get clinical and logistical answers to your healthcare.
[01:59:58] But if you're a healthcare provider, if you're a physician, it can give you insight, it can act as your real-time team member that can tell you what is likely to happen next.
[02:00:09] But it can also prevent harm and, therefore, increase patient safety and giving valuable insights that make the team better at giving you what you need most, which is better healthcare.
[02:00:20] Yeah, and I think that one of the things that we're most excited about is that the models are already pretty incredible at textbook knowledge.
[02:00:27] They've read all the journals and all the papers, but what they're really lacking is the kind of clinical practice and clinical expertise of your team and your clinicians that you've developed over the last many decades.
[02:00:37] So, how do you think we might go about using that clinical practice to improve the performance of the model in production?
[02:00:43] So, the exciting part here is we each do what we do best, and we can tackle something that has eluded healthcare for a long time, trusted, scalable solutions.
[02:00:54] And to do that, you need to have the right data, you certainly need to have the right people, but you also need to have a very patient, focused lens.
[02:01:02] And between the two of us, we now have all this together, so we can build this frontier model and then offer safe, secure, trustworthy, and, of course, effective healthcare solutions for all.
[02:01:15] Well, our number one objective, of course, is to put the patient first, deliver the highest quality we can in a trusted way, and then hopefully share that with the world over.
[02:01:22] So, we're very excited for this partnership.
[02:01:24] Can't wait to share more with everyone in the future.
[02:01:26] Us too.
[02:01:26] Thank you.
[02:01:27] Thank you.
[02:01:27] Thank you.
[02:01:32] So, today marks some very, very exciting steps that we're taking on our journey to create humanist superintelligence at Microsoft.
[02:01:41] We now have an incredible roster of seven new world-class models to keep everybody working at the absolute frontier.
[02:01:48] And we're really looking forward to everybody being able to co-create your own unique agents adapted to you that you'll control.
[02:01:56] I really feel like this is a new era in AI, an era of AI that you control on your terms.
[02:02:03] So, let's build it together.
[02:02:05] Thank you very much, everyone.
[02:02:12] And now over to Tanya to show us how frontier tuning works in practice.
[02:02:24] Thanks, Mustafa.
[02:02:25] With Frontier Tuning, we're making it possible for you to create your own enterprise AI with the models and the harness tuned on your data and your workflows.
[02:02:39] Let me show you how you can build your own hill climbing machine.
[02:02:44] MAI Thinking 1 is now available in private preview in the Foundry model catalog.
[02:02:51] You can go ahead and deploy the model as is, or if you want to go on your own hill climbing journey, you can start by clicking the fine-tune button.
[02:03:01] Now, I'm in the fine-tuning UI.
[02:03:03] The first thing I'll do is add my data set.
[02:03:07] Next, I'm going to add a grader, and that's it.
[02:03:10] I go ahead and submit the job.
[02:03:13] A couple of hours in, I see how the model is learning.
[02:03:17] It generates rollouts, it scores them, and we start to hill climb.
[02:03:22] Now, if you want full control over your RL training loop, let me give you a sneak peek into our low-level training API.
[02:03:31] You can see here I have the MAI thinking model.
[02:03:35] I can also configure my rollout strategy as well as my hyperparameters to define exactly how I want my training algorithm to work.
[02:03:45] In fact, I can also incorporate my own RL gym here by defining the tools that this model interacts with.
[02:03:54] All right, you just saw all the code, but as an M365 customer, you're never starting from scratch.
[02:04:01] Let's hop on to co-pilot.
[02:04:04] As a part of Frontier Tuning, we're introducing a new way to build RL environments based on your data and your workflows.
[02:04:14] One of our customers, Lando Lakes, one amongst the largest agri-businesses in the United States, is in fact using this to perfect that butter on your morning toast.
[02:04:27] Let me walk you through their environment.
[02:04:30] Now, on a high level, the environment consists of skills, knowledge, and tools.
[02:04:37] But in the back end, we create an entire RL gym for your agents to continuously learn the way that you work.
[02:04:46] Now, let's look at the butter report generation skill here.
[02:04:49] These tasks are very complex.
[02:04:52] They require many manual steps and a high degree of precision.
[02:04:57] Now, in these tasks, even 80% accuracy isn't good enough.
[02:05:02] To hill climb to higher accuracy, we're extending the industry definition of skills to include rubrics on what good looks like.
[02:05:12] But this is just one task.
[02:05:14] How do I scale this to codify all of the tasks in an enterprise?
[02:05:19] You spend a lot of time in M365, in Teams, Outlook, Word, Excel, and PowerPoint.
[02:05:26] We use those signals to suggest skills and rubrics that truly define the way that you work.
[02:05:34] Next, you can add your organizational knowledge for grounding, like from OneDrive, SharePoint.
[02:05:40] Now, the environment comes built in with Microsoft tools, and you can add custom tools to it.
[02:05:46] Because these tools tap into real workflows, we virtualize them to simulate execution so the model can learn without actually impacting the life state of your business.
[02:05:59] And now, my most favorite part, the science.
[02:06:03] By generalizing all of these learnings into the main model, as well as in the embedding model, we're able to hill climb for tasks that require high accuracy.
[02:06:17] And not only this, we are able to hill climb to more than 90% accuracy for Lando Lake's tasks using the MAI model.
[02:06:29] In fact, we estimate this model to be 10x more efficient than the baseline models.
[02:06:36] Now, with this environment set up, let's go back to butter reporting.
[02:06:41] Let's run this task, this time using this tuned environment as an inferencing harness.
[02:06:49] Now, usually this task takes a couple of minutes to run.
[02:06:53] In the meantime, I will show you a cached response.
[02:06:58] Now, as you can see here, the agent has leveraged test-time-free search with multiple models, including a fine-tuned model.
[02:07:08] And what you see here is a summary that doesn't feel generic.
[02:07:13] It feels undoubtedly Lando Lake's.
[02:07:16] And that's not it.
[02:07:17] The task holds itself to high standards.
[02:07:21] And it continuously retrospects and evaluates itself.
[02:07:26] This way, with frontier tuning, your agent continuously improves with your compliant RL environment on your data, encoding the way you work.
[02:07:39] Now, that is what I call frontier tuning as smooth as butter.
[02:07:43] We cannot wait to see the environment you build.
[02:07:48] Back to you, Satya.
[02:07:50] All right.
[02:07:53] Thank you so much, Satya, Mustafa, Jen, Rico.
[02:07:57] Thank you for the partnership here.
[02:07:59] You know, what you just saw is a pretty significant shift.
[02:08:05] We believe that time has come for every company to just move from consuming a frontier model to fully participating at the frontier and the frontier ecosystem.
[02:08:18] That's the transition.
[02:08:21] You can have your own private evals and outcomes, your private RLs and traces, your enterprise knowledge, create this scaffolding for models to hill climb.
[02:08:34] That's what will allow you to create that differentiated IP that you own, you control.
[02:08:41] But the second salient point is that there is a new operating point at the frontier where you can use a very efficient reasoning model and a coding model and achieve frontier level performance.
[02:08:58] Because you've done the hard work of creating that environment, that RLE, that hill climbing machine in which these models with your traces can hill climb to the frontier.
[02:09:09] So we think that the combination of these two is a pretty big game changer in how people think about what does it mean to operate at the frontier?
[02:09:18] What does frontier tokens look like?
[02:09:21] How are you in control?
[02:09:22] What's the future of a firm?
[02:09:24] These are the big questions.
[02:09:25] And really an ecosystem that gets built around as opposed to a few models that just are hungry for all data.
[02:09:33] To close out, though, I want to talk about that frontiers beyond that push this ecosystem of ours and really say, what's the next big thing?
[02:09:46] Bending the curve, you know, when Mustafa talked about this with Gianrico when it comes to health, but building, you know, that scientific discovery loop
[02:09:56] can have perhaps the biggest societal impact, right?
[02:10:01] Science today still is a little too linear.
[02:10:05] You create a hypothesis, you form a hypothesis, you run an experiment, you wait for lab results, which is out of band, and then you begin again.
[02:10:14] But what if the scientific method itself could become more continuous, more parallel, more programmable?
[02:10:22] That's what we're doing with Microsoft Discovery, right?
[02:10:24] Discovery brings together the models, the HPC compute, the knowledge graphs of all the scientific knowledge, the automated lab and simulation into one agentic discovery loop.
[02:10:38] And so today, I'm really thrilled to GA Microsoft Discovery, and to show you all of this in action, let me invite up on David on stage.
[02:10:47] David, take your way.
[02:10:49] Hello.
[02:10:49] So, I work in the Microsoft Discovery and Quantum team.
[02:10:55] Today, to recycle a PET plastic like this bottle, you have to shred it and melt it.
[02:11:00] The result is degraded.
[02:11:02] You cannot use it to make this bottle again.
[02:11:05] It's downcycling.
[02:11:06] Cambridge Consultants, part of Cat Gemini, is using Microsoft Discovery to advance that research.
[02:11:12] Let me show it to you.
[02:11:14] This is the Microsoft Discovery app.
[02:11:17] It is based on VS Code, because agentic discovery has many parallels with agentic software engineering.
[02:11:24] Today, I want to do these three things as a scientist.
[02:11:28] First, I want to write a scientific paper about this topic.
[02:11:32] I'll explore an existing line of research.
[02:11:34] Instead of melting the plastic, I can use proteins to the composite, so you can recycle it again and again.
[02:11:44] Second, I want to perform the actual discovery to find new proteins.
[02:11:50] And third, I want to create a lab protocol to test the results in a real lab.
[02:11:57] If you are a developer, these three steps should be very familiar to you.
[02:12:01] They look a lot like planning, coding and testing, and deploying to production, but for science.
[02:12:08] Let's launch it.
[02:12:11] This will kick off the discovery engine.
[02:12:13] You can think of this as a team of specialized agents, always running in the background, following the scientific method.
[02:12:21] You can see those agents here, and you can add more.
[02:12:24] Microsoft Discovery includes a community of agents, models, and tools across many domains.
[02:12:31] You can use open source, third parties, or create your own.
[02:12:37] Okay, so this is still running, and it will be running for a while, even hours or days, because the discovery engine, like science, is not sequential.
[02:12:47] It is exploring hypotheses and performing long-running simulations dynamically.
[02:12:53] So let me open one that is already completed.
[02:12:55] You can see the files that it created here.
[02:12:58] This one is the research paper that I was asking for.
[02:13:03] To create it, Discovery is using a knowledge graph internally, bringing together public scientific literature and internal knowledge.
[02:13:12] This is a critical asset, because it provides complete visibility of the reasoning, so scientists can be in full control.
[02:13:20] We can go now to step two, the discovery itself.
[02:13:24] Let me open the output for that one.
[02:13:26] So, how do you come up with new proteins to decompose plastic?
[02:13:32] This is the approach that Discovery took.
[02:13:34] First, you need an AI model to predict how good a protein is for that.
[02:13:40] Then, you need a way to generate new proteins.
[02:13:43] And finally, you need to identify the most promising ones.
[02:13:47] So, let's start with the model.
[02:13:49] Microsoft Discovery trained multiple models and picked the best.
[02:13:54] And there's no out-of-the-box agent for that, but that's okay.
[02:13:58] If Microsoft Discovery does not find an agent for a task, it will create one on the fly, which is pretty cool.
[02:14:05] So, here's all the files that it created.
[02:14:08] This one here is the YAML for the agent.
[02:14:11] And this one, for example, is the Python code to train the models.
[02:14:15] Now, we need to generate candidates for new proteins.
[02:14:18] And that task requires a lot of compute.
[02:14:22] Discovery is integrated with HPC, so agents can use it for complex simulations.
[02:14:28] You can see the process here.
[02:14:31] It started with a seed protein, and then it created variations by replacing small segments.
[02:14:37] You can see those segments there.
[02:14:39] Then it applied the model from before to see if the variation helps or not, learning in that process.
[02:14:46] This is done millions of times in multiple jobs in parallel, exploring a huge tree of proteins.
[02:14:54] The result is 80 proteins that are ready to be sent to the lab for testing, which we said it was like deploying software, right?
[02:15:02] The problem, though, is that creating a protein is a little bit more complicated.
[02:15:08] The most common way is inserting DNA into bacteria, so the bacteria create the protein for me.
[02:15:15] So, Discovery created another file.
[02:15:18] This one here.
[02:15:19] This file contains all the DNA sequences to create each protein.
[02:15:23] I can send this now to a lab, create the proteins, and test them.
[02:15:28] Or I can go one step further.
[02:15:31] If I have an automated lab, I can integrate it directly with the agents.
[02:15:36] Let's do it.
[02:15:37] So, let me go back to the session, and let's just submit job to lab.
[02:15:43] Go.
[02:15:44] This will use a custom agent that is sending instructions for the lab equipment.
[02:15:49] And this is very real.
[02:15:51] Cambridge Consultants does have an automated lab, and I have it right here.
[02:15:57] So, this is the application control in the lab.
[02:15:59] It has a co-pilot interface, so the scientists can interact with the lab to design experiments,
[02:16:05] which is, it feels like being Iron Man, but for chemistry.
[02:16:08] In this case, Discovery is admitting the job.
[02:16:11] You can see it already here.
[02:16:12] But remember, you need to grow the bacteria, so this will take some time.
[02:16:16] So, let me open a previous round.
[02:16:18] And there you go.
[02:16:19] These are all the steps that the agents are doing.
[02:16:23] Most of them completely automated with human supervision.
[02:16:27] How cool is that?
[02:16:35] This is bringing together the physical world and the digital agents in a unified discovery loop.
[02:16:41] And this was just one example of what Microsoft Discovery can do.
[02:16:45] Customers across industries are using it today to embrace a new era of scientific discovery.
[02:16:50] Thank you.
[02:16:51] Back to Yusati.
[02:16:56] Thank you so much, David.
[02:16:58] You know, talking about scientific discovery, we're also continuing to make rapid progress
[02:17:05] on our long-term goal of building a scalable quantum computer.
[02:17:10] We announced last year our first QPU.
[02:17:14] We created a new state of matter that was only theorized 100 years ago.
[02:17:20] And we proved it out that it exists.
[02:17:23] Our vision was to take a very radically different approach to addressing the fundamental barriers
[02:17:30] to building a scalable quantum machine, which is all about reliability, speed, as well as size.
[02:17:38] Since then, we have continued to make progress across the full quantum stack
[02:17:43] with both our academic and industry partners.
[02:17:47] In fact, in QNorth, we will have a quantum computer powered by atom computers, natural atom computers.
[02:17:54] With our stack in there, we are also working with Algorithmic and Columbia and ETH Zurich.
[02:18:00] And we ourselves, in fact, use this discovery agentic loop to accelerate the work in quantum compressing
[02:18:10] years of research into this last year.
[02:18:14] And today, I'm really thrilled to announce Majorana 2.
[02:18:19] And so this is Majorana 2.
[02:18:23] Majorana 2 implements the next generation material stack that we use discovery
[02:18:30] to discover and build and help fabricate.
[02:18:33] The resulting qubits are exceptionally reliable and capable of maintaining their state much
[02:18:40] longer.
[02:18:41] While other common approaches deliver a lifetime of just microseconds or even milliseconds,
[02:18:47] Majorana 2 provides qubit mean lifetime of 20 seconds or up to even a minute,
[02:18:54] essentially 1,000 times higher than what we were able to achieve at Majorana 1.
[02:19:02] And the operations, and this is so key, operations in Majorana 2 are one microsecond, enabling pretty complex
[02:19:10] quantum computation in that lifetime.
[02:19:14] And all of this in the same qubit form factor of Majorana 1 in one 100th of a millimeter controlled all digitally, which is again a super important aspect.
[02:19:24] Making it all possible to fit a million of these qubits in a chip smaller than a credit card.
[02:19:31] It's this combination of the reliability, the speed, the size, that makes the topological approach so unique.
[02:19:41] With Majorana 1, we had proven out the foundational physics, and with Majorana 2, now we begin the engineering scale.
[02:19:50] But ultimately, it is never about tech for tech's sake, it's about tackling those pressing challenges of people and planet.
[02:20:07] It is also the fundamental point of this conference.
[02:20:12] The question is not whether you can build the next great model, the next great platform, or even this quantum machine.
[02:20:19] The question is, how do we build this frontier ecosystem together?
[02:20:25] Because there are really two stories that people can tell about this moment.
[02:20:31] One is that technology concentrates power, reduces human agency, and leaves the society to absorb the consequences.
[02:20:40] The other is that we use this next wave to unlock opportunity for developers, scientists, enterprises, and every community.
[02:20:51] And our job is to make the second story true.
[02:20:54] That's our North Star for the frontier ecosystem.
[02:20:57] Let's all go build together.
[02:20:59] Thank you all very, very much.
[02:21:00] Thank you.
[02:21:10] Thank you.
[02:21:40] This is one of those rare moments when something that once felt beyond imagination, suddenly feels within reach.
[02:22:00] Quantum is our most fundamental physical theory.
[02:22:02] It's really the low-level operating system of the real physical world.
[02:22:06] For quantum computing to deliver real world impact, it has to be both reliable and scalable.
[02:22:15] Reaching quantum at scale will allow us to take a more direct approach to solving problems in chemistry.
[02:22:21] The discovery of new material.
[02:22:23] Life sciences.
[02:22:24] The way in which we bring drugs to market.
[02:22:27] It opens up millions of possibilities.
[02:22:31] because quantum will change the world, and this change is just beginning to happen.