ADeLe: Predicting and explaining AI performance across tasks
AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into the underlying capabilities that drive that performance. They do not explain failures or reliably predict outcomes on new tasks. To address this, Microsoft researchers in collaboration ...
AsgardBench: A benchmark for visually grounded interactive planning
Imagine a robot tasked with cleaning a kitchen. It needs to observe its environment, decide what to do, and adjust when things don’t go as expected, for example, when the mug it was tasked to wash is already clean, or the sink is full of other items. This is the domain of embodied AI: systems […]
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most systems split these decisions into two steps: a VLM generates a plan in natural language, and a separate model translates it into executable ac...
Are machines truly intelligent? AI researchers Subutai Ahmad and Nicolò Fusi join Doug Burger to compare transformer-based AI with the human brain, exploring continual learning, efficiency, and whether today’s models are on a path toward human intelligence.
Systematic debugging for AI agents: Introducing the AgentRx framework
As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-step API workflows, a new challenge has emerged: transparency. When a human makes a mistake, we can usually trace the logic. But when an AI a...
From raw interaction to reusable knowledge: Rethinking memory for AI agents
It seems counterintuitive: giving AI agents more memory can make them less effective. As interaction logs accumulate, they grow large, fill with irrelevant content, and become increasingly difficult to use. More memory means that agents must search through larger volumes of past interactions to find...
Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model
We are pleased to announce Phi-4-reasoning-vision-15B, a 15-billion-parameter open-weight multimodal reasoning model, available through Microsoft Foundry, HuggingFace, and GitHub. Phi-4-reasoning-vision-15B is a broadly capable model that can b...
By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and all demanding attention at once. For AI agents to be genuinely useful in that environment, they will need to operate the same way, but toda...
Faster decisions: How an AI agent is redefining executive workflows at one of the world’s largest building materials companies
Rethinking imitation learning with Predictive Inverse Dynamics Models
This research looks at why Predictive Inverse Dynamics Models (PIDMs) often outperform standard Behavior Cloning in imitation learning. By using simple predictions of what happens next, PIDMs reduce ambiguity and learn from far fewer demonstrations.
Paza: Introducing automatic speech recognition benchmarks and models for low-resource languages
Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages. It covers 39 African languages and 52 models and is tested with communities in real settings.
Multimodal reinforcement learning with agentic verifier for AI agents
Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and produces more reliable, data-efficient agents for real-world applications.
OptiMind: A small language model with optimization expertise
OptiMind is a small language model that converts business operation challenges, described naturally, into mathematical formulations that optimization software can solve. It reduces formulation time and errors, and enables fast, privacy-preserving local use.
Agent Lightning: Adding reinforcement learning to AI agents without code rewrites
By decoupling how agents work from how they’re trained, Agent Lightning turns each step an agent takes into data for reinforcement learning. This makes it easy for developers to improve agent performance with almost zero code changes.