Video: AI in Real Life: Using LLMs to Turbocharge Microsoft Learn

Thursday, February 13, 2025 • Rosenfeld Community

Sarah Barrett

Director of Information Architecture, Microsoft

Summary

Enthusiasm for AI tools, especially large language models like ChatGPT, is everywhere, but what does it actually look like to deliver large-scale user-facing experiences using these tools in a production environment? Clearly they're powerful, but what do they need to make them work reliably and at scale? In this session, Sarah provides a perspective on some of the information architecture and user experience infrastructure organizations need to effectively leverage AI. She also shares three AI experiences currently live on Microsoft Learn:

• An interactive assistant that helps users post high-quality questions to a community forum
• A tool that dynamically creates learning plans based on goals the user shares
• A training assistant that clarifies, defines, and guides learners while they study

Through lessons learned from shipping these experiences over the last two years, UXers, IAs, and PMs will come away with a better sense of what they might need to make these hyped-up technologies work in real life.

Key Insights

• "Everything" chatbots are overly ambiguous and difficult to optimize effectively.
• Targeted AI applications tailored to specific user tasks work better and reduce risk.
• The "ambiguity footprint" helps product teams assess AI feature complexity along multiple axes.
• Application context, and whether AI features are critical or complementary, affect their ambiguity.
• Visible AI interfaces set different user expectations than subtle or invisible AI features.
• Prompt design strongly shapes AI behavior: very similar interfaces can deliver very different outputs.
• Dynamic context injection into models adds power but significantly increases development complexity.
• Consistent, thorough evaluation is essential but often neglected in AI application development.
• Data privacy and ethical considerations restrict access to usage data, impeding evaluation efforts.
• Incrementally building AI capabilities on less ambiguous features trains the organizational muscles needed for more complex AI.
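
As an illustration of the evaluation insight above, here is a minimal sketch in Python of what planned evaluation can look like in contrast to eyeballing results: a fixed set of test cases is run through the model and scored the same way every time. This is not the approach described in the talk or used on Microsoft Learn. Every name here (`EvalCase`, `fake_model`, `run_eval`, the sample prompts, and the phrase-matching check) is a hypothetical chosen for illustration, and `fake_model` is a stand-in for a real model call.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalCase:
    """One fixed test case (hypothetical structure, for illustration only)."""
    prompt: str
    must_include: list[str]  # phrases a passing answer is assumed to contain


def fake_model(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM call; returns canned text."""
    canned = {
        "Define a learning plan.": "A learning plan is an ordered set of modules tied to a goal.",
        "How do I post a good question?": "Sorry, I cannot help with that.",
    }
    return canned.get(prompt, "")


def run_eval(model: Callable[[str], str], cases: list[EvalCase]) -> float:
    """Return the fraction of cases whose output contains every required phrase."""
    if not cases:
        return 0.0
    passed = 0
    for case in cases:
        output = model(case.prompt).lower()
        if all(phrase.lower() in output for phrase in case.must_include):
            passed += 1
    return passed / len(cases)


# Assumed sample cases; a real suite would be larger and reviewed by people.
CASES = [
    EvalCase("Define a learning plan.", ["ordered set", "goal"]),
    EvalCase("How do I post a good question?", ["title", "details"]),
]

if __name__ == "__main__":
    print(f"pass rate: {run_eval(fake_model, CASES):.0%}")
```

Phrase matching is only the simplest possible check. The point of the sketch is that the cases and the scoring rule are fixed in advance, so results can be compared across prompt or model changes.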

Notable Quotes

"You’re building three apps in a trench coat with a kind of iffy interface slapped on top of it."

"Chat really isn’t necessarily the best interface for lots of user tasks."

"We tend to see PMs and designers converging on a single everything chatbot, which I find insufficient."

"Ambiguity is inherent when working with AI, but that doesn’t mean you have to accept all of it."

"If you haven’t planned for evaluation, you end up eyeballing the results, which absolutely does not work."

"Responsible AI practices and legal reviews at Microsoft saved us from launching dangerous ambiguous features."

"The bigger and more ambiguous is not always better in AI applications."

"Very similar interfaces can conceal extremely different AI prompts, which shape the outputs."

"Dynamic context is more powerful but adds a ton of stuff to build, making AI development trickier."

"Many organizations just ask around and call it good when evaluating AI models, which is not sufficient."
