Log in or create a free Rosenverse account to watch this video.
Log in Create free account100s of community videos are available to free members. Conference talks are generally available to Gold members.
This video is featured in the Ed's Test playlist playlist.
Summary
Artificial Intelligence (AI) tools like large language models present opportunities — and risks — for people working with digital content. Can AI help with tedious tasks, like encoding and categorizing documents, or rewriting text snippets? Will AI be so good at these tasks that Information Architects (IAs) are no longer needed? In this session, Jeff and Karen provide an overview of what "AI" means for IAs, explaining the differences between natural language processing, machine learning, and large language models. They also dig into a real-world example of using different systems to categorize web content from Reddit. IAs will come away from the session reassured that the robots pose no threat to their jobs.
Key Insights
-
•
Natural language processing is about mathematically modeling words and their relationships but struggles with context and nuance.
-
•
Machine learning models detect statistical patterns but do not truly understand content or reason about it.
-
•
Large language models like ChatGPT work by predicting what text should follow based on massive internet-scale training data.
-
•
Tokenization breaks text into meaningful units, filtering out filler words to create useful representations for AI.
-
•
Choosing the right AI model and customizing prompts is more important than using the 'biggest' or most popular model.
-
•
Using AI for content categorization requires iterative refinement, precise instructions, and ongoing validation.
-
•
AI tools often generate plausible but overlapping categories, which complicates achieving mutually exclusive, exhaustive taxonomies.
-
•
Cost, time, and carbon footprint are significant considerations for scaling AI workflows, making simpler tools sometimes preferable.
-
•
Visualization techniques like Sankey diagrams and proximity charts help interpret AI categorization results and spot issues.
-
•
Human expertise in information architecture is more vital than ever to effectively harness AI in complex content tasks.
Notable Quotes
"AI is not going to save us. It is a tool that we use with significant human support and understanding."
"Trying to map all the possible connotations and contextual interrelationships between words is mindbogglingly computationally expensive."
"The risk with LLMs is that they invent new categories or start writing Reddit posts instead of just categorizing."
"The best large language models we tested only achieved about 47% accuracy in categorizing Reddit posts."
"Breaking down tasks into one thing at a time dramatically improves AI performance and reliability."
"Choosing a tool designed for your specific problem matters way more than just picking the smartest AI available."
"We ended up using garbage ID numbers for categories to keep the LLM from inventing new ones."
"It wasn’t obvious the difference between AI generating category ideas and validating categories until we tried both."
"Humans still excel at mental modeling of topics and nuance that AI struggles to replicate automatically."
"Foundational IA and metadata infrastructure are the meat and potatoes supporting scalable AI-driven experiences."
Or choose a question:
More Videos
"Sometimes you’re the leader, sometimes you’re at the end, sometimes you’re trying to get other folks to come in."
Jennifer KanyamibwaCreating the Blueprint: Growing and Building Design Teams
November 8, 2018
"There’s a tension between immediate evaluative research and more durable insights for future use."
Brigette Metzler Dana ChrisfieldResearch Repositories: A global project by the ResearchOps Community (Videoconference)
August 27, 2020
"Project delays are frequently due to organizational and cultural issues rather than actions of the project team."
Carl TurnerYou Can Do This: Understand and Solve Organizational Problems to Jumpstart a Dead Project
March 28, 2023
"Leadership is unlocking potential in people and processes and taking responsibility for doing that."
John Mortimer Milan Guenther Lucy Ellis Patrick QuattlebaumPanel Discussion
December 3, 2024
"The hardest lesson: being comfortable when you’re uncomfortable and figuring out how to create the most impact."
Dante GuintuHow to Crush the Talent Crunch
September 8, 2022
"The new marketing is about preparing users or customers for the new kinds of products that will emerge as time moves forward."
Richard BuchananCreativity and Principles in the Flourishing Enterprise
June 15, 2018
"The last of long time, we end up in bars late at night, exhausted but happy from all the conversations and learning."
Dan WillisTheme 3: Intro
January 8, 2024
"We do experiments to learn, and if we learn something, then it’s a successful experiment."
Dan WardFailure Friday #1 with Dan Ward
February 7, 2025
"Generative workshops such as design sprints can devolve into political theater if we’re not careful."
Chris GeisonTheme Two Intro
March 28, 2023