Summary
Keeping large content repositories organized is an ongoing challenge. There's always new stuff coming in, and taxonomies evolve over time. Resource-strapped teams seldom have opportunities to re-categorize older content. It's a task well-suited for generative AI. Large language models have powerful capabilities that can help teams keep content organized at scale. Using LLMs in this capacity can lead to better user experiences and free team members to focus on more valuable efforts. This presentation explores two approaches for using LLMs to organize content at scale: 1) re-categorizing content using existing categories and 2) developing new categories from existing content. Both will be shown as proofs of concept alongside feasible next steps.
Key Insights
-
•
Large content repositories often become disorganized over time as products and markets evolve, yet organizations tend to deprioritize reorganizing content.
-
•
Search alone is insufficient for navigating large content repositories because users may lack context about the content available.
-
•
Taxonomies and content categories help users understand and navigate both external and internal content repositories.
-
•
AI, particularly large language models like GPT-4, can effectively assist in retagging large amounts of existing content faster than manual processes.
-
•
Maintaining human oversight while using AI tools is critical to prevent errors such as hallucinated tags or irrelevant categorizations.
-
•
Creating new taxonomies for large repositories requires analyzing the entire corpus, for which techniques like embedding databases and retrieval augmented generation (RAG) are useful.
-
•
Graph RAG, which integrates knowledge graphs with LLMs, improves precision by incorporating semantic relationships beyond keyword similarity.
-
•
AI tools enable new workflows that increase not just speed but open novel possibilities in content organization and information architecture.
-
•
Privacy concerns in client work lead to experimentation with local AI models to avoid exposing sensitive internal content.
-
•
Learning AI tools hands-on is essential for content professionals to harness their full potential and adapt workflows accordingly.
Notable Quotes
"Keeping large content repositories organized is an ongoing challenge that often gets deprioritized."
"Search alone might not cut it, especially because people often don’t know what to look for or don’t have enough context."
"I thought, if terms aren’t understandable to users, GPT-4 probably won’t understand them either, so I cleaned up the taxonomy."
"The LLM introduced some tags of its own even though I asked it not to, so I made sure to review everything before changes went live."
"Using GPT-4 to retag 1,200 posts took about a third of the time manual tagging would have taken."
"LLMs are optimized to work with short snippets of text, so we have to prepare content carefully before feeding it to them for big picture analysis."
"Graph RAG improves on plain text RAG by using knowledge graphs, adding semantic relationships that boost precision."
"These tools have increased both my speed, efficiency, and efficacy as an information architect."
"Working with these tools requires developing different workflows and questioning how we’ve always done things."
"The only way to get a taste for what these AI tools can do is by actually getting stuff done with them."
Or choose a question:
More Videos
"Transparency is building respect for what your teammates do."
Jennifer KanyamibwaCreating the Blueprint: Growing and Building Design Teams
November 8, 2018
"Research repositories and libraries are social things."
Brigette Metzler Dana ChrisfieldResearch Repositories: A global project by the ResearchOps Community (Videoconference)
August 27, 2020
"In many organizations, people believe the team from their functional area is their real team, not the cross-functional project team."
Carl TurnerYou Can Do This: Understand and Solve Organizational Problems to Jumpstart a Dead Project
March 28, 2023
"Gina was closing the dots and shifting perspective from a designer closer to the business."
John Mortimer Milan Guenther Lucy Ellis Patrick QuattlebaumPanel Discussion
December 3, 2024
"We don’t have all the answers; we’re in the thick of it all and actively learning just like you."
Dante GuintuHow to Crush the Talent Crunch
September 8, 2022
"Creativity means a change in perception, from the familiar to the unfamiliar, even initially ridiculous."
Richard BuchananCreativity and Principles in the Flourishing Enterprise
June 15, 2018
"Saul Metz is going to give advice on how to hire other outsiders like her."
Dan WillisTheme 3: Intro
January 8, 2024
"The Wright siblings had a sister, Catherine, who was a full participant and should also be remembered."
Dan WardFailure Friday #1 with Dan Ward
February 7, 2025
"We need to step out from informing decisions and into becoming changemakers."
Chris GeisonTheme Two Intro
March 28, 2023