Summary
Keeping large content repositories organized is an ongoing challenge. There's always new stuff coming in, and taxonomies evolve over time. Resource-strapped teams seldom have opportunities to re-categorize older content. It's a task well-suited for generative AI. Large language models have powerful capabilities that can help teams keep content organized at scale. Using LLMs in this capacity can lead to better user experiences and free team members to focus on more valuable efforts. This presentation explores two approaches for using LLMs to organize content at scale: 1) re-categorizing content using existing categories and 2) developing new categories from existing content. Both will be shown as proofs of concept alongside feasible next steps.
Key Insights
-
•
Large content repositories often become disorganized over time as products and markets evolve, yet organizations tend to deprioritize reorganizing content.
-
•
Search alone is insufficient for navigating large content repositories because users may lack context about the content available.
-
•
Taxonomies and content categories help users understand and navigate both external and internal content repositories.
-
•
AI, particularly large language models like GPT-4, can effectively assist in retagging large amounts of existing content faster than manual processes.
-
•
Maintaining human oversight while using AI tools is critical to prevent errors such as hallucinated tags or irrelevant categorizations.
-
•
Creating new taxonomies for large repositories requires analyzing the entire corpus, for which techniques like embedding databases and retrieval augmented generation (RAG) are useful.
-
•
Graph RAG, which integrates knowledge graphs with LLMs, improves precision by incorporating semantic relationships beyond keyword similarity.
-
•
AI tools enable new workflows that increase not just speed but open novel possibilities in content organization and information architecture.
-
•
Privacy concerns in client work lead to experimentation with local AI models to avoid exposing sensitive internal content.
-
•
Learning AI tools hands-on is essential for content professionals to harness their full potential and adapt workflows accordingly.
Notable Quotes
"Keeping large content repositories organized is an ongoing challenge that often gets deprioritized."
"Search alone might not cut it, especially because people often don’t know what to look for or don’t have enough context."
"I thought, if terms aren’t understandable to users, GPT-4 probably won’t understand them either, so I cleaned up the taxonomy."
"The LLM introduced some tags of its own even though I asked it not to, so I made sure to review everything before changes went live."
"Using GPT-4 to retag 1,200 posts took about a third of the time manual tagging would have taken."
"LLMs are optimized to work with short snippets of text, so we have to prepare content carefully before feeding it to them for big picture analysis."
"Graph RAG improves on plain text RAG by using knowledge graphs, adding semantic relationships that boost precision."
"These tools have increased both my speed, efficiency, and efficacy as an information architect."
"Working with these tools requires developing different workflows and questioning how we’ve always done things."
"The only way to get a taste for what these AI tools can do is by actually getting stuff done with them."
Dig deeper—ask the Rosenbot:














More Videos

"We scaled panels down to about 60 countries in 11 languages focusing on markets important to Airbnb teams."
Wyatt HaymanGlobal Research Panels (Videoconference)
August 8, 2020

"Instead of choosing colors and then checking contrast, we define target contrast ratios first and generate colors accordingly."
PJ Buddhari Nate BaldwinMeet Spectrum, Adobe’s Design System
June 9, 2021

"Government often moves slowly and struggles to communicate future intent amid short-term sustainability concerns."
Sarah GallimoreInspire Progress with Artifacts from the Future
November 18, 2022

"Developing trust means showing you understand what it takes to get something shipped, that you’re reliable, and that people can be vulnerable with you."
Peter MerholzThe Trials and Tribulations of Directors of UX (Videoconference)
July 13, 2023

"Intersectionality is a framework to describe compounding levels of discrimination people experience."
Dr. Jamika D. Burge Mansi GuptaAdvancing the Inclusion of Womxn in Research Practices (Videoconference)
September 15, 2022

"Investment is attention. If the business funds you, they believe in what you’re doing."
Amy MarquezINVEST: Discussion
June 15, 2018

"Do you play games? Cool, you’re in. Stop asking questions. It’s about behavior, not identity."
Dane DeSutter Natalie Gedeon Deborah Hendersen Cheryl PlatzBeyond the Console: The rise of the Gamer Experience and how gaming will impact UX Research across industries (Videoconference)
May 17, 2024

"Denying the body is not an antidote to the suffering of the world. When we protect our rest, we protect our dreaming."
Zariah CameronReDesigning Wellbeing for Equitable Care in the Workplace
September 23, 2024

"Regular positive feedback, not just negative, is essential because many with ADHD rely on extrinsic motivation."
Jessica NorrisADHD: A DesignOps Superpower
September 9, 2022