Summary
Keeping large content repositories organized is an ongoing challenge. There's always new stuff coming in, and taxonomies evolve over time. Resource-strapped teams seldom have opportunities to re-categorize older content. It's a task well-suited for generative AI. Large language models have powerful capabilities that can help teams keep content organized at scale. Using LLMs in this capacity can lead to better user experiences and free team members to focus on more valuable efforts. This presentation explores two approaches for using LLMs to organize content at scale: 1) re-categorizing content using existing categories and 2) developing new categories from existing content. Both will be shown as proofs of concept alongside feasible next steps.
Key Insights
-
•
Large content repositories often become disorganized over time as products and markets evolve, yet organizations tend to deprioritize reorganizing content.
-
•
Search alone is insufficient for navigating large content repositories because users may lack context about the content available.
-
•
Taxonomies and content categories help users understand and navigate both external and internal content repositories.
-
•
AI, particularly large language models like GPT-4, can effectively assist in retagging large amounts of existing content faster than manual processes.
-
•
Maintaining human oversight while using AI tools is critical to prevent errors such as hallucinated tags or irrelevant categorizations.
-
•
Creating new taxonomies for large repositories requires analyzing the entire corpus, for which techniques like embedding databases and retrieval augmented generation (RAG) are useful.
-
•
Graph RAG, which integrates knowledge graphs with LLMs, improves precision by incorporating semantic relationships beyond keyword similarity.
-
•
AI tools enable new workflows that increase not just speed but open novel possibilities in content organization and information architecture.
-
•
Privacy concerns in client work lead to experimentation with local AI models to avoid exposing sensitive internal content.
-
•
Learning AI tools hands-on is essential for content professionals to harness their full potential and adapt workflows accordingly.
Notable Quotes
"Keeping large content repositories organized is an ongoing challenge that often gets deprioritized."
"Search alone might not cut it, especially because people often don’t know what to look for or don’t have enough context."
"I thought, if terms aren’t understandable to users, GPT-4 probably won’t understand them either, so I cleaned up the taxonomy."
"The LLM introduced some tags of its own even though I asked it not to, so I made sure to review everything before changes went live."
"Using GPT-4 to retag 1,200 posts took about a third of the time manual tagging would have taken."
"LLMs are optimized to work with short snippets of text, so we have to prepare content carefully before feeding it to them for big picture analysis."
"Graph RAG improves on plain text RAG by using knowledge graphs, adding semantic relationships that boost precision."
"These tools have increased both my speed, efficiency, and efficacy as an information architect."
"Working with these tools requires developing different workflows and questioning how we’ve always done things."
"The only way to get a taste for what these AI tools can do is by actually getting stuff done with them."
Or choose a question:
More Videos
"Being delightful is the only thing a video game does; without delight, the game dies."
This Game is Never Done: Design Leadership Techniques from the Video Game World
November 6, 2017
"Diagrams allow us to take our time while remaining focused on the big picture."
Abby CovertStuck? Diagrams Help (Videoconference)
October 27, 2022
"Some minorities have had to develop flexibility and adaptability, which can be both a burden and a source of power."
Steve Portigal Susan Simon-Daniels Tamara Hale Randolph Duke IIWar Stories LIVE! Q&A-Discussion
March 30, 2020
"International government design communities can share learnings and scale user-centered design beyond national borders."
Kara KaneCommunities of Practice for Civic Design (Videoconference)
April 7, 2022
"Identifying the right success metrics is a major problem because execs often don’t know what to ask for."
Caroline VizeThe State of UX: Five Lessons from 2021 to Accelerate Digital Experience in 2022
March 9, 2022
"We all will be here today if we haven’t practiced human centered design for as long as we remember."
Lada GorlenkoTheme 3: Introduction
June 10, 2021
"Your AI is learning from real-world input — sometimes from untrusted sources — so ethical iteration is essential."
Jay BustamanteNavigating the Ethical Frontier: DesignOps Strategies for Responsible AI Innovation
October 2, 2023
"When you empower others, you open up possibilities that transform your teams and accelerate your time to market."
Marjorie Stainback Kelsey KingmanTransforming Strategic Research Capacity through Democratization
October 24, 2019
"Design should inherently be ethical and trauma-responsive; trauma-informed should be our default."
Rachael Dietkus, LCSW Uday Gajendar Dr. Dawn Emerick Dawn E. Shedrick, LCSWLeading through the long tail of trauma (Videoconference)
July 13, 2022