IWAC Chatbot
The IWAC Chatbot offers a new way to explore the Islam West Africa Collection (IWAC) through natural language conversation. Instead of relying on exact keywords, users can ask complex questions and receive synthesized answers drawn directly from the collection's newspaper articles.
Why a Chatbot?
Keyword searches have significant limitations for historical research. They miss synonyms, struggle with historical vocabulary and OCR errors, and cannot capture conceptual relationships. A search for "hajj" won't find documents discussing "pilgrimage to Mecca" unless those exact words appear.
The chatbot overcomes these barriers using semantic search. It understands meaning beyond surface terms, recognizing that documents about "hijab" may be relevant to questions about "veiling practices"—even without keyword overlap.
How It Works
The system uses Retrieval-Augmented Generation (RAG) to connect a Large Language Model to the IWAC's document collection. When you ask a question like "What were the difficulties of organising the pilgrimage to Mecca in Togo?", it retrieves the most relevant articles based on meaning, then generates a response grounded exclusively in those sources—with citations and links for verification.
This approach addresses a common problem with AI: the tendency to invent information. By anchoring responses in actual documents, every answer can be traced back to its sources.
Features
- Natural language queries: Ask complex questions without mastering specialized search techniques
- Semantic connections: Discover thematic links that keyword searches would miss
- Filtering options: Narrow results by date, newspaper, location, or subject
- Full transparency: Every response includes citations and direct links to IWAC documents
Try It Now
While the custom chatbot is being finalized, an IWAC workspace is available in Google NotebookLM. Over 11,500 newspaper articles have been loaded as sources. Users can chat in multiple languages and generate summaries, audio overviews, and structured reports—all grounded in the IWAC corpus.
The chatbot prototype is available on GitHub.