Team
The project team consisted of a project manager, software developers, and data scientists, working in an agile development environment.
Development Challenges
PDF Document Parsing: Ensuring high-quality text extraction for some document formats proved to be challenging. Part of the dataset we used to build the chatbot included presentations. Unique formatting and tables with inconsistently merged columns work well for display purposes, but complicate text extraction. There are a large number of libraries in Python that help with this, each with its own strengths but none tackled every type of document perfectly.
Intent Detection: Detecting if a user was asking about available job listings vs job details required a fair amount of training in natural language understanding for the chatbot to correctly interpret the intent due to the nuances between questions about each topic.
Testing
A comprehensive testing strategy was implemented, including AI tests for relevance and hallucinations, integration tests, and user acceptance testing to ensure the chatbot met all functional and performance requirements.