The Kore.ai NLP engine uses Machine Learning, Fundamental Meaning, and Knowledge Graph (if any) models to match intents. All the three Kore.ai engines finally deliver their findings to the Kore.ai Ranking and Resolver component as either exact matches or probable matches. Ranking and Resolver determines the final winner of the entire NLP computation.
The NLP engine uses a hybrid approach using Machine Learning, Fundamental Meaning, and Knowledge Graph (if the bot has one) models to score the matching intents on relevance. The model classifies user utterances as either being Possible Matches or Definitive Matches.
Definitive Matches get high confidence scores and are assumed to be perfect matches for the user utterance. In published bots, if user input matches with a single Definitive Match, the bot directly executes the task. If the utterances match with multiple Definitive Matches, they are sent as options for the end-user to choose one.
On the other hand, Possible Matches are intents that score reasonably well against the user input but do not inspire enough confidence to be termed as exact matches. Internally the system further classifies possible matches into good and unsure matches based on their scores. If the end-user utterances were generating possible matches in a published bot, the bot sends these matches as “Did you mean?” suggestions for the end-user.
Based on the ranking and resolver, the winning intent between the engines is ascertained. If the platform finds ambiguity, then an ambiguity dialog is initiated. The platform initiates one of these two system dialogs when it cannot ascertain a single winning intent for a user utterance :
- Disambiguation Dialog: Initiated when there are more than one Definitive matches returned across engines. In this scenario, the bot asks the user to choose a Definitive match to execute. You can customize the message shown to the user from the NLP Standard Responses.
- Did You Mean Dialog: Initiated if the Ranking and Resolver returns more than one winner or the only winning intent is an FAQ whose KG engine score is between lower and upper thresholds. This dialog lets the user know that the bot found a match to an intent that it is not entirely sure about and would like the user to select to proceed further. In this scenario, the developer should identify these utterances and train the bot further. You can customize the message shown to the user from the NLP Standard Responses.
Thresholds & Configuration
Ranking & Resolver Engine can be configured as follows:
- Open the bot for which you want to configure thresholds.
- Hover over the side navigation panel and then click Natural Language -> Training.
- Click the Thresholds & Configurations tab.
- Ranking & Resolver Engine section allows you to set the threshold
- Proximity of Probable Matches which defines the maximum difference to be allowed between top-scoring and immediate next possible intents to consider them as equally important. Prior to v7.3 of the platform, this setting was under the Fundamental Meaning section.
- Dependency Parsing Model for enabling rescoring the intents by the Ranking and Resolver engine as well as for the intent recognition by the Fundamental Meaning model. This configuration is disabled by default and needs to be set implicitly. See below for details.
Dependency Parsing Model
The platform has two models for scoring intents by the Fundamental Meaning Engine and the Ranking & Resolver Engine:
- The first model predominantly relies on the presence of words, the position of words in the utterance, etc. to determine the intents and is scored solely by the Fundamental Meaning Engine. This is the default setting.
- The second model is based on the dependency matrix where the intent detection is based on the words, their relative position, and most importantly the dependency between the keywords in the sentence. Under this model, intents are scored by the Fundamental Meaning Engine and then rescored by the Ranking & Resolver Engine.
Dependency Parsing Model can be enabled and configured from the Ranking & Resolver section under Natural Language -> Training -> Thresholds & Configurations.
Note: This feature was introduced in v7.3 of the platform and is supported only for select languages, see here for supported languages.
Dependency Parsing Model can be configured as follows:
- Minimum Match Score to define the minimum score to qualify an intent as a probable match. It can be set to a value between 0.0 to 1.0, with the default set to 0.5.
- Advanced Configurations can be used to customize the model by changing the weights and scores associated with various parameters. This opens a JSON editor where you can enter the valid code. You can click the ‘restore to default configurations’ to get the default threshold settings in a JSON structure, you can change the settings provided you are aware of the consequences.
The Natural Language Analysis will result in the following scenarios:
- NLP Analysis identifying a Definitive match with FM or ML or KG engines
- NLP Analysis with Multiple engines returning probable match and selecting a single match
- NLP Analysis with Multiple engines returning probable match and resolver returning back multiple results
- NLP Analysis with No match
Each of the above cases is discussed in this section.
To understand NLP detection, let us use the example of a Bank bot with the following details:
Scenario 1 – FM identifying a Definitive match
- The Fundamental meaning(FM) model identified the utterance as a Definitive match.
- The Machine Learning (ML) model also identified it as a Possible match.
- The score returned for the task identified is 6 times more than other intent scores. Also, all the words in the intent name are present in the user utterance. Thus the FM model termed it a Definitive match.
- The ML model matches the Find ATM intent as a Probable match.
- The user utterance is “How do I make transfer money to a London account?”
- The user utterance contains all the terms required to match this Knowledge Graph intent path Transfer, Money, International.
- The term international is identified as a synonym of London that the user used in the utterance.
- As 100% path term matched the path was qualified. As part of confidence scoring, the terms in the user query are similar to that of the actual Knowledge Graph question. Thus, it returns a score of 100.
- As the score returned is above 100, the intent is marked as a Definitive match and selected.
- FM engine found it a Probable match as the key term Transfer is present in the user utterance
- ML engine found the utterance as a Probable match as the utterance did not fully match any trained utterance.
- All the 3 engines returned possible match and no definitive match
- ML Model has 1 possible matches and FM Model has 2 possible matches, of which 1 is common. Knowledge Graph has 1 possible match. All possible matches identified are re-ranked in the Ranking and Resolver.
- The Ranking and Resolver component returned the highest score for the single match (Task name – “ When can I start making payments using BillPay plus? ”) from the Knowledge graph engine. The scores for other probable match come out to be lower than 2 percentile of the top score and are thus ignored. The winner, in this case, is the ‘KG’ returned query and is presented to the user.
- Though most of the keywords in the user utterance map to the keywords in the KG query, still this is not a definitive match because
- The number of path term matched are not 100%
- The KG engine returned the score with a 64.72% probability. Had we used the word ‘Billpay’ instead of ‘bill pay’ the score would have been 87.71%. (still not a 100% match)
- Now as the score is between the 60%-80% threshold the Query is presented as part of the ‘Did-you-mean’ dialog and not as a complete winner. If the score was above 80% the platform would have given out the response without re-confirming with the ‘Did-you-mean’ dialog.
Scenario 5 – Resolver returning back multiple results
- All the engines detected probable matches
- KG returned with 2 possible paths
- Ranking and resolver found the 2 queries with a score of less than 2% apart.
- Both the Knowledge Graph intents are selected and presented to the user as ‘Did-you-mean’
- Both the paths were selected as terms in both matched and the score for both the paths is more than 60%