The GenAI Node lets you leverage the full potential of LLMs and Generative AI to quickly build conversations that involve complex flows and provide human-like experiences. You can define the entities you would like to collect and the business rules that govern the collection of these entities. The XO Platform orchestrates the conversation using contextual intelligence, ensuring that the conversation is always grounded to your enterprise business rules. You can also provide exit rules for handing off the conversation to the virtual assistant or the human agents.
Note: The GenAI Node v2 is included in the XO v10.5.1 release. All GenAI Nodes and prompts created after this release are version 2.
Why a GenAI Node?
There are two key scenarios when a GenAI node might be beneficial:
- Handling co-referencing and entity correction in conversations: NLP might not pick up co-referencing and entity correction during a conversation. For example, in a flight booking task, someone might ask to book two window seats, then change their mind and ask to modify one of the
seat types
from thewindow
to themiddle
. In this scenario, the VA must correct the already collected entity(seat type)
and perform entity co-referencing to modify fromwindow
tomiddle
. - Managing complex flows without extensive scripting: Complex flows like the above increase dialog task complexity, requiring multiple paths and nodes. Even then, it is humanly impossible to predict all such scenarios. Scripting all these possibilities might also result in a sub-par end-user experience.
Leveraging a generative AI model mitigates these scenarios by eliminating the need to predict and configure such complex possibilities while still under the constraint of defined rules and exit scenarios. This can facilitate more natural conversations and improve end-user experience.
What’s New in Version 2 of the GenAI Node
Node Level Enhancements
- Conversation History Length: Specify the number of recent messages (both user and VA) to send to the language model as context.
Custom Prompt Enhancements
- Required Entities: A new dynamic variable holding a comma-separated list of entity names to be captured by the LLM. This allows platform users to specify which entities need to be collected or included in the output.
- Collected Entities: An object containing the entities and their values collected by the language model.
- Custom Prompt Creation using JavaScript: The Platform introduces a JavaScript mode that enables you to create prompts using JavaScript. It will process the JavaScript and any variables in the prompt to generate a JSON object. The users can preview and validate the scripts by seeing the key-value pairs of the resulting JSON object, similar to a message node. Finally, the system will send the generated JSON object to the configured model.
Sample JavaScript
const jsonRepresentation = { Â Â messages: [ Â Â Â Â { Â Â Â Â Â Â role: "system", Â Â Â Â Â Â content: `You are a virtual assistant representing an enterprise business. Act professionally at all times and do not engage in abusive language or non-business-related conversations. ${System_Context} Your task is to collect entities from user input and conversation history. Entities to collect: ${Required_Entities} Entities already collected: ${JSON.stringify(Collected_Entities)}. Business rules for entity collection: ${Business_Rules}. Instructions: - Capture all mentioned entities. - Do not prompt for entities that have already been provided. - Generate appropriate prompts to collect unfulfilled entities only in ${Language} Language and keep the entities collected in the Original Language. - Keep prompts and messages voice-friendly. Output format: STRICTLY RETURN A JSON OBJECT WITH THE FOLLOWING STRUCTURE: {"bot": "prompt to collect unfulfilled entities", "conv_status": "ongoing" or "ended", "entities": [{key1: value1, key2: value2, ...}]} Always ensure that the entities collected SHOULD be in an array of one object. Conversation status: Mark conv_status as 'ended' when all entity values are captured or if any of the following scenarios are met: ${Exit_Scenarios} - Otherwise, set conv_status as 'ongoing'.` Â Â Â Â }, Â Â Â Â ...Conversation_History, Â Â Â Â { Â Â Â Â Â Â Â Â "role": "user", Â Â Â Â Â Â Â Â "content": `${User_Input}` Â Â Â Â } Â Â ], Â Â model: "gpt-4", Â Â temperature: 0.73, Â Â max_tokens: 300, Â Â top_p: 1, Â Â frequency_penalty: 0, Â Â presence_penalty: 0 }; context.payloadFields = jsonRepresentation;
Support for Variables
- Support for Dynamic Variables: Context, Environment, and Content variables can now be used in pre-processor scripts, post-processor scripts, and custom prompts.
Node Behavior
Runtime
You can work with this node like any other node within Dialog Tasks and invoke it within multiple tasks. During runtime, the node behaves as follows:
- Entities Collection:
- On reaching the GenAI Node, the platform invokes the Generative AI model to understand the user input.
- The platform uses the entities and business rules defined as part of the node configurations to understand the user input and identify the required entity values.
- The responses required to prompt/inform the user are automatically generated based on the conversation context.
- The platform drives the conversation until all the defined entities are captured.
- Contextual Intents:
- Contextual intents (Dialog or FAQs) recognized from the user input continue to be honored as per the Interruption Settings defined in the bot definition.
- Post completion of the contextual intents, the flows can return to the GenAI Node.
- Exit Conditions:
- The platform exits from the GenAI Node when any of the defined exit conditions are met.
- These conditions provide you the ability to define scenarios that need a different path in the conversation, for example, handing off to a human agent.
- The platform can also exit the GenAI Node when the user exceeds the maximum number of volleys (retries to capture the required entities).
- The platform stores the entity values in the context object, and this information can be used to define the transitions or any other part of the bot configuration.
Output
The output generated by this node is fully usable throughout the dialog flow, even once the node is no longer in use. Output is maintained in a structured .json within the Context Object, so you can access and use the output throughout the rest of your flow.
Enable
By default, the feature/node is disabled. To enable the feature, Dynamic Conversations Features.
Add to a Task
Steps to add the GenAI Node to a Dialog Task:
- Go to Build > Conversational Skills > Dialog Tasks and select the task that you are working with.
- You can add the GenAI Node just like any other node. You can find it in the main list of nodes.
Configure GenAI Node
Component Properties
The component properties empower you to configure the following settings. The changes made within this section affect this node across all instances and dialog tasks.
General Settings
It allows you to provide a Name and Display Name for the node. The node name cannot contain spaces.
Advance Settings
Adjusting the settings allows you to fine-tune the model’s behavior to meet your needs. The default settings work fine for most cases. You can tweak the settings and find the right balance for your use case. A few settings are common in the features, and a few are feature-specific:
- Model: The selected model for which the settings are displayed.
- Prompt/Instructions or Context: Add feature/use case-specific instructions or context to guide the model.
- Conversation History Length: This setting allows you to specify the number of recent messages sent to the LLM as context. These messages include both user messages and virtual assistant (VA) messages. The default value is 10. This conversation history can be seen from the debug logs.
Note: Applicable only if you are using a custom prompt. - Temperature: The setting controls the randomness of the model’s output. A higher temperature, like 0.8 or above, can result in unexpected, creative, and less relevant responses. On the other hand, a lower temperature, like 0.5 or below, makes the output more focused and relevant.
- Max Tokens: It indicates the total number of tokens used in the API call to the model. It affects the cost and the time taken to receive a response. A token can be as short as one character or as long as one word, depending on the text.
- Fallback Behavior: Fallback behavior lets the system determine the optimal course of action on LLM call failure or the Guardrails are violated. You can select fallback behavior as:
- Trigger the Task Execution Failure Event
- Skip the current node and jump to a particular node. The system skips the node and transitions to the node the user selects. By default, ‘End of Dialog’ is selected.
Dialog Details
Under Dialog Details, configure the following:
Pre-Processor Script
This property helps execute a script as the first step when the GenAI Node is reached. Use the script to manipulate data and incorporate it into rules or exit scenarios as required. The Pre-processor Script has the same properties as the Script Node. Learn more.
To define a pre-processor script, click Define Script, add the script you want to execute, and click Save.
System Context
Add a brief description of the use case context to guide the model.
Entities
Specify the entities to be collected by LLM during runtime. In the Entities section, click + Add, enter an Entity Name, and select the Entity Type from the drop-down list.
Most entity types are supported. Here are the exceptions: custom, composite, list of items (enumerated and lookup), and attachment. See Entity Types for more information.
Rules
Add the business rules that the collected entities should respect. In the rules section, click + Add, then enter a short and to-the-point sentence, such as:
- The airport name should include the IATA Airport Code;
- The passenger’s name should include the last name.
There is a 250-character limit to the Rules field, and you can add a maximum of 5 rules.
Exit Scenarios
Specify the scenarios that should terminate entity collection and return to the dialog task. This means the node ends interaction with the generative AI model and returns to the dialog flow within the XO Platform.
Click Add Scenario, then enter short, clear, and to-the-point phrases that specifically tell the generative AI model when to exit and return to the dialog flow. For example, Exit when the user wants to book more than 5 tickets in a single booking and return "max limit reached"
.
There is a 250-character limit to the Scenarios field, and you can add a maximum of 5 scenarios.
Post-Processor Script
This property initiates the post-processor script after processing every user input as part of the GenAI Node. Use the script to manipulate the response captured in the context variables just before exiting the GenAI Node for both the success and exit scenarios. The Pre-processor Script has the same properties as the Script Node. Â Learn more.
Important Considerations
If the GenAI Node requires multiple user inputs, the post-processor is executed for every user input received.
To define a post-processor script, click Define Script and add the script you want to execute.
Instance Properties
Configure the instance-specific fields for this node. These apply only for this instance and will not affect this adaptive dialog node when used in other tasks. You must configure Instance Properties for each task where this node is used.
User Input
Define how user input validation occurs for this node:
- Mandatory: This entity is required and must be provided before proceeding.
- Allowed Retries: Configure the maximum number of times a user is prompted for a valid input. You can choose between 5-25 retries in 5-retries increments. The default value is 10 retries.
- Behavior on Exceeding Retries: Define what happens when the user exceeds the allowed retries. You can choose to either End the Dialog or Transition to a Node – in which case you can select the node to transition to.
User Input Correction
Decide whether to use autocorrect to mitigate potential user input errors:
- Autocorrect user input: The input will be autocorrected for spelling and other common errors.
- Do not autocorrect user input: The user input will be used without making any corrections.
Advanced Controls
Configure advanced controls for this node instance as follows:
Intent Detection
This applies only to String and Description entities: Select one of these options to determine the course of action if the VA encounters an entity as a part of the user utterance:
- Accept input as entity value and discard the detected intent: The VA captures the user entry as a string or description and ignores the intent.
- Prefer user input as intent and proceed with Hold & Resume settings: The user input is considered for intent detection, and the VA proceeds according to the Hold & Resume settings.
- Ask the user how to proceed: Allow the user to specify if they meant intent or entity.
Interruptions Behavior
To define the interruption handling at this node. You can select from the below options:
- Use the task level ‘Interruptions Behavior’ setting: The VA refers to the Interruptions Behavior settings set at the dialog task level.
- Customize for this node: You can customize the Interruptions Behavior settings by selecting this option and configuring it. You can choose whether to allow interruptions or not, or to allow the end user to select the behavior. You can further customize Hold and Resume behavior. Read the Interruption Handling and Context Switching article for more information.
Custom Tags
Add Custom Meta Tags to the conversation flow to profile VA-user conversations and derive business-critical insights from usage and execution metrics. You can define tags to be attached to messages, users, and sessions. See Custom Meta Tags for details.
Voice Call Properties
Configure Voice Properties to streamline the user experience on voice channels. You can define prompts, grammar, and other call behavior parameters for the node. This node does not require Initial Prompts, Error Prompts, and grammar configuration.
See Voice Call Properties for more information on setting up this section of the GenAI Node.
Connections Properties
Note:Â If the node is at the bottom of the sequence, then only the connection property is visible.
Define the transition conditions from this node. These conditions apply only to this instance and will not affect this node’s use in any other dialog. For a detailed setup guide, See Adding IF-Else Conditions to Node Connections for a detailed setup guide.
All the entity values collected are stored in context variables. For example, {{context.genai_node.bookflight_genainode.entities.entity_1}}
. You can define transitions using the context variables.
This node captures entities in the following structure:
{
"bookflight_genainode": {
"entities": {
"entity_1": "value 1",
"entity_2": "value 2",
"entity_3": "value 3"
},
"exit_scenario": {
"conv_status": "ended"
},
"bot_response": {
"bot": "Thank you for choosing us, your flight ticket details will be shared over email."
}
}
}
Add Custom Prompt for GenAI Node
This step involves adding a custom prompt to the GenAI node to tailor its behavior or responses according to specific requirements. By customizing the prompt, you can guide the AI to generate outputs that align more closely with the desired outcomes of your application.
GenAI Node with custom prompt supports configuring pre and post-processor scripts at both node and prompt levels. This enables platform users to reuse the same custom prompt across multiple nodes while customizing the processing logic, input variables, and output keys for each specific use case.Â
When you configure pre and post-processor scripts at both node and prompt levels, the execution order is: Node Pre-processor → Prompt Pre-processor → Prompt → Prompt Post-processor → Node Post-processor.
Note: Node-level pre and post-processor scripts support Bot Functions in addition to content, context, and environment variables..
For more information on Custom Prompt, see Prompts and Requests Library.
To add a GenAI Node prompt using JavaScript, follow the steps:
- Go to Build > Natural Language > Generative AI & LLM.
- On the top right corner of the Prompts and Requests Library section, click +Add New.
- Enter the prompt name. In the feature dropdown, select GenAI Node and select the model.
- The Configuration section consists of End-point URLs, Authentication, and Header values required to connect to a large language model. These are auto-populated based on the input provided while model integration and are not editable.
- In the Request section, click Start from Scratch. Learn more.
- Click JavaScript. The Switch Mode pop-up is displayed. Click Continue.
- Enter the JavaScript and click Preview.
- On the Preview pop-up, enter the Variable Value and click Test. This will convert the JavaScript to a JSON object and send it to the LLM. You can view the JSON object in the JSON Preview section. The success message is displayed. Click Close.
- You can view the JSON object in the JSON Preview section. Click Close.
- In the request section, click Test. This will make a call to the LLM.
- If the request values are correct, the response from the LLM is displayed. If not, an error message is displayed.
- In the Actual Response section, double-click the Key that should be used to generate the response path. For example, double-click the Content key and click Save.
- The Response Path is displayed. Click Lookup Path.
- The Actual Response and Expected Response are displayed.
- If the response structure matches, the responses will be in green. Click Save. Skip to Step 15.
Note:Â Both Actual Response and Expected Response are not editable.
- If the response structure does not match, the responses will be in red. Click Configure to modify the Actual Response. The Post Processor Script is displayed.
- If the response structure matches, the responses will be in green. Click Save. Skip to Step 15.
- Enter the Exit Scenario Key-Value fields, and Virtual Assistance Response Key, and Collected Entities. The Exit Scenario Key-Value fields help identify when to end the interaction with the GenAI model and return to the dialog flow. A Virtual Assistance Response Key is available in the response payload to display the VA’s response to the user. The Collected Entities is an object within the LLM response that contains the key-value of pairs of entities to be captured.
- Click Save. The request is added and displayed in the Prompts and Requests Library section.
Dynamic Variables
The Dynamic Variables like Context, Environment, and Content variables can now be used in pre-processor scripts, post-processor scripts, and custom prompts.
Keys | Description |
{{User_Input}} | The latest input by the end-user. |
{{Model}} Optional | This specifies the LLM tagged to the GenAI Node in the Dialog Task. |
{{System_Context}} Optional | This contains the initial instructions provided in the GenAI Node that guide how the LLM should respond. |
{{Language}} Optional | The language in which the LLM will respond to the users |
{{Business_Rules}} Optional | Rules mentioned in the GenAI Node are used to understand the user input and identify the required entity values. |
{{Exit_Scenarios}} Optional | Scenarios mentioned in the GenAI Node should terminate entity collection and transition to the next node based on Connection Rules. |
{{Conversation_History_String}} Optional | This contains the messages exchanged between the end-user and the virtual assistant. |
{{Conversation_History_Length}} Optional | This contains a maximum number of messages that the conversation history variable can hold. |
{{Required_Entities}} Optional | This contains the list of entities (comma-separated values) mentioned in the GenAI Node to be captured by the LLM. |
{{Conversation_History}} Optional | Past messages in the conversation are exchanged between the end-user and the virtual assistant. This is an array of objects with role and content as keys. |
{{Collected_Entities}} Optional | List of entities and their values collected by the LLM. This is an object with an entity name as the key and the value as LLM collected value. |