Collected Data

-> Go to BOTwiki

In the context of conversational AI, "collected data" refers to data that is collected and stored during interaction with users in order to be reused in further dialogue or downstream processes. In practical terms, these are context variables: information that a bot actively queries or reads from connected systems and then stores in a structured manner.

This data is essential for steering dialogues in a targeted manner, automating processes, and reliably transferring information to third-party systems: for example, in chatbots, voicebots, and AI workflows with BOTfriends X.

Collected data refers to all information that can be recorded, stored, and reused during a conversation or via integrations. It can be actively collected (e.g., customer number, email address, request, meter reading) or originate from external systems as system-collected data (e.g., CRM/ERP data, contract status, open tickets, customer segment).

In contrast to the general term "data for AI training," the focus here is not on research or model training, but rather on operational use in dialogue: Collected data ensures that the bot can maintain context, collect valid data, and execute processes correctly.

The importance of data collection for conversational AI

For conversational AI solutions such as chatbots and voicebots, the targeted collection of data is crucial because it makes dialogues reliable, structured, and processable. While intents and entities interpret the meaning of a user input, collected data ensures that the relevant information is available as concrete values and can be used in the course of the conversation.

Collected data also contributes to personalizing the user experience: if a customer number, location, or request has already been recorded, the bot can reduce follow-up questions and respond in a more targeted manner. Collected data can also be transferred to downstream systems, for example, for ticket creation, updating customer master data, or processing a service request.

Example: Meter reading query

When meter readings are queried, all relevant information is recorded in the dialog and stored as collected data, e.g.:

  • meter number
  • meter reading
  • reading date

These stored values are then used to automatically transfer the data to a connected system (e.g., billing system or CRM) – without any manual rework.

Methods of data collection for collected data

Collected data can be generated in two ways:

  1. Actively Collected Data (User Collected Data): The bot specifically requests information, validates it (e.g., email format, plausibility of a meter reading), and stores it as a context variable. Typical examples include customer number, email address, inquiry, ZIP code, preferred appointment time, or meter reading.
  2. Data read from systems (system-collected data): Data is loaded from external systems via interfaces and also stored as context variables to control the dialog or trigger actions. Examples include name/title from the CRM, contract status, delivery address, ticket history, or order information.

In automated AI workflows, data collection often takes place via integrations with business systems. Collected data connects dialogue and process logic: the bot collects or loads values, uses them in conversation, and then passes them on in a structured manner.

Quality and challenges with collected data

The quality of collected data is crucial because it flows directly into processes. Incomplete or incorrect values quickly lead to incorrect system entries, interrupted workflows, or unnecessary queries.

Typical challenges include:

  • Validation: Are entries formally correct (e-mail, customer number format) and plausible (meter reading within a realistic range)?
  • Consistency: The same information must not be stored in different formats/spellings.
  • Completeness: If mandatory values are missing, the process cannot be completed correctly.
  • Data protection: It must be clearly defined what data is collected, what it is used for, and how long it is stored.

Clear data schemas, mandatory field logic, validation rules, and clean governance help to ensure security. This is particularly important when collected data is used for system updates or process automation.

Frequently asked questions

"Collected data" is storable information from user interaction or connected systems that a bot stores as context variables and later reuses in dialogues or workflows. Examples include customer number, email address, request, meter number, meter reading, or reading date.

Collected data is used to control dialogues (maintain context, reduce queries) and to execute processes (e.g., create tickets, update data records, transmit meter readings). It serves as a structured basis so that a bot not only "responds" but also reliably completes tasks.

User-collected data is actively requested and stored during the conversation (e.g., "What is your customer number?"). System-collected data is read from systems via interfaces (e.g., name/title from the CRM, contract status, or open tickets) and used as context variables.

AI primarily assists with the automated understanding of inputs (e.g., intent/entities) and dialogue management. Collected data is the part that turns this into concrete, storable values that can be validated and reused in processes. Together, these two elements ensure stable automation.

The bot queries the meter number, meter reading, and reading date, saves each field as collected data, and then automatically transfers the values to the target system. This eliminates manual typing and makes the process significantly faster and less prone to errors.

–> Back to the BOTwiki