You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 127 Next »

Datasets
(Databases and SQL Queries)


Introduction to the Datasets Module

The Datasets Module is designed for data exchange with SQL databases and text files from a diverse set of sources. Essentially, the Dataset module drives bi-directional real-time communication between all Project Modules and the SQL databases.  This module offers compatibility with various database technologies, including ADO.NET, ODBC, OleDB, and native interfaces, providing straightforward configuration with prominent databases such as SQL Server, Oracle, SQLite, and PostgreSQL.

Features include multi-threaded concurrent connections for efficient data handling, a dedicated SQLite editor for streamlined database management, and an intuitive visual query builder that simplifies the process of query creation and manipulation. The Dataset module also integrates real-time tags within query strings, enhancing the dynamism of data interaction. It can manage files and recipes in ASCII, Unicode, or XML formats, broadening its applicability across different data types.

On this page:




Key Concepts and Terms

DatasetDBs

For the Dataset Module to communicate with an external database, a connection must be created with specific parameters. These connections, created within the Datasets → DBs section of the module, are referred to a DatasetDB.

DatasetQueries

In the context of the Dataset Module, a DatasetQuery refers not only to an SQL query string but also to a Project object that has a logical name, an SQL query related to that logical name, and other parameters defined within the Datasets → Queries section. There are many ways to automatically map the results of a query execution with tags.

DatasetTables

A DatasetTable is a logical name created within a solution to set up access to a specific table in a connected database. These tables are listed within the Datasets → Tables section of the module. The tags in the real-time database can be easily mapped to columns in the tables to perform insert, update, or read operations.

DatasetFiles

A Dataset File is a logical name that defines parameters for reading and writing files in ASCII, Unicode, or XML formats.




Understanding the Datasets Module

The Datasets Module enables users to interact with SQL databases seamlessly, allowing them to retrieve and manipulate data within their applications.  Here you can:

  • Generate Custom Visualizations,
  • Implement Intricate Logic
  • Set Up Dynamic Alarms

The module supports real-time Tags within query strings, and manages files and recipes in ASCII, Unicode, or XML formats.

Module Highlights

  • Multi-threaded concurrent connections for efficient data handling.
  • Dedicated SQLite editor for streamlined database management.
  • Intuitive visual query builder that simplifies the process of query creation and manipulation.

Data Utilization

The data retrieved from databases can be utilized in various ways throughout your project:

  • In the Displays Module: Visualization tools like DataGrids can present query results on screens and dashboards, creating custom views of the data that are accessible and easy to understand for your users.
  • In the Scripting Module: Custom scripts can reference query results and trigger specific actions, such as sending notifications, updating tags, or performing calculations, thereby implementing complex logic based on database data.
  • In the Alarm Module: Data can be used to configure or update alarm conditions, triggers, and actions, enabling the creation of dynamic alarm systems that respond to data changes.

Various modules can also add data to your SQL databases. For instance, the Displays Module can log operation inputs and commands, the Scripting Module can calculate analytics, and the Alarm and Historian Modules can retain long-term records and audit trails. 

Processing Data Requests

Datasets Data Server Service

The Datasets Data Server Service ensures high performance and seamless integration while exchanging structured collections of data in real-time. It efficiently processes data requests from other modules, enabling connectivity with various components such as HMI/SCADA screens, scripting, and reporting tools.

Default Databases Used When Running the Project

The Dataset module also serves as a data storage configuration hub for other modules. The Alarm and Historian modules, which generate events and records for long-term retention, utilize the Dataset module for defining data storage. Similarly, the Security Module can utilize user definitions from an external database defined within the Dataset module.

Data Source Virtualization

Data Source Virtualization is an advanced feature that simplifies data management across multiple data sources. It provides a unified interface for accessing, querying, and manipulating data, regardless of the underlying data storage technology. This feature allows modifications to the underlying data sources repositories without impacting the rest of the application.

  • Agnostic, Standards, Centralized Management

Adhering to industry standards, the module is designed to work seamlessly with various data storage technologies. It enables centralized management of data connections and configurations, streamlining the process of integrating different data sources into your projects.

→ Read more about Advanced Datasets Topics.




Configuring the Datasets Module

  1. First, access Datasets → DBs and establish a database connection. After creating a DB, input the server path in the Connection String property.
  2. Access Datasets → Queries to create Queries using the visual Query Editor or Provided SQL Strings.

    Use Meaningful Names

    We recommend names such as, Dataset.Query, to identify Query Usage, to improve the solution maintenance.

    Enhance Query Flexibility with Real-Time Parameters

    Adding real-time parameters to your queries offers flexibility and adaptability, enabling dynamic data retrieval based on changing conditions. Instead of hardcoding specific values, you can use dynamically populated placeholders at runtime. 

    Instead of using Queries like the following:

    • SELECT * FROM your_table WHERE col1 = 5

    Use Queries with real-time parameters:

    • SELECT * FROM your_table WHERE col2 = {{tag.Test}} 

    In this example, {{tag.Test}} serves as a placeholder for a real-time parameter.

  3. Access Datasets → Tables to manage your project tables and connect inserted data with Tags for efficient data handling.
  4. Establish file-based data exchange configurations at Datasets → Files. You can configure interactions with plain text or XML files as needed.
  5. Finally, leverage logically named objects throughout the solution, applying them in script calculations, visualization displays, and other components.


You can also configure and execute Stored Procedures at Datasets → Queries to extend the Query's functionality.

** Why is there an anchor below? What links go to this? **

Managing DB Connections

When using SQLite databases, the Dataset Module can automatically create the database locally if it doesn't already exist. For other database types, the database itself must already exist before you set your connection.

Any user can create new connections in the solution. However, only the Administrator can configure database login passwords.

There are four database connections preloaded into any new solution.

Datasets DB - Pre-defined database connections

DB

Database

Path Location

Usage

Retentive

SQLite

<ProjectNameAndPath>.dbRetentive

Stores values for Retentive Tags.

RuntimeUsers

SQLite

 <ProjectNameAndPath>.dbRuntimeUsers

Stores dynamically created Users.

AlarmHistorian

SQLite

 <ProjectNameAndPath>.dbAlarmHistorian

Stores Alarm and AuditTrail records.

TagHistorian

SQLite

<ProjectNameAndPath>.dbTagHistorian

Stores Tag Historian and Annotations.

If needed to use another database for the pre-defined connections, follow these steps below:

  1. Rename or delete the previous DB. This step is necessary, as the system would not allow the creation of two objects with the same name. 

  2. Create a new DB with the same name as the previous DB, with the required database and connection settings.

→ Read more about Dataset DBs.

Dataset Queries Configuration

Users can enhance data manipulation by configuring queries with SQL statements in the Dataset module. The Visual Query Editor simplifies the process, allowing users to drag and drop tables, define relationships, and apply filters through an intuitive graphical interface.

→ Read more about the Visual SQL Query Builder.

Queries can be saved and executed seamlessly within the module.

→ Read more about Dataset Queries.

Dataset Tables Configuration

Use the Dataset Tables to manage data tables and configure table properties such as structure, relationships, and data types. It provides an intuitive interface to streamline customization and ensure efficient data handling. 

→ Read more about Dataset Tables.

Dataset Files Configuration

The Dataset files are used to customize file interactions in the Dataset module. With this feature you can configure file-related properties, define formats and establish connections. 

→ Read more about Dataset Files.




Working with the Datasets module

Runtime Execution

One of the critical features of the Datasets module is the ability to execute SQL queries and retrieve data in real-time. Here are some ways to leverage the Runtime execution features of the Dataset module:

  • Create SQL queries to retrieve data from external databases.
  • Use query results to trigger events and actions within the platform environment.
  • Configure event triggers based on specific query criteria, such as changes to a particular data point or an exceeded threshold value.

Monitoring the Datasets Module


Intra-Module Interaction

The Datasets module easily integrates with other modules within the software environment. Here are some examples of how to use the Dataset module in conjunction with other modules:

  • Displays Module: Visualization tools like Data Grids can present query results on screens and dashboards, creating custom views of the data that are accessible and easy to understand for your users.
  • Scripting Module: Custom scripts can reference query results and trigger specific actions, such as sending notifications, updating tags, or performing calculations, thereby implementing complex logic based on database data.
  • Alarm Module: Data can be used to configure or update alarm conditions, triggers, and actions, enabling the creation of dynamic alarm systems that respond to data changes
  • Unified Namespaces Module: You can utilize Tags from your project as dynamic placeholders within SQL queries or file names. This capability facilitates real-time data manipulation within the query strings. Moreover, you can map the results of the queries to Tags or employ Tags to populate values when inserting or modifying existing records. 

The Datasets module drives bi-directional real-time communication between all Project Modules and the SQL databases. By leveraging these integration options, users can gain greater insight and control over their data sources within the platform. They can execute SQL queries and trigger actions based on query results, providing robust tools for working with data.

External System Interaction

The Dataset module enables seamless interaction with external databases, enhancing data integration and accessibility. The following is a list of possible databases you can connect:

  • SQL Server
  • PostgreSQL
  • Excel




Advanced Datasets Module Topics

Database Connections And Interactions

The Dataset module facilitates efficient database interactions by utilizing TServer services. While it doesn't directly interact with databases, it manages synchronous and asynchronous executions for optimal performance.

→ Read more about Databases Connection And Interactions.

Data Management

The Dataset Module offers versatile methods for managing data and concurrency within solutions, including Data Table tags and Async Contents.

→ Read more about Data Management.

Datasets Runtime Attributes

The Datasets Namespace exposes properties and methods from the .NET objects used by the Historian Module execution. You can use these properties and methods on your Displays or to create Scripts and Alarms.

→ Read more about Dataset Runtime Attributes.

Preventing SQL Injections

When calling Stored Procedures in the database, there's a significant concern about SQL injection, especially when passing parameters as plain text in SQL statements. To mitigate this, utilize the .Net API, where parameters are added to a list, effectively preventing code injection. Use the parameters in Stored Procedure calls with the following syntax:

execute <StoredProcedureName> <@paramName1>={Tag.Tag1}, <@paramName2>="{Tag.TagStr}", <@paramName3>={Tag.OutValue} OUTPUT, ... , @RETURN_VALUE={Tag.RetValue} RETURNVALUE

This approach creates parameters for Stored Procedure calls and guards against SQL injection attacks. Each parameter value is strictly interpreted as a value and not part of the SQL command, providing a critical layer of security against unauthorized database manipulation.

Network Gateway Access And Time Zone Handling

In some scenarios, specific databases might not be accessible from all computers due to network topology or security constraints. The ServerIP column can redirect database access commands to a machine with the necessary permissions.  Here, only TWebServer would be active, handling database interaction, receiving commands, and establishing connections.

The platform treats all Date and Time type Tags as UTC. Consequently, the dates written to and read from the standard database, such as historical data and alarms, are also in UTC. If it's necessary to read and write dates in the local time zone, account for the time difference when preparing the SQL statement or using the WhereCondition  property. Other external data tables, from third parties or the client, can use any time zone. To assist in handling these data, you can configure the DateTimeMode column with the type of time zone used in the database (UTC or Local Time), and the system will automatically convert it when reading and writing to Tags.

Backup Of Solutions SQLite Data Bases

There are several ways to backup SQLite databases in a solution. One simple way is to make a copy of the database file itself, but this method may not be suitable if the database is actively being written to or is very large. Here are a few suggestions for backing up SQLite databases in your solution:

  • SQLite provides a built-in backup command that can be used to create a backup of a database file. Here is the syntax for this command:

    sqlite3 source_database.sqlite .backup backup_file.sqlite

    This command creates a backup of the source database and stores it in a new file called backup_file.sqlite. Allowing you to copy this file to a safe location for backup purposes.

  • SQLite also provides an Online Backup API that you can use to create a backup of a database file while your application is still using it. This API provides several benefits over the backup command, such as the ability to perform incremental backups and monitor the progress of the backup process. 

→ Read more about SQLite Backup API.

Regardless of your chosen method, it's crucial to store your backup files in a secure location and test them regularly to ensure that you can use them to restore your database if needed.




Best Practices and Troubleshooting

Common Issues and Solutions

Connection loss between project and database

Database Timeout Configuration: The database may have a timeout setting that automatically disconnects idle connections after a certain period of time. It's recommended to check the database's timeout setting and adjust it, if necessary, to ensure that the connection remains active overnight.

Power Settings: It's also suggested to check the computer's power settings to ensure that it's not entering sleep or hibernation mode during a certain idle moment, which could cause loss of connection to the database. Adjusting these settings to keep the computer active during these idle moments may resolve the issue.

Database Connection Problem

In the DB configuration, there is always a "Test" button to ensure that the connection is happening correctly. When there is a problem, the return of this button is an error message, usually returned by the database provider itself. The most common errors are: invalid user, invalid password, computer without access to the database, incorrect form of authentication.

Issue: Error accessing the Database Table

Once the connection is established, the Table configuration is specific to a table. In the "Table" combobox, the list of available tables automatically appears. It is possible, via script, to change which table will be accessed. However, care must be taken that the table exists and that the configuration is done using the correct name. The same care must be taken when Queries are used, as it is the user's responsibility to type the correct table name, as well as the syntax of the separators.

Error in the Syntax of the Query

It is the user's responsibility to type the correct SQLStatement of a query using the QueryBuilder. Table name, column, values, all can generate an error if used incorrectly. For example: comparing different types may not return the expected result, strings in general should be in single quotes. The separators and clauses available can vary between databases, for example:

SQLServer

Query syntax
SELECT TOP 10 * FROM table WHERE column = value

SQLite

Query syntax
SELECT * FROM table WHERE column = value LIMIT 10;

Oracle

Query syntax
SELECT * FROM table WHERE column = value AND ROWNUM <= 10;

or new Oracle version

Query syntax
SELECT * FROM table WHERE column = value FETCH FIRST 10 ROWS ONLY;

IBM DB2

Query syntax
SELECT * FROM table WHERE column = value FETCH FIRST 10 ROWS ONLY;


ServerIP without TWebServer Running on the Remote Machine

In some cases, the computer may not have access to the database. In this case, it is possible to create a gateway, routing the commands to be executed on the computer that has access to the database. The ServerIP field should be configured with the IP and port (<IP>:<port>), pointing to this computer that has access permission. This computer must have the software with the TWebServer running installed. It will automatically perform this gateway service and send the commands to the database.

DataTable Returned NULL

When a query is returned null, some error has occurred. Some common errors include: connection failure with the database, table not found, Dataset module is not running, incorrect query syntax. Check the return of the method using WithStatus when using a synchronous method or use the LastStatus and LastStatusMessage property when using asynchronous mode.

DataTable Returned with 0 Rows

When this happens, in general, there is a connection with the database and the table name is correct. In this case, the search condition is usually wrong, or the table is really empty. Check if the column names are correct and if the separators and clauses are valid.

Dataset Module is Down

Although the TServer is responsible for forwarding requests to the database, the management and communication with the TServer is done by the Dataset module, as well as the treatment of responses. Therefore, if you are having basic problems with access and execution of database access, the first thing to check is whether the module is set up to run and is actually running.

Very High Response Time

Sometimes, it may seem that access to the database is not being made, but what might actually be happening is that some accesses can return a very large amount of data, or the database may be overloaded, or with a bad configuration, causing it to have a low performance. Or even, the network itself can be overloaded and slow, and all these factors can impact the response time. In these cases, it is important to execute the Query directly in the database environment to be sure that the problem is not on the side of the database. Do this and check how long the database itself takes to execute the query. It is also worth checking the volume of data exchanged to have an idea of ??the related side effects.

** What are all these hashtags below for? **

#Update of a table with the wrong schema (select before update)

The Dataset module uses ADO technology, and many things are resolved at the level of this API. When we are going to perform an Update on a table, the schema of the table and controls in the .Net DataTable type are used. Therefore, if you are going to perform an update passing a Tag or .net DataTable object as a parameter, it is important that this object respects the schema of the destination Table in the database. Normally, a Select command must have been given at some point to obtain the correct schema used by the bank. After this, it is easy to add, remove, and modify values in this DataTable and update it back to the physical table in the database.

#Where condition CaseSensitive

Case sensitivity in a WHERE clause depends on the database and the configuration of the database you are using. For example, in MySQL, queries are case-insensitive by default, which means 'abc' and 'ABC' would be considered equal. However, this can be changed with specific database settings. In SQL Server, case sensitivity is also determined by the database configuration. In PostgreSQL, queries are case-sensitive by default, so 'abc' and 'ABC' would be considered different. Therefore, it really depends on the specific database and the settings of that database. If you need to ensure case-insensitivity in a query, you can use functions like UPPER() or LOWER() to convert all values to upper or lower case before comparison. For example:

Query syntax
SELECT * FROM table WHERE LOWER(column) = LOWER(value);

This query will return records where the column matches the value, regardless of capitalization.

#Performance

The Dataset module's performance depends on many factors, including database performance, network latency, and the complexity of executing SQL queries. The platform will minimize overhead and execute queries as efficiently as possible. However, ultimately, the performance of the Dataset module is tied to these external factors. It's essential to design your database schema and queries with performance in mind and consider investing in high-quality hardware and network infrastructure to ensure optimal performance.

Best Practices and Recommendations

#Error Handling

Error handling in the Dataset module is straightforward. Suppose an error occurs during the execution of a command. The error message will update the module's Error property (Last Status). You can monitor this property to handle errors in your application. Furthermore, suppose an error occurs during the execution of a synchronous method. In that case, the process will return an empty Data Table, and update the Error property. Alternatively, you can call methods like SelectCommandWithStatus, where the status will be an output parameter in the method.

These are some of the key points you should keep in mind while using the Dataset module. It's a powerful tool that provides much flexibility. However, it also requires careful use and understanding of its features and limitations. Always keep best practices in mind when designing and implementing your database interactions. Feel free to seek expert advice if you need clarification on anything.





In this section:


  • No labels