Abstract data layer
The abstract data layer is a layer between the physical storage of data and the applications that use it, allowing abstraction from physical implementation and hardware specifics, providing a single interface for working with data. It provides:
- Hardware Independence: Data is stored on different devices (servers, cloud, IoT devices) but the application interacts with it through a single interface.
- Simplification of development: Developers work with abstract entities without worrying about the details of physical storage.
- Flexibility: Changes to the physical structure of data (e.g., migration to the cloud) do not affect applications.
Unified data model
The system uses a unified data model, which is adapted for different types of equipment and allows you to work with them in a uniform way, regardless of their source or type of equipment. It includes common data structures, uniform processing rules, and semantic consistency of data across all systems and subsystems.
For example, a unified system data model includes:
- Entities: Equipment, channels, sessions, units, reference parameters.
- Relationships: Equipment has channels, channels are related to data, data is related to sessions.
- Attributes: Attributes are defined for each entity (for example,
equipment.serial_number
,channel_data.value
).
Structure of the Unified System Data Model
The Unified System Data Model is a structure for storing and managing data related to hardware, channels, sessions, and measurements. Let’s look at each table and its purpose, as well as the relationships between them.
Table channel_data
This table stores the data received from the equipment channels within specific sessions.
Field | Type | Description |
---|---|---|
id |
bigint |
Unique record identifier (primary key). |
equipment_id |
bigint |
Reference to equipment (equipment.id ). |
seance_id |
bigint |
Reference to session (seances.id ). |
channel_id |
bigint |
Reference to channel (channels.id ). |
unit_id |
bigint |
Reference to unit of measurement (units.id ). |
archive_type_id |
integer |
Archive type reference (referenceparameters.id ). |
event_time |
integer |
The time of the event (eg timestamp). |
value |
character varying(100) |
The value received from the channel. |
created_at |
timestamp(6) without time zone |
Time when the record was created. |
updated_at |
timestamp(6) without time zone |
Time when the record was last updated. |
Relationships:
- The
unit_id
foreign key references theunits
table. - The
archive_type_id
foreign key references thereferenceparameters
table. - The
equipment_id
foreign key references theequipment
table. - The
channel_id
foreign key references thechannels
table. - The
seance_id
foreign key refers to theseances
table.
Table units
This table stores the units of measurement used for the channel data.
Field | Type | Description |
---|---|---|
id |
bigint |
Unique record identifier (primary key). |
name |
character varying(30) |
Name of the unit of measurement. |
varname |
character varying(30) |
Short variable name for the unit of measurement. |
description |
character varying(100) |
Description of the unit of measurement. |
conversion_factor |
double precision |
Conversion factor for a unit of measurement. |
rounding |
smallint |
Number of decimal places to round to. |
synonyms |
character varying[] |
Array of synonyms for the unit of measurement. |
created_at |
timestamp(6) without time zone |
Time when the record was created. |
updated_at |
timestamp(6) without time zone |
Time when the record was last updated. |
Table referenceparameters
This table stores reference parameters such as archive types or other qualifiers.
Field | Type | Description |
---|---|---|
id |
bigint |
Unique identifier of the record (primary key). |
name |
character varying(30) |
The name of the unit of measure. |
varname |
character varying(30) |
Short variable name for the unit of measure. |
description |
character varying(100) |
Description of the unit of measurement. |
parent_id |
integer |
A reference to the parent parameter (hierarchy). |
referencemodel_id |
integer |
Reference to the directory model. |
created_at |
timestamp(6) without time zone |
Time when the record was created. |
updated_at |
timestamp(6) without time zone |
Time when the record was last updated. |
deleted_at |
timestamp(6) without time zone |
Time when the record was deleted (soft delete). |
Relationships:
- The foreign key
parent_id
references the same table (referenceparameters.id
), allowing hierarchies to be built.
Table equipment
Field | Type | Description |
---|---|---|
id |
bigint |
Unique identifier of the record (primary key). |
equipment_type_id |
bigint |
Reference to the equipment type. |
serial_number |
character varying(25) |
Serial number of the equipment. |
manufacture_date |
timestamp(6) without time zone |
The date of manufacture of the equipment. |
installation_date |
timestamp(6) without time zone |
Installation date of the equipment. |
program_version |
character varying(100) |
Software version of the equipment. |
created_at |
timestamp(6) without time zone |
Time when the record was created. |
updated_at |
timestamp(6) without time zone |
Time when the record was last updated. |
Table channels
This table stores information about the equipment channels.
Field | Type | Description |
---|---|---|
id |
bigint |
Unique identifier of the record (primary key). |
equipment_type_id |
bigint |
Reference to the equipment type (equipment_type.id ). |
unit_id |
bigint |
Reference to the unit of measurement (units.id ). |
name |
character varying(100) |
Channel name. |
varname |
character varying(20) |
Short variable name for the channel. |
created_at |
timestamp(6) without time zone |
Time when the record was created. |
updated_at |
timestamp(6) without time zone |
Time when the record was last updated. |
Relationships:
- The
equipment_type_id
foreign key references theequipment_type
table. - The
unit_id
foreign key refers to theunits
table.
Table seances
This table stores information about the equipment’s communication sessions.
Field | Type | Description |
---|---|---|
id |
bigint |
Unique identifier of the record (primary key). |
telemetry_id |
bigint |
Reference to telemetry (telemetry.id ). |
event_time |
integer |
The time of the event. |
evtid |
smallint |
The identifier of the event. |
trycnt |
smallint |
The number of attempts. |
tryfl |
character varying(12) |
The flag of the attempt. |
state |
smallint |
Session status. |
btm |
integer |
Battery charge. |
rssi |
integer |
Signal strength (if used). |
created_at |
timestamp(6) without time zone |
Time when the record was created. |
updated_at |
timestamp(6) without time zone |
Time when the record was last updated. |
General structure and relationships
Main table - channel_data
which links channel data to equipment, sessions and units.
Relationship tables:
units
- units of measurement.referenceparameters
- archive types and other qualifiers.
Equipment Tables:
equipment
- equipment information.channels
- equipment channels.
Sessions table - seances
, which stores information about communication sessions.
Usage Example:
- Channel data (
channel_data
) comes from equipment (equipment
) within specific sessions (seances
). - Each channel (
channels
) has its own unit of measurement (units
). - The type of archive (
referenceparameters
) determines how data should be stored or processed.
Benefits of a unified model
This data model provides flexibility and scalability to store and analyze data from different equipment.
Let’s list some of the benefits of a unified data model:
- Consistency: Data has the same structure and semantics across all systems.
- Scalability: It is easy to add new data sources or equipment types.
- Data Protection: Unified authentication, authorization, and encryption mechanisms.
- Simplify Analysis: Data can be analyzed using unified tools.
Equipment abstraction
Equipment abstraction is a key system design principle that allows you to separate the logic of data handling from the physical characteristics of the equipment. This is especially important in systems that utilize heterogeneous devices.
Equipment abstraction means that the system deals with data at a logical level that is independent of the physical devices on which the data is stored or processed. This is achieved by:
- Unification of interfaces: A single way of accessing data, regardless of equipment.
- Hiding implementation details: Physical location of data, transmission protocols, and other technical aspects are hidden from applications.
- Adapters: Conversion of data from an equipment-specific format to a unified format.
Implementing Equipment Abstraction
Let us consider the implementation of this approach in our system
Suppose we have an IoT device that sends gas consumption data. The implemented algorithms allow us to:
- Receive the data from the device through an adapter.
- Convert them into a
channel_data
structure. - Store them in a database.
- Provide access to the data via API.
For each type of equipment, adapters are created that handle device-specific protocols and convert the data into a unified format.
Let’s look at an example:
For an IoT device, data can come in JSON format:
|
|
The adapter converts data into a channel_data
structure:
|
|
Data Request:
GET /api/v1/channel_data?equipment_id=1&archive_type=daily&channel_id=3
Response:
|
|
Advantages of Equipment Abstraction
- Hardware Independence: Applications work with data without knowing where it physically resides.
- Flexibility: Easily add new devices or modify existing devices.
- Simplification of development: Developers work with abstract entities without worrying about hardware details.
- Scalability: Data can be stored on different devices, but the system continues to operate in a uniform manner.
##Data Normalization Data normalization is the process of organizing data in a database in a way that minimizes redundancy and improves data integrity. In the context of the abstract data layer, normalization plays a key role in creating a unified and efficient data model that can be used to handle different types of hardware and data sources.
Data normalization is the process of dividing data into logical tables and establishing relationships between them in order to:
- Eliminating duplicate data.
- Simplifying data maintenance and updating.
- Ensuring data integrity.
- Improving query performance.
- The system’s unified data model is reduced to (third) normal Boyce-Codd form in terms of atomic (scalar) values. Some data model entities use composite structures to optimize the handling of rare non-standard attributes.
Benefits of normalization
- Removes redundancy: Data is stored in one place, reducing redundancy.
- Data Consistency: Maintaining data consistency is simplified.
- Flexibility: It is easy to make changes to the data structure.
- Productivity: Query performance is improved (in most cases).
Normalization in the context of the abstract data layer
Data normalization is an important step in the design of the abstract data layer.
It allows you to:
- Create a unified data model that can be used to handle different types of hardware.
- Eliminate redundancy and duplication of data.
- Ensure data integrity and consistency, regardless of its source.
- Simplify the integration of new devices and systems.