Personal tools
You are here: Home Topics Knowledge based pre-processing of process data

Knowledge based pre-processing of process data

This research project aims at a dramatic reduction of the effort for the pre-processing of process data. To this end, knowledge-based methodology for efficient pre-processing is developed.

Due to the progress in information technology that has been made in the last years the amount of measurement data which are available from running production processes has increased remarkably. The analysis of these measurement data offers optimization potential with respect to cost and quality and the possibility to detect faults in a production process in case of problems. In that field especially small and medium-sized enterprises (SME) are operating which offer special software tools, project management and consulting for process optimization. Despite the high availability of process measurement data the economic potential of process data analyses is only rarely exploited. The reason for that phenomenon is the enormous initial effort, often up to 80% of the whole data analysis, for data pre-processing.

Knowledge-based methodology for efficient pre-processing

This research project aims at a dramatic reduction of this initial effort by developing a knowledge-based methodology for the efficient pre-processing of process data:

  • Consistency of different steps for pre-processing when using various tools (subsequently) will be assured in order to prevent errors due to data conversion that is needed between these tools.
  • Pre-processing steps will be made comprehensible so that the use of the pre-processed data can be assessed even at a later date.
  • Reuse of algorithms and procedures for pre-processing will be supported.
  • Possible errors in the data (missing values, etc.) or in pre-processing procedures will be prevented or at least be detected at an earlier stage.

Key result components

 In order to achieve these goals the project focuses on three key result components:

  • a catalogue of best practices in data pre-processing,
  • a software architecture that makes use of this knowledge and supports the consistent interactive use of different external tools (that are not required to be fully integrated) for pre-processing, and
  • a software demonstrator that implements this architecture.

Catalogue of best practices

The catalogue of methods will be a kind of manual that contains a list of known steps in the field of data pre-processing, e.g. variance estimation and denoising. The manual will be augmented by important properties, such as input and output parameters, e.g. variance as an input for denoising, and tools that support the user in performing the corresponding pre-processing step. Established procedures will be given in the manual, too, in order to document best practice procedures and make them available to the users. We catalogue of best practices for pre-processing will be made available to the public in order to reach a broad range of users who would be able to extend this knowledge base.

Software architecture

 

A software architecture for knowledge-based data pre-processing

 

The architecture will support the comprehensive data pre-processing using different (external) tools interactively based on the knowledge in the catalogue of best practices. Key components are a data bus for the consistent and transparent access of data independent from their storage location and format, and a control component that documents work steps performed on the data, to make these steps comprehensive and re-usable and that suggests possible next steps using the knowledge base.

Software demonstrator

Feasibility will be demonstrated by a software demonstrator for data pre-processing that will implement the architecture. Practical applicability of the software demonstrator will be tested during the project and with increasing complexity together with the corresponding partners from industry.

Benefits for users

Economic benefit will be obtained especially in process systems engineering where the analyzed measurement quantities influence product quality or production costs directly. All industry in the field of process data analysis but especially SME which could focus on their key competences and do not have specialized departments with the corresponding human resources will gain benefit from the results of the submitted research project.

Document Actions