CN112115196A - Industrial big data visual modeling method and system - Google Patents

Industrial big data visual modeling method and system Download PDF

Info

Publication number
CN112115196A
CN112115196A CN202010780964.8A CN202010780964A CN112115196A CN 112115196 A CN112115196 A CN 112115196A CN 202010780964 A CN202010780964 A CN 202010780964A CN 112115196 A CN112115196 A CN 112115196A
Authority
CN
China
Prior art keywords
node
data
model
processing
canvas
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010780964.8A
Other languages
Chinese (zh)
Inventor
林泽生
陈东清
黄章树
刘绍清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Haichuang Technology Fujian Group Co Ltd
Original Assignee
China Haichuang Technology Fujian Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Haichuang Technology Fujian Group Co Ltd filed Critical China Haichuang Technology Fujian Group Co Ltd
Priority to CN202010780964.8A priority Critical patent/CN112115196A/en
Publication of CN112115196A publication Critical patent/CN112115196A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26Visual data mining; Browsing structured data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a visual modeling method and a visual modeling system for industrial big data, wherein the method comprises the steps of summarizing structured data and unstructured data; processing and storing the data by adopting an ETL technology; model control is quoted, a model analysis field is set, parameters are configured, and a model is constructed; and evaluating and testing the model, if the model meets the requirements, applying the model, otherwise, reconsidering the input field, adjusting the related parameters until the input field meets the requirements, reducing the requirements of the user programming capability, and facilitating modeling by the user.

Description

Industrial big data visual modeling method and system
Technical Field
The invention relates to a visual modeling method and system for industrial big data.
Background
Industrial enterprises accumulate a large amount of data, and many enterprises want to mine data values by using data analysis tools, but the existing mining data values have the following problems: (1) the programming capability of business personnel is weak, and the model is difficult to be established quickly to analyze and mine industrial data; (2) the data analysis process is complex, and the data modeling capability of business personnel is insufficient.
Disclosure of Invention
The invention aims to solve the technical problem of providing a visual modeling method and system for industrial big data, which can reduce the requirement of user programming capability and facilitate modeling by users.
One of the present invention is realized by: an industrial big data visualization modeling method comprises the following steps:
step 1, summarizing structured and unstructured data;
step 2, processing and storing the data by adopting an ETL technology;
step 3, quoting a model control, setting a model analysis field, configuring parameters and constructing a model;
and 4, evaluating and testing the model, if the model meets the requirements, applying the model, otherwise, reconsidering the input field, and adjusting the relevant parameters until the model meets the requirements.
Further, the step 3 is further specifically:
newly building canvas nodes: selecting a type node menu so as to determine the type of the adopted node, drawing a corresponding node in the Canvas on the right side according to the mouse placement, and recording the node type and the node position information;
establishing node association: clicking nodes in Canvas, drawing arrows to perform node association, and storing the node association in a tree form;
parameter configuration: double-clicking a node in Canvas, opening an HTML configuration interface of a node of a corresponding type, acquiring a data source parameter configured by a previous node in the configuration interface by a user according to node association, sending a request to acquire a corresponding data resource, then performing corresponding parameter configuration according to a data source, and persistently storing parameter information of the node in a system configuration database;
performing a model: and sending an execution request, calling a corresponding function to perform operation according to the parameter information of the node, and then displaying an operation result at the browser end.
Further, the step 1 is further specifically: analyzing the demand service, summarizing the related data of the structure and the non-structure, or uploading the related data.
Further, the processing of the data by the ETL technique in step 2 includes missing value processing, abnormal value processing, discretization processing, and data normalization processing.
The second invention is realized by the following steps: an industrial big data visualization modeling system comprises the following modules:
the data source management module is used for summarizing structured and unstructured data;
the data processing module is used for processing and storing the data by adopting an ETL technology;
a model building module refers to a model control, sets a model analysis field, configures parameters and builds a model;
and the test module is used for evaluating and testing the model, performing model application if the model meets the requirements, and otherwise, reconsidering the input field and adjusting related parameters until the model meets the requirements.
Further, the construction model module is further specifically:
newly building canvas nodes: selecting a type node menu so as to determine the type of the adopted node, drawing a corresponding node in the Canvas on the right side according to the mouse placement, and recording the node type and the node position information;
establishing node association: clicking nodes in Canvas, drawing arrows to perform node association, and storing the node association in a tree form;
parameter configuration: double-clicking a node in Canvas, opening an HTML configuration interface of a node of a corresponding type, acquiring a data source parameter configured by a previous node in the configuration interface by a user according to node association, sending a request to acquire a corresponding data resource, then performing corresponding parameter configuration according to a data source, and persistently storing parameter information of the node in a system configuration database;
performing a model: and sending an execution request, calling a corresponding function to perform operation according to the parameter information of the node, and then displaying an operation result at the browser end.
Further, the data source management module is further specifically: analyzing the demand service, summarizing the related data of the structure and the non-structure, or uploading the related data.
Further, the processing of the data by the data processing module using the ETL technique includes missing value processing, abnormal value processing, discretization processing, and data normalization processing.
The invention has the following advantages: the visual modeling of business personnel on industrial data is realized, the dependence of the business personnel on a big data analysis program is reduced, the modeling speed is increased, the model application efficiency is improved, and the management level is improved.
Drawings
The invention will be further described with reference to the following examples with reference to the accompanying drawings.
FIG. 1 is a flow chart of the method of the present invention.
FIG. 2 is a schematic block diagram of the system of the present invention.
FIG. 3 is a system block diagram of an embodiment of the present invention.
Fig. 4 is a schematic diagram illustrating the effect of the embodiment of the present invention.
Detailed Description
As shown in FIG. 1, the invention relates to an industrial big data visualization modeling method, which comprises the following steps:
step 1, analyzing a demand service, summarizing structured and unstructured related data, or uploading related data;
step 2, processing and storing the data by adopting an ETL technology, wherein the processing of the data by adopting the ETL technology comprises missing value processing, abnormal value processing, discretization processing and data standardization processing;
step 3, newly building canvas nodes: selecting a type node menu so as to determine the type of the adopted node, drawing a corresponding node in the Canvas on the right side according to the mouse placement, and recording the node type and the node position information;
establishing node association: clicking nodes in Canvas, drawing arrows to perform node association, and storing the node association in a tree form;
parameter configuration: double-clicking a node in Canvas, opening an HTML configuration interface of a node of a corresponding type, acquiring a data source parameter configured by a previous node in the configuration interface by a user according to node association, sending a request to acquire a corresponding data resource, then performing corresponding parameter configuration according to a data source, and persistently storing parameter information of the node in a system configuration database;
performing a model: sending an execution request, calling a corresponding function to perform operation according to the parameter information of the node, and then displaying an operation result at a browser end;
and 4, evaluating and testing the model, if the model meets the requirements, applying the model, otherwise, reconsidering the input field, and adjusting the relevant parameters until the model meets the requirements.
As shown in FIG. 2, the industrial big data visualization modeling system of the invention comprises the following modules:
the data source management module analyzes the demand service, summarizes the related data of the structuring and the non-structuring, or uploads the related data;
the processing data module is used for processing and storing data by adopting an ETL technology, and the processing of the data by adopting the ETL technology comprises missing value processing, abnormal value processing, discretization processing and data standardization processing;
constructing a model module, and newly building canvas nodes: selecting a type node menu so as to determine the type of the adopted node, drawing a corresponding node in the Canvas on the right side according to the mouse placement, and recording the node type and the node position information;
establishing node association: clicking nodes in Canvas, drawing arrows to perform node association, and storing the node association in a tree form;
parameter configuration: double-clicking a node in Canvas, opening an HTML configuration interface of a node of a corresponding type, acquiring a data source parameter configured by a previous node in the configuration interface by a user according to node association, sending a request to acquire a corresponding data resource, then performing corresponding parameter configuration according to a data source, and persistently storing parameter information of the node in a system configuration database;
performing a model: sending an execution request, calling a corresponding function to perform operation according to the parameter information of the node, and then displaying an operation result at a browser end;
and the test module is used for evaluating and testing the model, performing model application if the model meets the requirements, and otherwise, reconsidering the input field and adjusting related parameters until the model meets the requirements.
One embodiment of the present invention:
platform architecture
As shown in fig. 3, the data source layer collects structured and unstructured data sources, processes the data using the ETL technique, and stores the data in a data warehouse, a distributed database, and a Hadoop platform. The preprocessed data can conveniently support multidimensional data display, real-time data query, data sharing, data modeling and the like, and industrial big data intelligent application with the advantages of reducing cost, improving production efficiency, improving product quality and creating more economic benefits is provided for enterprises based on data analysis and mining technology.
Part (II) core algorithm and industry application model
The core algorithm of the system comprises a traditional statistical analysis method, various classical data mining algorithms such as clustering, association, decision tree and prediction and machine learning algorithms, and can efficiently support industrial big data analysis modeling. Based on a classical analysis model, the big data analysis system is deeply applied to the aspects of logistics, equipment, production, management decision and the like, and an industrial algorithm model is precipitated.
(III) visual modeling Process
The big data analysis system follows CRISP-DM (cross-industry data mining) cross-industry data mining process standard, and industrial big data visual modeling and application are rapidly realized.
The system adopts a B/S architecture, the front end uses Vue + Element + G6 to provide a visual modeling function, the display of a modeling node network is carried out through G6, the back end uses Springboot + Mybatis to realize, a data source can support simple desktop data such as EXCEL, CSV, text files and the like, and also can support non-relational data such as MySQL, Oracle mainstream relational data, redis, mangold and the like.
The visual modeling process is as follows, as shown in FIG. 4.
(1) According to the analysis demand service, relevant data are gathered and sorted, and the user can also be supported to upload data by himself;
(2) preprocessing data, including missing value processing, abnormal value processing, discretization processing, data standardization processing and the like; the missing value processing mode comprises neglecting, directly removing samples, filling with maximum frequency, mean filling, interpolation and the like for repairing. The abnormal value judgment standard is a positive standard deviation and a negative standard deviation (1 time, 2 times and 3 times) of the mean value, and the processing mode comprises the steps of neglecting, directly removing the sample and restoring the mean value. The discretization processing comprises self-defining coding, one-hot coding and virtual variable coding. The data standardization mode comprises range standardization and logarithmic standardization.
(3) And (4) quoting the model control, setting a model analysis field, configuring relevant parameters and constructing a model. The main implementation process is as follows: newly building canvas nodes: and selecting a type node menu so as to determine the type of the adopted node, drawing a corresponding node in the Canvas on the right side according to the mouse placement, and recording the information such as the node type, the node position and the like. Establishing node association: and clicking the nodes in the Canvas, drawing arrows to perform node association, and performing record storage on the node association in a tree form. Parameter configuration: double-clicking a node in Canvas, opening an HTML configuration interface of a node of a corresponding type, acquiring a configured data source parameter of a previous node according to node association in the configuration interface by a user, sending a request to acquire a corresponding data resource, and then configuring the corresponding parameter according to the data source. The parameter information of the nodes is persistently stored in a system configuration database; executing the model: sending an execution request, calling a corresponding function to perform operation according to the parameter information of the node, and then displaying an operation result at a browser end;
(4) carrying out evaluation test on the model, wherein the main evaluation criteria comprise root mean square relative error, classification accuracy, contour coefficient and cross validation performance, if the model meets the requirements, the next step of model application can be carried out, otherwise, the input field is considered again, and relevant parameters are adjusted;
(5) and on the basis of satisfaction of the test model, applying the model to carry out service prediction.
Although specific embodiments of the invention have been described above, it will be understood by those skilled in the art that the specific embodiments described are illustrative only and are not limiting upon the scope of the invention, and that equivalent modifications and variations can be made by those skilled in the art without departing from the spirit of the invention, which is to be limited only by the appended claims.

Claims (8)

1. A visual modeling method for industrial big data is characterized in that: the method comprises the following steps:
step 1, summarizing structured and unstructured data;
step 2, processing and storing the data by adopting an ETL technology;
step 3, quoting a model control, setting a model analysis field, configuring parameters and constructing a model;
and 4, evaluating and testing the model, if the model meets the requirements, applying the model, otherwise, reconsidering the input field, and adjusting the relevant parameters until the model meets the requirements.
2. The industrial big data visualization modeling method according to claim 1, characterized in that: the step 3 is further specifically as follows:
newly building canvas nodes: selecting a type node menu so as to determine the type of the adopted node, drawing a corresponding node in the Canvas on the right side according to the mouse placement, and recording the node type and the node position information;
establishing node association: clicking nodes in Canvas, drawing arrows to perform node association, and storing the node association in a tree form;
parameter configuration: double-clicking a node in Canvas, opening an HTML configuration interface of a node of a corresponding type, acquiring a data source parameter configured by a previous node in the configuration interface by a user according to node association, sending a request to acquire a corresponding data resource, then performing corresponding parameter configuration according to a data source, and persistently storing parameter information of the node in a system configuration database;
performing a model: and sending an execution request, calling a corresponding function to perform operation according to the parameter information of the node, and then displaying an operation result at the browser end.
3. The industrial big data visualization modeling method according to claim 1, characterized in that: the step 1 is further specifically as follows: analyzing the demand service, summarizing the related data of the structure and the non-structure, or uploading the related data.
4. The industrial big data visualization modeling method according to claim 1, characterized in that: the step 2 of processing the data by using the ETL technology comprises missing value processing, abnormal value processing, discretization processing and data standardization processing.
5. An industrial big data visualization modeling system is characterized in that: the system comprises the following modules:
the data source management module is used for summarizing structured and unstructured data;
the data processing module is used for processing and storing the data by adopting an ETL technology;
a model building module refers to a model control, sets a model analysis field, configures parameters and builds a model;
and the test module is used for evaluating and testing the model, performing model application if the model meets the requirements, and otherwise, reconsidering the input field and adjusting related parameters until the model meets the requirements.
6. The industrial big data visualization modeling system according to claim 5, wherein: the construction model module is further specifically:
newly building canvas nodes: selecting a type node menu so as to determine the type of the adopted node, drawing a corresponding node in the Canvas on the right side according to the mouse placement, and recording the node type and the node position information;
establishing node association: clicking nodes in Canvas, drawing arrows to perform node association, and storing the node association in a tree form;
parameter configuration: double-clicking a node in Canvas, opening an HTML configuration interface of a node of a corresponding type, acquiring a data source parameter configured by a previous node in the configuration interface by a user according to node association, sending a request to acquire a corresponding data resource, then performing corresponding parameter configuration according to a data source, and persistently storing parameter information of the node in a system configuration database;
performing a model: and sending an execution request, calling a corresponding function to perform operation according to the parameter information of the node, and then displaying an operation result at the browser end.
7. The industrial big data visualization modeling system according to claim 5, wherein: the data source management module is further specifically: analyzing the demand service, summarizing the related data of the structure and the non-structure, or uploading the related data.
8. The industrial big data visualization modeling system according to claim 5, wherein: the processing data module adopts an ETL technology to process data, and comprises missing value processing, abnormal value processing, discretization processing and data standardization processing.
CN202010780964.8A 2020-08-06 2020-08-06 Industrial big data visual modeling method and system Pending CN112115196A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010780964.8A CN112115196A (en) 2020-08-06 2020-08-06 Industrial big data visual modeling method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010780964.8A CN112115196A (en) 2020-08-06 2020-08-06 Industrial big data visual modeling method and system

Publications (1)

Publication Number Publication Date
CN112115196A true CN112115196A (en) 2020-12-22

Family

ID=73799461

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010780964.8A Pending CN112115196A (en) 2020-08-06 2020-08-06 Industrial big data visual modeling method and system

Country Status (1)

Country Link
CN (1) CN112115196A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113110311A (en) * 2021-03-05 2021-07-13 中海创科技(福建)集团有限公司 Industrial big data online modeling and model sharing method, device, equipment and medium
CN113742432A (en) * 2021-08-20 2021-12-03 广州市易工品科技有限公司 Editable ER graph generation method and device
CN114254021A (en) * 2021-12-30 2022-03-29 重庆允成互联网科技有限公司 Multi-source data processing method and data factory
CN115826932A (en) * 2023-02-10 2023-03-21 中国电子科技集团公司第十五研究所 Assessment model and assessment model construction method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020109704A1 (en) * 2000-12-20 2002-08-15 Microsoft Corporation Dynamic, live surface and model elements for visualization and modeling
CN103970109A (en) * 2014-05-27 2014-08-06 重庆大学 Flow reappearing visualization method based on industrial field data under Web environment
CN107450902A (en) * 2016-04-29 2017-12-08 埃森哲环球解决方案有限公司 System architecture with visual modeling tool
CN109523318A (en) * 2018-11-16 2019-03-26 杭州市商务委员会(杭州市粮食局) The analysis platform of electronic commerce data visual modeling transboundary

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020109704A1 (en) * 2000-12-20 2002-08-15 Microsoft Corporation Dynamic, live surface and model elements for visualization and modeling
CN103970109A (en) * 2014-05-27 2014-08-06 重庆大学 Flow reappearing visualization method based on industrial field data under Web environment
CN107450902A (en) * 2016-04-29 2017-12-08 埃森哲环球解决方案有限公司 System architecture with visual modeling tool
CN109523318A (en) * 2018-11-16 2019-03-26 杭州市商务委员会(杭州市粮食局) The analysis platform of electronic commerce data visual modeling transboundary

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113110311A (en) * 2021-03-05 2021-07-13 中海创科技(福建)集团有限公司 Industrial big data online modeling and model sharing method, device, equipment and medium
CN113742432A (en) * 2021-08-20 2021-12-03 广州市易工品科技有限公司 Editable ER graph generation method and device
CN114254021A (en) * 2021-12-30 2022-03-29 重庆允成互联网科技有限公司 Multi-source data processing method and data factory
CN115826932A (en) * 2023-02-10 2023-03-21 中国电子科技集团公司第十五研究所 Assessment model and assessment model construction method

Similar Documents

Publication Publication Date Title
CN112115196A (en) Industrial big data visual modeling method and system
CN118311914B (en) A production line data acquisition control method and system for intelligent workshop
CN113887749B (en) Multi-dimensional monitoring and disposal method, equipment and platform of power Internet of Things based on cloud-edge collaboration
CN111339375A (en) Universal big data model configuration and analysis method
CN108491991B (en) Constraint analysis system and method based on industrial big data product construction period
CN116599857B (en) Digital twin application system suitable for multiple scenes of Internet of things
CN108052020B (en) A chemical process simulator for intelligent manufacturing
CN120748172B (en) Building engineering construction progress early warning method and system based on machine learning
CN118982175A (en) AIGC-based intelligent analysis system and method for business management data
WO2025139186A1 (en) Big data analysis and processing system based on deep learning
CN119066056A (en) An intelligent silicon rod cutting process management method and system
CN118396606A (en) Manufacturing industry product production optimization method based on multi-mode large model
CN117972356A (en) Engineering cost control method based on big data analysis
CN113626285A (en) Model-based job monitoring method and device, computer equipment and storage medium
CN115562660A (en) Lightweight knowledge base design applied to product quality analysis
CN119719059A (en) Educational resource sharing system and method based on cloud platform
CN118586782A (en) A mathematical and statistical analysis method based on quality management decision-making system
CN114789557B (en) Multi-nozzle printing method of 3d printer
CN113515500B (en) Visual data processing system and processing method
CN117196523A (en) Robot flow automation method and device
CN116150989A (en) Photovoltaic module manufacturing method and photovoltaic module
CN113191569A (en) Enterprise management method and system based on big data
CN119539622B (en) Quality monitoring system of full life cycle
CN121351514B (en) Method and System for Analyzing the Compressive Bearing Capacity of Composite Reinforced Sea Sand Concrete Columns
CN110135647A (en) A control method and control device for realizing trend prediction based on feature modeling

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20201222

WD01 Invention patent application deemed withdrawn after publication