CN115080654B - A projection-based visualization method for abnormal component inspection - Google Patents

A projection-based visualization method for abnormal component inspection Download PDF

Info

Publication number
CN115080654B
CN115080654B CN202210599858.9A CN202210599858A CN115080654B CN 115080654 B CN115080654 B CN 115080654B CN 202210599858 A CN202210599858 A CN 202210599858A CN 115080654 B CN115080654 B CN 115080654B
Authority
CN
China
Prior art keywords
data
component
view
map
point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210599858.9A
Other languages
Chinese (zh)
Other versions
CN115080654A (en
Inventor
朱敏
杨啸
高雯雯
李长林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan University
Original Assignee
Sichuan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan University filed Critical Sichuan University
Priority to CN202210599858.9A priority Critical patent/CN115080654B/en
Publication of CN115080654A publication Critical patent/CN115080654A/en
Application granted granted Critical
Publication of CN115080654B publication Critical patent/CN115080654B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26Visual data mining; Browsing structured data
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04845Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range for image manipulation, e.g. dragging, rotation, expansion or change of colour
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/20Administration of product repair or maintenance
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/04Manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Quality & Reliability (AREA)
  • Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Human Computer Interaction (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Data Mining & Analysis (AREA)
  • Strategic Management (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Manufacturing & Machinery (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Primary Health Care (AREA)
  • Image Processing (AREA)

Abstract

本发明公开了一种基于投影的异常构件检查可视化方法,先对航空燃气涡扇发动机数据进行预处理,包含聚类、降维、异常检测等关键步骤;然后根据数据属性,采用位置、透明度、颜色等视觉通道进行可视化映射。设计构件投影视图,该视图采用散点图与等值线图呈现训练数据的分布概览,针对视图中的异常构件实例设计了一种特殊字形,用以发现异常数据的相似模式;设计数据检查视图,在视图上部用五种颜色及其间的连线编码构件在工况下的时序工作模式,在视图下部用面积图展示监测参数的真实数据。本发明能够辅助预测性维护人员直观有效地发现数据中的异常,并对其异常原因和异常模式进行分析,进而指导后续构件的工作部署,以延长构件的使用寿命。

The present invention discloses a projection-based visualization method for abnormal component inspection. First, the aviation gas turbofan engine data is preprocessed, including key steps such as clustering, dimensionality reduction, and anomaly detection; then, visual mapping is performed using visual channels such as position, transparency, and color according to data attributes. A component projection view is designed, which uses scatter plots and contour plots to present an overview of the distribution of training data, and a special glyph is designed for abnormal component instances in the view to find similar patterns of abnormal data; a data inspection view is designed, and the timing working mode of the component under working conditions is encoded with five colors and the connecting lines therebetween in the upper part of the view, and the real data of the monitoring parameters is displayed in the lower part of the view with an area chart. The present invention can assist predictive maintenance personnel to intuitively and effectively discover anomalies in the data, and analyze the causes and patterns of the anomalies, thereby guiding the work deployment of subsequent components to extend the service life of the components.

Description

Projection-based abnormal component inspection visualization method
Technical Field
The invention relates to the technical field of information visualization and visual analysis, in particular to a projection-based abnormal component inspection visualization method.
Background
With the advent of industry 4.0, modern industrial systems have now presented a new trend towards large, automated, complex developments. Due to the improvement of the complexity of the system, the failure of key equipment can bring about huge economic loss and even casualties. Taking an airplane as an example, due to uncertainty of a working environment, the engine has the problems of inconsistent exhaust gas temperature or rotor rotation speed, surge and the like, namely, subsystems or components show abnormal states. How to find abnormal components from mass data and analyze the abnormal modes and the reasons of the abnormal components is a focus of attention of predictive maintenance personnel.
In the existing aviation system, the abnormal component checking method is often independently operated as a black box and integrated into an alarm system, an analyst can only obtain a model calculation result, and abnormal discovery and attribution analysis are difficult to develop on example-level data. In the field of artificial intelligence and data mining, although a plurality of anomaly detection methods are proposed successively in recent years, the method focuses on improving the accuracy of a model, causes of anomalies of the model are lacking from the viewpoint of data, and the statistical analysis method has the defects of poor result interpretation, complex operation of the analysis method and the like.
The information visualization and visual analysis method provides a new solution for predictive maintenance personnel, and the analysis personnel can explore rules and modes behind the data by utilizing the intuitiveness of visual presentation and flexible interaction between views. The information visualization method converts the data into perceivable graphics, symbols, colors and the like to enhance the data identification efficiency and transfer effective information. Meanwhile, the visual analysis method can combine human intelligence and machine intelligence, and utilizes the interactive visual interface to introduce human knowledge and experience into the analysis flow, so as to assist analysts to complete data analysis and reasoning decision comprehensively and intuitively.
Visual analysis works to provide effective assistance in data insight in terms of anomaly discovery. In the specific practice of abnormal component identification, although the existing visual analysis work can identify abnormal data and give an alarm in time, only the comparison analysis of the abnormal data and the normal data is considered, the exploration and attribution of similar modes of the abnormal data are lacking, and the research of the similar modes has reference value for similar fault diagnosis, abnormal component screening and high-quality data set construction.
Disclosure of Invention
In view of the above problems, the present study is to provide a projection-based abnormal component inspection visualization method, which not only can help predictive maintenance personnel to quickly find abnormal components in data, but also can support abnormal mode discovery and abnormal attribution of component data examples, and can also guide work deployment of subsequent components so as to prolong the service life of the components. The specific technical scheme is as follows:
a projection-based abnormal-component-inspection visualization method, comprising the steps of:
S1, data acquisition and processing
After the data of the aviation gas turbofan engine are obtained, the data are cleaned and processed, and the data are arranged into structural data which can be directly used in a visual view;
s2 visual mapping
Visually mapping the data obtained in step S1 through a visual channel:
Designing a component projection view, and using a scatter diagram and a contour diagram to present a distribution overview of data, wherein the degree of closeness of the distribution overview represents the similarity of a component degradation mode;
displaying real data of the monitored variables by using an area chart, and vertically stacking and discharging the area chart so that an analyst can find the relation among the variables;
s3, visual layout and realization
Visual layout is carried out on the visual module which completes mapping in the step S2, and the visual layout is realized:
in the component projection view, mapping the component into each point in the scatter diagram, and reserving the relative positions among the scatter diagrams, wherein aiming at the diagrams, the running condition of the component is recorded for an improved radar diagram in the middle, and a plurality of overlapped circular rings are arranged on the outer side;
in the data inspection view, a time axis layout method is adopted, a dotted line diagram at the top of the view shows a working condition sequence running in the current time period, and a time axis module with an area diagram is arranged at the bottom of the view;
S4, interactive design
Under the scattered point view, when the mouse is suspended on a certain data point, the point is highlighted for a user to check the local area of the scattered point in detail through zooming, a closed irregular graph is drawn on the scattered point, the scattered point is replaced by a font for deep exploration of data, and the length of a time slice coded by each circular ring in the font is controlled through a sliding bar.
Further, in step S1, the data processing includes:
(1) And (3) identifying working conditions:
dividing the working state according to 3 operation conditions of height, mach number and sea level temperature, and clustering working conditions by using k-means;
(2) Data dimension reduction:
The component is characterized by multidimensional time sequence data, namely:
Xk=[t1,t2…,ti,…,tT],k=1,2,…,n,ti∈RM
Wherein n is the number of components, M is the number of selected features, T is the length of a life cycle, T i is the value of each feature at the moment i, and k is the number of the components;
And (3) polymerizing data by adopting a sliding time window to obtain:
wherein, w is the size of a time window, x i is the value of each characteristic of the window w_i after aggregation;
For the resulting x w_i, using t-distributed random neighborhood embedding, let the x w_i vector be projected into one-dimensional space, then we will represent the building block as:
Xk∈RT-w+1,k=1,2,...,n
aiming at the degradation curve obtained after the dimension reduction, the dimension reduction of t-SNE is carried out to a two-dimensional space, and X k is expressed as:
Xk∈R2,k=1,2,...,n
wherein n is the number of components;
(3) Outlier calculation:
Defining anomalies using Pearson coefficients, two time series of the s-th monitored variable for building blocks numbered k i and k j The correlation is defined as follows:
Wherein: The value of the s-th monitored variable, which is the component numbered k i, at time point t; The value of the s-th monitored variable for component k j at time t; TS is the length of the current time interval; Is that Average value in time interval TS;
wherein outliers are defined as:
Wherein n is the number of members.
Further, in step S2, the visual mapping is specifically:
s21, mapping the shape, the position and the color of the scatter diagram in the component projection view, wherein the position of the scatter diagram is used for mapping the dimension reduction result of the component;
s22, carrying out color and area mapping on the contour map in the component projection view, namely filling blank positions among contour lines by adopting specific colors, mapping the density of the component projected in a local area by using the depth of the filling color, and mapping the quantity of behaviors by using the area of petals;
S23, carrying out shape, size and color mapping on an improved radar chart of a special font in a component projection view, wherein different working condition categories are mapped by using the colors of points;
S24, mapping positions, radians and colors of an improved radar chart of a special font in a component projection view, wherein different ring positions encode different monitoring indexes;
S25, mapping the colors and the positions of the data inspection view, wherein six symbols are adopted to map six working conditions, connecting lines between origins of different colors represent time sequence relations of the components working under different working conditions, and the left side of the area diagram is connected by using three Bezier curves aiming at highly relevant monitoring variables.
Further, the visual layout scheme of the component projection view specifically includes:
S3a, mapping the scatter diagram and the contour line on a screen according to the width and the height of the screen distributed by the system, and mapping the distribution density of components in different position areas through the density and the change of the transparency of the line in the contour line to complete the basic layout of the view;
S3b, for the component font view, selecting each slice range and the monitoring variable to be displayed in a control column, and calculating the font size according to the number of the monitoring variables;
s3c, after the size is determined, the fonts are scattered according to a collision detection algorithm based on force-guided layout, so that the problem of vision shielding of the view caused by dense data of local areas and sparse remote areas is solved;
S3d, counting the operation times of each component under six working conditions, and determining the distance from each point in the improved radar chart to the central point through a linear scale;
and S3e, for the superimposed ring graph, sequentially drawing the ring from inside to outside according to the selection sequence of the monitoring variables, cutting the ring into a plurality of fragments according to the time slice length, and completing the drawing of each fragment according to the abnormal value of each time slice.
Further, the mapping the density of the component distribution in the different location areas by the density and the transparency change of the contour line in S3a specifically includes:
S3a1:two-dimensional scalar field computation
1A) For a given set of data points (x 1,x2…,xn;y1,y2,…,yn), determining its minimum and maximum observations, i.e., (x 1,y1) and (x n,yn), where n is the number of components;
1b) Estimating a grid group distance h to be used in performing two-dimensional grid division, thereby obtaining grid demarcation points (a 0,a1,…,aVe) and (b 0,b1,…,bHe) of data, wherein a ve+1-ave=h,bhe+1-bhe =h, ve=0, 1,..;
the mesh group distance h is calculated as follows:
Wherein IQR (x) is the difference between the upper quartile value and the lower quartile value of the sample, n is the number of components;
1c) Counting the scattered data frequency in each grid area, namely (g 0,0,g0,1,…,gVe,He);
S3a2 contour calculation
2A) Determining an initial threshold value, comparing the data value in the grid with the set threshold value, marking the grid larger than the threshold value as 1, and otherwise marking the grid value as 0, thereby obtaining a binary image;
2b) Constructing a binary index according to four values of corners of each cell, scanning around the cell in a clockwise direction, and generating a four-bit index by using bitwise OR operation from the highest displacement to the lowest displacement of the upper left corner;
2c) Accessing a pre-built look-up table using the cell index, listing the desired edges representing the cells;
2d) Applying linear interpolation between the raw data values to find the exact position of the contour along the cell edges;
s3a3 scale mapping and color mapping
The contour fill color is the contour color curve transitions smoothly from white to dark, and the system represents the area density as a value of 0 to 1.
Further, in the step S3c, the discrete opening method of the font is as follows:
S3c1, obtaining the maximum value and the minimum value of xy coordinates of all data selected by the noose, mapping the difference value between the maximum value and the minimum value into the width and the height of the canvas, recalculating the real coordinates of the data in the canvas according to a linear scale, and reprojecting the data points into a new canvas;
s3c2, assuming the data points of the canvas as dotted particles with the same mass radius, and enabling the particles to move by adding different mechanical models;
s3c3, adding collision force to each particle, and calculating the position and the speed of the particle after the delta t time by using Verlet;
and S3c4, repeatedly iterating for a plurality of times to obtain the position of the particle at the final speed of 0, namely the position of the final data point, wherein the particles are related and separated and tend to be stable.
Further, the S3d specifically is:
S3d1, mapping the operation statistics under each working condition into a distance cRadius from the center of the radar chart to the axis, wherein the calculation process is as follows:
Wherein: The maximum operation times under the working condition o; The sum o is the running times of the current component under the working condition O, radarRadius is the radius of the radar chart, and O is the total number of working condition categories;
S3d2, calculating X-axis coordinates xpos o and Y-axis coordinates ypos o of the point in the graph according to the distance between the point and the circle center, wherein the X-axis coordinates xpos o and the Y-axis coordinates ypos o are specifically as follows:
s3d3, mapping the life cycle length of the component by the size of the center circle, mapping the operation times of the size of the scattered points on the radar coordinate axis under the working condition, keeping the same with the position away from the circle center, and calculating the secondary coding as follows:
Wherein rul min is the shortest life cycle of the component, rul max is the longest life cycle of the component, R is the maximum radius of the central circle, R is the maximum radius of the coordinate axis circle, rulRadius is the radius of the central circle, and dotArea o is the distance from the point represented by the working condition o to the circle center;
S3d4, mapping the abnormal value of the subsequence of each monitoring variable into the length of an arc, and calculating the occupied angle of the arc as follows:
wherein RULLen is the life cycle length of the member degradation; representing the length of the s-th monitored variable of component k within the time interval TS;
s3d5, the distance between the circle represented by the S-th monitoring variable and the center of the circle is as follows:
rRadiuss=rDis+(rGap+rBandWidth)·(s-1),s=1,2,3,...,M
Where M is the number of selected features, rDis inner ring radius, rGap is the spacing distance between rings, and rBandWidth is the width of the rings.
Further, in step S3, the specific process of the visual layout and implementation of the data inspection view is as follows:
S31, determining a life cycle range of display, designing an area diagram, and connecting two highly-related monitoring variables by adopting a cubic Bezier curve according to calculation;
S32, determining the colors corresponding to the working conditions, determining the display forms of the dots, including the layout positions and the connection line drawing modes, and laying out and realizing the dot-line graph.
The beneficial effects of the invention are as follows:
1) The invention aims to overcome the defect of the existing method in the aspect of component abnormal data pattern analysis. The traditional alarm system often cannot acquire the specific reasons of the data abnormality. According to the visualization method, abnormal data in the data can be intuitively found through dimension reduction projection, and a special font is designed for positioning the abnormal reasons of the abnormal data subset. The abnormal mode can be clearly and intuitively displayed through the juxtaposition of the special fonts.
2) The invention overcomes the defects that the traditional system can only check the original data, has large cognitive load and is difficult to locate abnormally. The visualization method subdivides the working conditions of the components into six types, and a user can directly check the working condition sequence of the working of the components to explain the component abnormality caused by the abnormality of the working condition sequence. Meanwhile, the three-time Bessel connection can assist a user to find a monitoring index with extremely high relevance, and the abnormality can be conveniently located.
Drawings
FIG. 1 is a schematic view of the overall flow chart of the projection-based abnormal component inspection visualization method of the present invention.
FIG. 2 is a schematic diagram of a visual coding and visualization layout of a projection view of a component in the present invention.
FIG. 3 is a schematic diagram of a visual coding and visual layout of a medium value graph of the present invention.
Fig. 4 is a schematic diagram of visual coding and visual layout of glyphs in the present invention.
Fig. 5 is a schematic diagram showing the implementation effect and overall layout of the data inspection view in the present invention.
Detailed Description
The invention is described in further detail below with reference to the drawings and specific data.
According to the invention, through an effective information visualization method and combining a multi-view linkage strategy and a flexible interaction means, multi-angle analysis of anomalies in aviation gas turbofan engine data is realized, predictive maintenance personnel are helped to find anomalies in the data, and the anomalies are analyzed in a mode and a reason of the anomalies. The technical scheme includes that the method comprises the steps of data acquisition and processing, visual mapping, visual layout and realization and interactive design. The method comprises the following specific steps:
step one, data acquisition and processing
And screening effective information according to aviation gas turbofan engine data provided by the NASA of the American aerospace agency, and storing the data.
1. Data acquisition the FD004 data set in the data is selected for use, wherein the FD004 data set contains data from initial state operation to complete failure of 249 engines. Wherein the maximum life cycle of the engine is 543 and the minimum life cycle is 128. The specific data comprises 26 dimensions, wherein 1-2 dimensions are machine numbers and time points, 3-5 dimensions are working conditions of the machine, namely height, mach number and sea level temperature, and 6-26 dimensions are monitoring indexes of an engine, namely fan inlet total temperature, low-pressure compressor temperature, high-pressure compressor temperature and low-pressure turbine temperature, fan inlet pressure, bypass conveying pipe pressure, high-pressure compressor air pressure, physical fan rotating speed, physical core rotating speed, engine pressure ratio, high-pressure compressor static pressure, fuel quantity and high-pressure turbine static pressure ratio, corrected fan rotating speed, corrected core rotating speed, bypass ratio, fuel-air ratio in a combustion chamber, exhaust valve heat content, required fan rotating speed, corrected required fan rotating speed, high-pressure turbine coolant discharge and low-pressure turbine coolant discharge.
2. And the data processing comprises working condition identification and data dimension reduction operation.
(1) Working condition identification, namely, the working conditions of the engine can be divided according to 3 operating conditions of height, mach number and sea level temperature because the engine works in different working conditions. And clustering the working conditions by using k_means. The main flow of the algorithm is that 1) k is designated, namely data is divided into k categories, 2) k points are randomly selected from the data to serve as the mass centers of each cluster, 3) the distance between the data points and the mass centers is measured through a certain distance calculation method, the data points are divided into the nearest cluster mass centers, 4) the mass centers of each cluster are recalculated, 5) if one of three conditions that the mass centers of newly formed clusters are not changed any more, the points are kept in the same cluster and the maximum iteration times is reached, iteration is ended, otherwise, the steps 3 to 5 are repeated. Experiments find that the effect is best when k=6, and the working state of the engine is divided into working condition 1, working condition 2, working condition 3, working condition 4, working condition 5 and working condition 6.
(2) The data dimension reduction means that the component can be characterized by multidimensional time sequence data, namely:
Xk=[t1,t2…,ti,…,tT],k=(1,2,…,n),tj∈RM
Wherein n is the number of components, M is the number of selected features, T is the life cycle length, T i is the value of each feature at the moment i, and k is the component number.
In order to reduce the deviation caused by inaccurate calculation amount and data acquisition, a sliding time window is adopted to aggregate data, so that the method comprises the following steps of:
Wherein, w is the size of a time window, x w_i is the value of each characteristic of the window w_i after aggregation;
for the obtained x w_i, the random neighborhood embedding is carried out by utilizing t-distribution, and the algorithm steps are as follows:
a) Describing the similarity between vectors by using Euclidean distance, wherein the similarity is expressed by conditional probability:
Wherein σ w_i is the variance.
B) Calculating the similarity between points in the two-dimensional space to be projected, and using conditional probability representation:
Where y w_i is the position of the data point in the low-dimensional space.
C) Kullback-Leibler divergence is commonly used to measure the distance between two probability distributions to minimize the sum of K-L divergences:
Wherein n is the number of members.
D) To alleviate the problem of crowding data points in low dimensional space, the similarity between data points is calculated using t-distribution.
Where y w_i is the position of the data point in the low-dimensional space.
Through the above steps, the x w_i vector is projected into one-dimensional space, and the component can then be represented as:
Xk∈RT-w+1,k=(1,2,...,n)
For the one-dimensional time sequence after dimension reduction, the dimension reduction is performed in the two-dimensional space by using the t-SNE, and X k can be expressed as follows:
Xk∈R2,k=(1,2,...,n)
Wherein n is the number of members.
(3) Outlier calculation Using Pearson coefficient to define anomalies for two time series of the No. k i,kj s-th monitored variableThe correlation is defined as follows:
Wherein: The value of the s-th monitored variable, which is the component numbered k i, at time point t; The value of the s-th monitored variable for component k j at time t; TS is the length of the current time interval; Is that Average value in time interval TS;
wherein outliers are defined as:
Wherein n is the number of members.
Step two, visual mapping
After data acquisition and processing, the visual mapping scheme design is carried out on the component projection view (shown in fig. 2) and the data inspection (shown in fig. 5) in the invention.
1. Component projection view
(1) Scatter diagram
The method comprises the steps of carrying out shape, position and color mapping on a scatter diagram in a component projection view, wherein the dimension reduction result of the component is mapped by the position of the scatter diagram, the clustering result of the color mapping component is obtained, and the transparency of the point is mapped to the life cycle length of the component. Specific examples are as follows:
the position is that one point in the scatter diagram is mapped with a component, the two-dimensional coordinates of the point in the view represent the dimension reduction result of the component, and the distance between the points is mapped with the similarity degree between the degradation tracks of the component;
Color-clustering the components into 6 classes according to their degradation trajectories, each class being labeled with a different color. The points with the same color are gathered together, so that the clustering effect is better;
transparency-transparency of a dot characterizes the length of the lifecycle of a component, the longer the lifecycle, the less the transparency.
(2) Contour map
Color mapping the contour map in the component projection view, namely filling blank positions among contour lines by adopting specific colors, and mapping the density of the component projected in the local area by using the darkness of the filling colors.
The area of petals is used for mapping the number of behaviors. Specific examples are as follows:
Contour color-for blank positions between contours, the system fills them with green. If the density of the projected component is larger in the local area, the filled color is darker, so that an analyst is assisted to know the real situation of data distribution;
the larger the area, the more times are indicated, and the smaller the area, the less times are indicated.
(3) Special character form
This portion adopts a radial layout, comprising two portions INNERAREA and OuterArea, as shown in fig. 4 (a). At section INNERAREA, the number of runs per operating condition is mapped using a modified radar map. At section OuterArea, superimposed loop graphs are used to represent the degree of anomaly of the different monitoring indicators over various time periods.
InnerArea
The method comprises the steps of carrying out shape, size and color mapping on an improved radar chart of a special font in a component projection view, mapping different working condition categories by using the colors of points, mapping the life cycle length of the component by using the size of a center circle, encoding the running number of the component under six working conditions by using the shape of the radar chart, and comparing the running conditions of the component during degradation by comparing the shapes of hexagons. Specific examples are as follows:
color, namely mapping different working condition categories through 6 colors.
Size: the size of the center circle encodes the lifecycle length of the component.
The shape comprises mirror image arrangement of six shafts, the running quantity under six working conditions is encoded through the distance between the points on the shafts and the shaft center, the points on the shafts are connected end to form an irregular hexagon, and the running condition of the component during degradation can be roughly compared by comparing the shapes of the hexagons.
OuterArea
The method comprises the steps of mapping positions, radians and colors of an improved radar chart of a special font in a component projection view, wherein different ring positions encode different monitoring indexes, the radian encodes the length of a time slice, and the color of the ring encodes the abnormality degree of the time slice. Specific examples are as follows:
The location OuterArea includes a plurality of rings, each ring representing a monitoring index. In fig. 4 (a), four circles from inside to outside are the monitored data of the total fan inlet temperature, the high pressure compressor temperature, the low pressure turbine temperature, and the fan inlet pressure, respectively. Wherein the number of rings and the representative monitoring index can be selected by the user himself.
The radian, namely slicing the monitored data in time, wherein the length of the time slice is selected by a user, the radian maps the length of the time slice, and the degradation time period is divided into five time slices in the figure 4 (a);
color-each circular slice has a different color, and the degree of abnormality of the time slice is encoded using the color.
2. Data inspection view
The color and position mapping of the data inspection view comprises the steps of mapping six working conditions by adopting six symbols, representing the time sequence relation of the component working under different working conditions by connecting lines between origins of different colors, and connecting the left side of an area diagram by using a three-time Bezier curve aiming at highly relevant monitoring variables. Specific examples are as follows:
(1) Point diagram
The color adopts six color systems with obvious contrast ratio of red, orange, yellow, green, cyan and blue to map each working condition.
Length is the number of runs of the length coding member under the working condition by utilizing the rectangle.
(2) Area map
Height-the height of the area map maps the size of the real data.
The two ends of the red connecting line are respectively highly relevant monitoring variables, and when the Pearson coefficient is greater than 0.5, the system can be automatically connected with the corresponding variables.
Step three, visual layout and realization
1. Component projection view visualization layout and implementation
When the number of the components is too large, the scattered points in the scattered points can be blocked, so that an analyst cannot effectively acquire the data density in the current area. By mapping the density of the line and the change of the transparency in the contour map (shown in fig. 3) to the distribution density of the components in the areas at different positions, a user can clearly find dense or sparse areas in the data, and the method has important significance for finding the outlier islands in the data. The implementation method is as follows:
(1) Two-dimensional scalar field computation
A) For a given set of data points (x 1,x2…,xn;y1,y2,…,yn), its minimum and maximum observations, i.e., (x 1,y1) and (x n,yn), are determined, where n is the number of components.
B) The grid set distance h that should be used in performing the two-dimensional grid division is estimated to obtain grid demarcation points (a 0,a1,…,aVe) and (b 0,b1,…,bHe) of the data, where a ve+1-ave=h,bhe+1-bhe =h, ve=0, 1,... The calculation mode of h is as follows:
Where IQR (x) is the difference between the upper and lower quartile values of the sample, and n is the number of components.
C) The scatter data frequency in each grid area is counted, i.e., (g 0,0,g0,1,…,gVe,He).
(2) Contour line calculation
A) Step one, determining an initial threshold value, comparing a data value in a grid with a set threshold value, marking the grid larger than the threshold value as 1, and otherwise marking the grid value as 0, thereby obtaining a binary image.
B) Assume that a pixel block of 2 x2 in a binary image is a contour element. A binary index is constructed from four values at each cell corner, scanned around the cell in a clockwise direction, and a four-bit index is generated from the highest displacement in the upper left corner to the lowest using a bitwise OR operation.
C) A pre-built look-up table is accessed using the cell index, which contains 16 entries listing the edges required to represent the cells.
D) Linear interpolation is applied between the raw data values to find the exact position of the contour along the cell edges. And obtaining a contour line of a certain threshold value through the four steps, and repeating the four steps with num% serving as a density interval to obtain num contour lines.
(3) Scale mapping and color mapping
And according to the width and the height of the screen distributed by the system, mapping the scatter diagram and the contour line onto the screen according to a linear scale, and completing the basic layout of the view.
Contour fill color-contour color curve transitions smoothly from white to dark green. The system expresses the area density as a value of 0 to 1, and then the final color expression is obtained by the following formula:
Wherein hexColor is RGB color hexadecimal value, data i is density of the current area, data min is density minimum value, and data max is density maximum value.
After user interaction selects the collection of components, the view will preserve the relative positions between the components, expanding into a special glyph as shown in the right half of FIG. 2. The period of time and the monitored amount of abnormality of the member degradation process can be found by the glyph inspection. Meanwhile, the degradation pattern of the abnormal member can be found by discriminating the degree of similarity between glyphs. The special font implementation process is as follows:
(1) Spatial layout implementation
After the abnormal components to be analyzed are selected, the font design of the canvas display component set needs to be regenerated according to the circled area, and the shape selected by the user lasso cannot be obtained, so that when the local area is data-intensive and the remote area is sparse, serious visual occlusion of the view can occur, as shown in fig. 2. The collision detection algorithm based on the force guiding layout is realized, the mutual intersection between fonts is avoided, and the following is realized:
a) And obtaining the maximum value and the minimum value of xy coordinates of all data selected by the noose, mapping the difference value between the maximum value and the minimum value into the width and the height of the canvas, re-calculating the real coordinates of the data in the canvas according to the linear scale, and re-projecting the data points into a new canvas.
B) The data points of the canvas are assumed to be dotted particles with the same mass radius, and the particles are made to move by adding different mechanical models.
C) A certain collision force is added to each particle, and the position and velocity of the particle after Δt time are calculated using Verlet.
D) And repeatedly iterating for a plurality of times, and obtaining the position of the particle when the final speed is 0, namely the position of the final data point, wherein the particles are separated in a correlation way and tend to be stable. At this time, it can be found that the relative positions between the scattered points are well preserved, and cross overlapping occurs between the glyphs.
(2) Font design implementation
For a single component, the data includes operation statistics of the component under each working condition when the component is degraded, and the operation statistics are specifically defined as follows:
D={Io|o∈[1,O]}
wherein O is the type of working condition, and I o is the number of times of operation under the working condition O.
As shown in fig. 4 (b), the operation statistics under each condition are mapped to the distance cRadius from the center of the radar chart to the axis, and the calculation process is as follows:
Wherein: The maximum operation times under the working condition o; The method is characterized by comprising the steps of determining the minimum operation times under the working condition O, determining the operation times of a current component under the working condition O by sum o, determining the radius of a radar chart by radarRadius, and determining the total number of working condition categories by O.
The X coordinate xpos o and the Y coordinate ypos o of the point in the graph are calculated according to the distance between the point and the circle center, and the method is concretely as follows:
the size of the center circle maps the life cycle length of the component, the operation times of the size mapping of the scattered points on the radar coordinate axis under the working condition are consistent with the position away from the center of a circle, the size mapping belongs to secondary coding, and the calculation is as follows:
Wherein rul min is the shortest life cycle of the component, rul max is the longest life cycle of the component, R is the maximum radius of the central circle, R is the maximum radius of the coordinate axis circle, rulRadius is the radius of the central circle, and dotArea o is the distance from the point represented by the working condition o to the center of the circle.
The subsequence outlier of each monitored variable is mapped to the length of an arc, and the angle occupied by the arc is calculated as follows:
wherein RULLen is the life cycle length of the member degradation; representing the length of the s-th monitored variable of the component k within the time point interval TS;
the distance between the circle k and the center of the circle is as follows:
rRadiuss=rDis+(rGap+rBandWidth)·(s-1),s=1,2,3,...,M
where M is the number of selected features, rDis inner ring radius, rGap is the spacing distance between rings, and rBandWidth is the width of the rings, respectively.
2. Data inspection view visualization layout and implementation:
the connection line is drawn-the left side of the area diagram is marked with a red connection line for highly relevant variables as shown in fig. 5. When the Pearson coefficient is greater than 0.5, the system will draw a cubic bezier curve automatically connecting the corresponding variables. Four points are needed for drawing the three-time Bezier curve, namely a starting point spoint, an ending point epoint and two control points cpoint1 and cpoint2, and the result after drawing is shown in FIG. 4. The two control point coordinates are determined as follows:
Wherein spoint x is the x-axis coordinate of the starting point, epoint x is the x-axis coordinate of the ending point, epoint y is the y-axis coordinate of the ending point, and spoint y is the y-axis coordinate of the starting point.
Step four, interactive design
In the component projection view, highlighting, scaling, lasso, time slicing and reconfiguration operations are included:
highlighting, namely under the scattered point view, in order to help a user to quickly know basic information of the component, the system provides a highlight interaction mode, when an analyst focuses a mouse on a certain data point, the point is highlighted by utilizing color, and the information such as the number, the life cycle length, the working condition operation statistics condition and the like of the component is displayed by assisting a prompt box.
Scaling-taking into account that more data points may result in overlap between points, causing visual confusion. The user can view the local area in detail through zooming, and meanwhile, the selection of the data points is facilitated later.
Lasso-users often need to examine this portion of data after finding an outlier or point of interest. The method supports drawing a closed irregular graph on a scatter diagram, if a certain data point is positioned in the middle of the graph when a user completes lasso, the data point is selected, and the font corresponding to the data point is displayed in a pop-up view.
Time slicing-analysts can control the extent of the time slices by means of the slide bars above. Greater flexibility is provided herein in that a user may select overlapping time slices of unequal length to explore the relevance of the monitored variable of the component during degradation.
Reconfiguration-the present visualization method allows the user to switch between the scatter plot and the contour plot via tabs. When the component life cycle is too long, the transparency of the dots may be too low, causing hiding between the dots. Meanwhile, due to uneven data distribution or overlarge data quantity, serious shielding can occur to view parts, and at the moment, vision confusion is relieved by switching to the contour map through reconfiguration.

Claims (6)

1.一种基于投影的异常构件检查可视化方法,其特征在于,包括以下步骤:1. A projection-based abnormal component inspection visualization method, characterized by comprising the following steps: S1:数据获取与处理S1: Data acquisition and processing 获取到航空燃气涡扇发动机数据后,对数据进行清洗和处理,整理成可视化视图能够直接使用的结构性数据;After obtaining the aviation gas turbofan engine data, the data is cleaned and processed, and organized into structured data that can be directly used in a visual view; S2:可视化映射S2: Visual Mapping 通过视觉通道将经步骤S1获得的数据进行可视化映射:Visually map the data obtained in step S1 through the visual channel: 设计构件投影视图,用散点图与等值线图呈现数据的分布概览,其临近程度表示构件退化模式的相似性;用一种在多层圆环中嵌套雷达图的特殊字形对比异常数据中的相似模式;Design component projection views, presenting an overview of data distribution using scatter plots and contour plots, with proximity indicating similarity in component degradation patterns; using a special glyph of radar charts nested in multiple layers of rings to contrast similar patterns in abnormal data; 设计数据检查视图,用水平柱状图映射构件在不同工况下的运行次数;用面积图展示监测变量的真实数据,将面积图竖直堆叠排放以便分析人员发现各变量之间的关系;Design data inspection views, use horizontal bar charts to map the number of times components operate under different working conditions; use area charts to display the real data of monitored variables, and stack the area charts vertically to help analysts discover the relationship between the variables; S3:可视化布局与实现S3: Visualization layout and implementation 对S2中完成映射的视觉模块进行可视化布局并实现:Visualize and layout the visual modules that complete the mapping in S2: 在构件投影视图中,将构件映射为散点图中的每个点,字形保留散点之间的相对位置;针对字形,中间为一个改进的雷达图记录该构件的运行情况,外侧为多个叠加圆环;In the component projection view, the component is mapped to each point in the scatter plot, and the glyph retains the relative position between the scatter points. For the glyph, there is an improved radar chart in the middle to record the operation of the component, and multiple superimposed rings on the outside. 在数据检查视图中,采用时间轴的布局方法,视图顶部的点线图展示在当前时间段下运行的工况序列,视图底部是带有面积图的时间轴模块;In the data inspection view, the timeline layout method is adopted. The point-line graph at the top of the view shows the operating sequence running in the current time period, and the timeline module with an area graph is at the bottom of the view; S4:交互设计S4: Interaction Design 在散点视图下,鼠标悬浮于某个数据点上时高亮该点,供用户通过缩放对散点图的局部区域进行详细查看;通过在散点图上绘制一个闭合的不规则图形,将散点替换为一个字形,用以对数据进行深入探索;通过滑动条控制字形中每个圆环编码的时间片长度;In the scatter plot view, hovering the mouse over a data point highlights the point, allowing users to zoom in and out to view the local area of the scatter plot in detail; by drawing a closed irregular shape on the scatter plot, the scatter point is replaced with a glyph to explore the data in depth; the length of the time slice encoded by each ring in the glyph is controlled by the slider; 在步骤S2中,可视化映射具体为:In step S2, the visualization mapping is specifically as follows: S21:对构件投影视图中的散点图进行形状、位置、颜色映射:用散点的位置映射构件的降维结果;颜色映射构件的聚类结果;点的透明度映射构件的生命周期长度;S21: Map the shape, position and color of the scatter plot in the projection view of the component: use the position of the scatter plot to map the dimension reduction result of the component; use the color to map the clustering result of the component; and use the transparency of the point to map the life cycle length of the component. S22:对构件投影视图中的等值图进行颜色、面积映射:采用特定的颜色对等值线之间的空白位置进行填充;用填充颜色的深浅映射在局部区域投影的构件密度;用花瓣的面积映射行为数量的多少;S22: Perform color and area mapping on the contour map in the component projection view: use specific colors to fill the blank spaces between the contour lines; use the depth of the filling color to map the density of the projected components in the local area; use the area of the petals to map the number of behaviors; S23:对构件投影视图中特殊字形的改进雷达图进行形状、大小、颜色映射:用点的颜色映射不同的工况类别;用中心圆的大小映射该构件的生命周期长度;用雷达图的形状编码该构件在六种工况下的运行数量,通过对比六边形的形状以对比构件在退化期间的运行情况;S23: Shape, size and color mapping of the improved radar chart with special glyphs in the component projection view: Use the color of the points to map different working condition categories; Use the size of the center circle to map the life cycle length of the component; Use the shape of the radar chart to encode the number of operations of the component under six working conditions, and compare the operation of the component during the degradation period by comparing the shapes of the hexagons; S24:对构件投影视图中特殊字形的改进雷达图进行位置、弧度、颜色映射:不同的圆环位置编码不同的监测指标;弧度编码时间片长度的大小;圆环的颜色编码该时间片的异常程度;S24: Map the position, arc and color of the improved radar chart of special glyphs in the component projection view: different ring positions encode different monitoring indicators; the arc encodes the length of the time slice; the color of the ring encodes the abnormality of the time slice; S25:对数据检查视图颜色、位置映射:采用六种符号来映射六种工况;不同颜色原点之间的连线表示构件在不同工况下工作的时序关系;针对高度相关的监测变量,在面积图左侧用用三次贝塞尔曲线连接;S25: Data inspection view color and position mapping: six symbols are used to map six working conditions; the lines between the origins of different colors represent the timing relationship of the components working under different working conditions; for highly correlated monitoring variables, cubic Bezier curves are used to connect them on the left side of the area chart; 所述构件投影视图的可视化布局方案具体为:The visualization layout scheme of the component projection view is specifically as follows: S3a:对于散点图和等值线图,根据系统分配的屏幕宽度和高度,将散点图和等值线按线性比例尺映射到屏幕上,通过等值线图中线的密度与透明度的变化映射不同位置区域内的构件分布密度,完成视图的基本布局;S3a: For scatter plots and contour plots, according to the screen width and height allocated by the system, the scatter plots and contour lines are mapped onto the screen in a linear scale. The density and transparency of the lines in the contour plots are used to map the distribution density of components in different location areas, thus completing the basic layout of the view. S3b:对于构件字形视图,在控制栏选择每个片范围,和需要展示的监测变量,根据监测变量的数量计算字形的大小;S3b: For the component glyph view, select each slice range and the monitoring variables to be displayed in the control bar, and calculate the glyph size according to the number of monitoring variables; S3c:确定大小后,根据基于力引导布局的碰撞检测算法将字形离散开,解决局部区域数据密集,且远处区域稀疏时导致的视图出现的视觉遮挡问题;S3c: After the size is determined, the glyphs are discretized according to the collision detection algorithm based on force-guided layout to solve the visual occlusion problem caused by dense data in local areas and sparse data in distant areas; S3d:统计每个构件在六种工况下的运行次数,通过线性比例尺确定改进雷达图中每个点距离中心点的距离;S3d: Count the number of operations of each component under six working conditions, and determine the distance from each point to the center point in the improved radar chart through a linear scale; S3e:对于叠加环图,根据监测变量的选择顺序从里向外依次绘制圆环,根据时间片长度对圆环切割成多个片段,根据每个时间片异常值大小,完成每个片段的绘制。S3e: For the superimposed ring chart, the rings are drawn from the inside to the outside according to the selection order of the monitoring variables, and the rings are cut into multiple segments according to the length of the time slice. The drawing of each segment is completed according to the size of the outlier in each time slice. 2.根据权利要求1所述的基于投影的异常构件检查可视化方法,其特征在于,在步骤S1中,数据处理包括:2. The projection-based abnormal component inspection visualization method according to claim 1, characterized in that in step S1, data processing comprises: (1)工况识别:(1) Working condition identification: 根据高度、马赫数、海平面温度3种操作条件对工作状态进行划分,利用k-means对工况进行聚类;The working conditions are divided according to three operating conditions: altitude, Mach number, and sea level temperature, and the working conditions are clustered using k-means. (2)数据降维:(2) Data Dimensionality Reduction: 将构件由多维时序数据表征,即为:The component is represented by multi-dimensional time series data, that is: Xk=[t1,t2…,ti,…,tT],k=1,2,…,n,ti∈RM X k =[t 1 ,t 2 ...,t i ,...,t T ],k=1,2,...,n,t i ∈R M 式中:n为构件数量;M为选取特征的数量;T为生命周期长度;ti为时刻i各特征的值;k为构件编号;Where: n is the number of components; M is the number of selected features; T is the life cycle length; ti is the value of each feature at time i; k is the component number; 采用滑动时间窗口对数据聚合,得到:Aggregate the data using a sliding time window and get: 式中:w为时间窗口大小;xw_i为聚合后窗口w_i各特征的值;Where: w is the time window size; x w_i is the value of each feature of window w_i after aggregation; 针对得到的xw_i利用t-分布随机邻域嵌入,使xw_i向量被投影到一维空间中,则将将构件表示为:For the obtained x w_i, use t-distributed random neighborhood embedding to project the x w_i vector into one-dimensional space, then the component is represented as: Xk∈RT-w+1,k=1,2,…,nX kR T-w+1 , k=1,2,…,n 针对降维后得到的退化曲线,再利用t-SNE降维到二维空间中,将Xk表示为:For the degradation curve obtained after dimensionality reduction, t-SNE is used to reduce the dimension to two-dimensional space, and X k is expressed as: Xk∈R2,k=1,2,…,nX kR 2 , k=1,2,…,n 式中:n为构件数量;Where: n is the number of components; (3)异常值计算:(3) Calculation of outliers: 利用Pearson系数定义异常,针对编号为ki和kj的构件的第s个监测变量的两个时间序列其相关性定义如下:Using the Pearson coefficient to define anomalies, for the two time series of the sth monitoring variable of components numbered k i and k j Its relevance is defined as follows: 式中:为编号为ki的构件的第s个监测变量在时间点t下的值;为编号为kj的构件的第s个监测变量在时间点t下的值;TS为当前时间区间的长度;在时间区间TS内的平均值;Where: is the value of the sth monitoring variable of the component numbered k i at time point t; is the value of the sth monitoring variable of the component numbered kj at time point t; TS is the length of the current time interval; for The average value in the time interval TS; 其中异常值定义为:The outliers are defined as: 式中:n为构件数量。Where: n is the number of components. 3.根据权利要求1所述的基于投影的异常构件检查可视化方法,其特征在于,所述S3a中通过等值线图中线的密度与透明度的变化映射不同位置区域内的构件分布密度,具体包括:3. The method for visualizing abnormal component inspection based on projection according to claim 1 is characterized in that the component distribution density in different position areas is mapped by the changes in the density and transparency of the lines in the contour map in S3a, specifically comprising: S3a1:二维标量场计算S3a1: 2D Scalar Field Computation 1a)对于给定的一组数据点(x1,x2…,xn;y1,y2,…,yn),确定其最小观测值和最大观测值,即(x1,y1)与(xn,yn),其中n为构件数量;1a) For a given set of data points (x 1 ,x 2 …,x n ; y 1 ,y 2 ,…,y n ), determine its minimum and maximum observed values, i.e. (x 1 ,y 1 ) and (x n ,y n ), where n is the number of components; 1b)估计在进行二维网格划分时应使用的网格组距h,从而获得数据的网格分界点(a0,a1,…,aVe)与(b0,b1,…,bHe),其中ave+1-ave=h,bhe+1-bhe=h,ve=0,1,…,Ve-1,he=0,1,…,He-1;Ve为竖直方向网格线的数量;He为水平方向网格线的数量;1b) Estimate the grid group spacing h that should be used when performing two-dimensional grid division, so as to obtain the grid dividing points (a 0 ,a 1 ,…,a Ve ) and (b 0 ,b 1 ,…,b He ) of the data, where a ve+1 -a ve =h,b he+1 -b he =h,ve=0,1,…,Ve-1,he=0,1,…,He-1; Ve is the number of grid lines in the vertical direction; He is the number of grid lines in the horizontal direction; 网格组距h的计算方式如下:The grid group distance h is calculated as follows: 式中:IQR(x)为样本的上四分位值与下四分位值之差;n为构件数量;Where: IQR(x) is the difference between the upper quartile and the lower quartile of the sample; n is the number of components; 1c)统计每个网格区域内的散点数据频数,即(g0,0,g0,1,…,gVe,He);1c) Count the frequency of scattered data in each grid area, i.e. (g 0,0 ,g 0,1 ,…,g Ve,He ); S3a2:等值线计算S3a2: Contour calculation 2a)确定一个初始阈值,将网格中数据值与设定的阈值做比较,将大于阈值的网格标记为1,反之将网格值标记为0,由此得到一个二进制图像;2a) Determine an initial threshold, compare the data value in the grid with the set threshold, mark the grid with a value greater than the threshold as 1, and mark the grid value with a value less than the threshold as 0, thereby obtaining a binary image; 2b)假定二进制图像中2*2的像素块为一个轮廓单元;根据每个单元格角落的四个值构建二进制索引,以顺时针方向在单元格周围扫描,使用按位或的操作,从左上角的最高位移动到最低位,生成四位索引;2b) Assume that a 2*2 pixel block in the binary image is a contour unit; construct a binary index based on the four values of each cell corner, scan around the cell in a clockwise direction, use a bitwise OR operation, move from the highest bit in the upper left corner to the lowest bit, and generate a four-bit index; 2c)使用单元格索引访问一个预先构建的查找表,列出表示单元格所需边缘;2c) Use the cell index to access a pre-built lookup table listing the edges needed to represent the cell; 2d)在原始数据值之间应用线性插值,以找到轮廓线沿单元格边缘的确切位置;2d) Apply linear interpolation between the original data values to find the exact position of the contour line along the cell edge; S3a3:比例尺映射和颜色映射S3a3: Scale Mapping and Color Mapping 等值线填充颜色:等值线颜色曲线从白色平滑过渡到深色;系统将区域密度表示为0到1的数值。Contour fill color: The contour color curve smoothly transitions from white to dark; the system represents area density as a value from 0 to 1. 4.根据权利要求1所述的基于投影的异常构件检查可视化方法,其特征在于,所述S3c中,将字形离散开具体方法为:4. The projection-based abnormal component inspection visualization method according to claim 1, characterized in that, in said S3c, the specific method of discretizing the glyphs is: S3c1:获取套索圈选的所有数据的xy坐标的最大值和最小值,将最大值与最小值的差值其映射为画布的宽度和高度,根据线性比例尺重新计算数据在画布中真实的坐标,将数据点重新投影到新的画布中;S3c1: Get the maximum and minimum xy coordinates of all data selected by the lasso, map the difference between the maximum and minimum values to the width and height of the canvas, recalculate the real coordinates of the data in the canvas according to the linear scale, and reproject the data points to the new canvas; S3c2:将画布的数据点假定为质量半径皆相同的带点粒子,通过添加不同的力学模型来使得粒子发生运动;S3c2: Assume the data points on the canvas to be particles with the same mass and radius, and make the particles move by adding different mechanical models; S3c3:为每个粒子添加碰撞力,利用Verlet计算粒子在Δt时间后的位置和速度;S3c3: Add collision force to each particle and use Verlet to calculate the position and velocity of the particle after Δt time; S3c4:反复迭代多次,获取粒子最后速度为0时的位置,即为最终数据点的位置,此时粒子之间相关分离且趋于稳定。S3c4: Repeat the iterations many times to obtain the position of the particle when its final velocity is 0, which is the position of the final data point. At this time, the particles are correlated and separated and tend to be stable. 5.根据权利要求1所述的基于投影的异常构件检查可视化方法,其特征在于,所述S3d具体为:5. The projection-based abnormal component inspection visualization method according to claim 1, characterized in that S3d is specifically: S3d1:将在每种工况下的运行统计数量映射为雷达图中心到轴上的距离cRadius,计算过程如下:S3d1: Map the running statistics under each working condition to the distance cRadius from the center of the radar chart to the axis. The calculation process is as follows: 式中:为工况o下的最大运行次数;为工况o下的最小运行次数;sumo为当前构件在工况o下的运行次数;radarRadius为雷达图半径;O为工况类别总数;Where: is the maximum number of operations under working condition o; is the minimum number of operations under working condition o; sum o is the number of operations of the current component under working condition o; radarRadius is the radius of the radar chart; O is the total number of working condition categories; S3d2:根据点距圆心的距离计算出点在图形中的X坐标xposo和Y轴坐标yposo,具体如下:S3d2: Calculate the X-coordinate xpos o and Y-coordinate ypos o of the point in the figure based on the distance from the point to the center of the circle, as follows: S3d3:中心圆的大小映射该构件生命周期长度,雷达坐标轴上分散的点的大小映射在该工况下的运行次数,与距离圆心的位置保持一致,属于二次编码,计算如下:S3d3: The size of the central circle maps the life cycle length of the component. The size of the scattered points on the radar coordinate axis maps the number of operations under the working condition, which is consistent with the position from the center of the circle. It belongs to secondary coding and is calculated as follows: 式中:rulmin为构件最短生命周期;rulmax为构件最长生命周期;R为中心圆最大半径;r为坐标轴圆最大半径;rulRadius为中心圆的半径;dotAreao为工况o所代表的点距离圆心的距离;In the formula: rul min is the shortest life cycle of the component; rul max is the longest life cycle of the component; R is the maximum radius of the center circle; r is the maximum radius of the coordinate axis circle; rulRadius is the radius of the center circle; dotArea o is the distance from the point represented by working condition o to the center of the circle; S3d4:每个监测变量的子序列异常值映射为弧的长度,圆弧所占用角度计算如下:S3d4: The subsequence outliers of each monitoring variable are mapped to the length of the arc, and the angle occupied by the arc is calculated as follows: 式中:RULLen为构件退化的生命周期长度;表示构件k的第s个监测变量在时间区间TS内的长度;Where: RULLen is the life cycle length of component degradation; represents the length of the sth monitoring variable of component k in the time interval TS; S3d5:第s个监测变量所代表的圆环距离圆心的距离为:S3d5: The distance from the center of the circle represented by the sth monitoring variable is: rRadiuss=rDis+(rGap+rBandWidth)·(s-1),s=1,2,3,…,MrRadius s = rDis+(rGap+rBandWidth)·(s-1), s=1,2,3,…,M 式中:M为选取特征的数量;rDis为内环半径;rGap为环之间的间隔距离;rBandWidth为环的宽度。Where: M is the number of selected features; rDis is the inner ring radius; rGap is the interval distance between rings; rBandWidth is the width of the ring. 6.根据权利要求1所述的基于投影的异常构件检查可视化方法,其特征在于,在步骤S3中,所述数据检查视图的可视化布局与实现的具体过程如下:6. The projection-based abnormal component inspection visualization method according to claim 1 is characterized in that, in step S3, the specific process of visualization layout and implementation of the data inspection view is as follows: S31:确定显示的生命周期范围,设计面积图,根据计算将高度相关的两个监测变量采用三次贝塞尔曲线进行连接;S31: Determine the life cycle range to be displayed, design an area chart, and connect two highly correlated monitoring variables using a cubic Bezier curve according to calculations; S32:确定各工况对应的颜色,确定各圆点的展示形式,包括布局位置和连线绘制方式,将点线图进行布局并实现。S32: Determine the color corresponding to each working condition, determine the display form of each dot, including the layout position and the connection line drawing method, and layout and implement the point-line diagram.
CN202210599858.9A 2022-05-30 2022-05-30 A projection-based visualization method for abnormal component inspection Active CN115080654B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210599858.9A CN115080654B (en) 2022-05-30 2022-05-30 A projection-based visualization method for abnormal component inspection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210599858.9A CN115080654B (en) 2022-05-30 2022-05-30 A projection-based visualization method for abnormal component inspection

Publications (2)

Publication Number Publication Date
CN115080654A CN115080654A (en) 2022-09-20
CN115080654B true CN115080654B (en) 2025-03-21

Family

ID=83249112

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210599858.9A Active CN115080654B (en) 2022-05-30 2022-05-30 A projection-based visualization method for abnormal component inspection

Country Status (1)

Country Link
CN (1) CN115080654B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117906726B (en) * 2024-03-19 2024-06-04 西安艺琳农业发展有限公司 Abnormal detection system for weight data of live cattle body ruler
CN118520317B (en) * 2024-07-23 2024-09-20 四川福莱宝生物科技有限公司 Biological element data analysis method suitable for high-throughput analysis

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663089B (en) * 2012-04-09 2014-11-26 中国科学院软件研究所 Unstructured data visualization method based on stereographic mapping
JP5538597B2 (en) * 2013-06-19 2014-07-02 株式会社日立製作所 Anomaly detection method and anomaly detection system
CN105426628B (en) * 2015-12-10 2018-06-01 中南大学 A kind of complex product isomery Model Fusion method
CN112101522B (en) * 2020-08-20 2023-04-18 四川大学 Interactive machine learning method based on visualization

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
面向构件剩余使用寿命预测的可视分析研究与实现.《万方数据》.2022,1-96. *

Also Published As

Publication number Publication date
CN115080654A (en) 2022-09-20

Similar Documents

Publication Publication Date Title
Kehrer et al. Visualization and visual analysis of multifaceted scientific data: A survey
Ferstl et al. Time-hierarchical clustering and visualization of weather forecast ensembles
CN115080654B (en) A projection-based visualization method for abnormal component inspection
Eick et al. Visual scalability
Keim Designing pixel-oriented visualization techniques: Theory and applications
Chan A survey on multivariate data visualization
Kobourov Spring embedders and force directed graph drawing algorithms
US9342579B2 (en) Visual analysis of multidimensional clusters
Qu et al. Visual analysis of the air pollution problem in Hong Kong
Jänicke et al. Brushing of attribute clouds for the visualization of multivariate data
Vehlow et al. Radial layered matrix visualization of dynamic graphs
CN103838829B (en) Raster vectorization system based on hierarchical boundary-topology search model
Burch et al. Radial edge splatting for visualizing dynamic directed graphs
Chan et al. Flow-based scatterplots for sensitivity analysis
US11320808B2 (en) Plant data display processing device and plant control system
Bentley et al. Animating multidimensional scaling to visualize n-dimensional data sets
Liao et al. Cluster-based visual abstraction for multivariate scatterplots
Cheng et al. TorusVis^ ND: unraveling high-dimensional torus networks for network traffic visualizations
Vehlow et al. Visualizing dynamic hierarchies in graph sequences
Blaas et al. Smooth graphs for visual exploration of higher-order state transitions
Andrienko et al. Episodes and topics in multivariate temporal data
CN116257667B (en) Visual causal relation analysis method oriented to multivariate time sequence
Hilda et al. A review on the development of big data analytics and effective data visualization techniques in the context of massive and multidimensional data
Cantareira et al. Explainable adversarial attacks in deep neural networks using activation profiles
Burch et al. Edge-stacked Timelines for Visualizing Dynamic Weighted Digraphs.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant