CN108681697A - Feature selection approach and device - Google Patents

Feature selection approach and device Download PDF

Info

Publication number
CN108681697A
CN108681697A CN201810401774.3A CN201810401774A CN108681697A CN 108681697 A CN108681697 A CN 108681697A CN 201810401774 A CN201810401774 A CN 201810401774A CN 108681697 A CN108681697 A CN 108681697A
Authority
CN
China
Prior art keywords
feature
sample
analysis model
sample space
quantitative analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810401774.3A
Other languages
Chinese (zh)
Other versions
CN108681697B (en
Inventor
罗娜
韩平
王冬
王世芳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Academy of Agriculture and Forestry Sciences
Original Assignee
BEIJING AGRICULTURAL QUALITY STANDARDS AND TESTING TECHNOLOGY RESEARCH CENTER
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING AGRICULTURAL QUALITY STANDARDS AND TESTING TECHNOLOGY RESEARCH CENTER filed Critical BEIJING AGRICULTURAL QUALITY STANDARDS AND TESTING TECHNOLOGY RESEARCH CENTER
Priority to CN201810401774.3A priority Critical patent/CN108681697B/en
Publication of CN108681697A publication Critical patent/CN108681697A/en
Application granted granted Critical
Publication of CN108681697B publication Critical patent/CN108681697B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/08Feature extraction

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Investigating Or Analysing Materials By Optical Means (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)

Abstract

本发明实施例公开一种特征选择方法及装置,能实现用于光谱无损检测中目标物测定的特征的选择,具有较好的鲁棒性和稳定性。方法包括:S1、获取样品的光谱数据集;S2、对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;S3、根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。

The embodiment of the present invention discloses a feature selection method and device, which can realize feature selection for target object measurement in spectral nondestructive testing, and have good robustness and stability. The method includes: S1, obtaining the spectral data set of the sample; S2, sampling the spectral data set for a first number of times to obtain the first number of sample spaces, and for each sample space, using the sample space to construct a partial The least squares quantitative analysis model, and based on the partial least squares quantitative analysis model, the importance of the features corresponding to the sample space is sorted; S3, according to the importance sorting results of the features corresponding to the first number of sample spaces The features corresponding to the first number of sample spaces are sorted to obtain a feature sorting result, the number of feature selections is determined based on the feature sorting results, and the features of the aforementioned feature selection number are selected as target features according to the feature sorting results.

Description

特征选择方法及装置Feature selection method and device

技术领域technical field

本发明实施例涉及光谱分析领域,具体涉及一种特征选择方法及装置。Embodiments of the present invention relate to the field of spectral analysis, and in particular to a feature selection method and device.

背景技术Background technique

基于光谱的无损检测技术具有操作简单、快速、无损、无需前处理和辅助试剂等优点,在农业、化工、环境、医学等领域应用广泛。但是,获取的光谱数据通常具有特征波长数量多、样本数量小、特征之间存在共线性等特点,使得直接利用全体特征建立模型效果欠佳。已有研究表明通过特征选择方法对波长进行选择后再建立模型,具有提高模型精度、简化模型复杂度、加快模型运算速度、增强模型可解释性等优点。Spectrum-based non-destructive testing technology has the advantages of simple operation, fast, non-destructive, no need for pretreatment and auxiliary reagents, and is widely used in agriculture, chemical industry, environment, medicine and other fields. However, the obtained spectral data usually has the characteristics of large number of characteristic wavelengths, small sample size, and collinearity between features, which makes the direct use of all features to establish a model ineffective. Existing studies have shown that the feature selection method is used to select the wavelength and then build the model, which has the advantages of improving model accuracy, simplifying model complexity, speeding up model operation speed, and enhancing model interpretability.

在光谱分析领域常见的特征选择方法包括:无信息变量消除法、竞争自适应权重法、连续投影算法、区间偏最小二乘法、以及基于最优化(遗传算法、模拟退火、粒子群等)的特征选择方法等。这些方法都是单特征选择方法,当数据集发生局部少量变异时,所选特征变化较大,稳定性欠佳。Common feature selection methods in the field of spectral analysis include: non-informative variable elimination method, competitive adaptive weight method, continuous projection algorithm, interval partial least squares method, and features based on optimization (genetic algorithm, simulated annealing, particle swarm, etc.) selection method etc. These methods are all single-feature selection methods. When a small amount of local variation occurs in the data set, the selected features change greatly and the stability is not good.

发明内容Contents of the invention

针对现有技术存在的不足和缺陷,本发明实施例提供一种特征选择方法及装置。Aiming at the deficiencies and defects of the prior art, embodiments of the present invention provide a feature selection method and device.

一方面,本发明实施例提出一种特征选择方法,包括:On the one hand, the embodiment of the present invention proposes a feature selection method, including:

S1、获取样品的光谱数据集,其中,所述样品的光谱数据集包括特定数量的样品的光谱数据,以及所述样品中目标物的含量;S1. Obtain the spectral data set of the sample, wherein the spectral data set of the sample includes spectral data of a specific number of samples, and the content of the target substance in the sample;

S2、对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;S2. Sampling the spectral data set for the first number of times to obtain the first number of sample spaces. For each sample space, use the sample space to construct a partial least squares quantitative analysis model, and based on the partial minimum The quadratic quantitative analysis model ranks the importance of the features corresponding to the sample space;

S3、根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。S3. Sorting the features corresponding to the first number of sample spaces according to the importance ranking results of the features corresponding to the first number of sample spaces, obtaining a feature ranking result, and determining the number of feature selections based on the feature ranking result, And according to the feature sorting result, select the number of features mentioned above as the target features.

另一方面,本发明实施例提出一种特征选择装置,包括:On the other hand, an embodiment of the present invention proposes a feature selection device, including:

获取单元,用于获取样品的光谱数据集,其中,所述样品的光谱数据集包括特定数量的样品的光谱数据,以及所述样品中目标物的含量;an acquisition unit, configured to acquire a spectral data set of a sample, wherein the spectral data set of the sample includes spectral data of a specific number of samples, and the content of the target substance in the sample;

排序单元,用于对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;The sorting unit is used to sample the spectral data set for a first number of times to obtain the first number of sample spaces, and for each sample space, use the sample space to construct a partial least squares quantitative analysis model, and based on The partial least squares quantitative analysis model ranks the importance of the features corresponding to the sample space;

选择单元,用于根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。A selection unit, configured to sort the features corresponding to the first number of sample spaces according to the importance ranking results of the features corresponding to the first number of sample spaces, to obtain a feature ranking result, and determine the feature based on the feature ranking result Select the quantity, and select the above-mentioned features according to the feature sorting result to select the number of features as the target features.

第三方面,本发明实施例提供一种电子设备,包括:处理器、存储器、总线及存储在存储器上并可在处理器上运行的计算机程序;In a third aspect, an embodiment of the present invention provides an electronic device, including: a processor, a memory, a bus, and a computer program stored in the memory and operable on the processor;

其中,所述处理器,存储器通过所述总线完成相互间的通信;Wherein, the processor and the memory complete the mutual communication through the bus;

所述处理器执行所述计算机程序时实现上述方法。The above method is implemented when the processor executes the computer program.

第四方面,本发明实施例提供一种非暂态计算机可读存储介质,所述存储介质上存储有计算机程序,该计算机程序被处理器执行时实现上述方法。In a fourth aspect, an embodiment of the present invention provides a non-transitory computer-readable storage medium, where a computer program is stored on the storage medium, and the above method is implemented when the computer program is executed by a processor.

本发明实施例提供的特征选择方法及装置,首先获取样品的光谱数据集;然后对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;最后根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征,相较于单特征选择方法,本方案当数据集发生局部少量变异时,所选特征变化不大,稳定性较好,即能实现用于光谱无损检测中目标物测定的特征的选择,具有较好的鲁棒性和稳定性,另外,以偏最小二乘法为定量分析模型,通过实验发现在实验数据集上,使用本方法进行特征选择后所构建模型精度优于利用全部特征所构建模型精度。In the feature selection method and device provided in the embodiments of the present invention, the spectral data set of the sample is first obtained; then the spectral data set is sampled a first number of times to obtain the first number of sample spaces, and for each sample space, Using the sample space to construct a partial least squares quantitative analysis model, and based on the partial least squares quantitative analysis model, the importance of the features corresponding to the sample space is sorted; finally according to the features corresponding to the first number of sample spaces Sorting the features corresponding to the first number of sample spaces to obtain the feature ranking results, determining the number of feature selections based on the feature ranking results, and selecting the aforementioned feature selection numbers according to the feature ranking results A feature is used as the target feature. Compared with the single feature selection method, when a small amount of local variation occurs in the data set in this scheme, the selected feature does not change much, and the stability is better, that is, it can realize the detection of target objects in spectral nondestructive testing. The selection of features has good robustness and stability. In addition, using the partial least squares method as a quantitative analysis model, it is found through experiments that on the experimental data set, the accuracy of the model constructed by using this method for feature selection is better than that of using The accuracy of the model constructed by all features.

附图说明Description of drawings

图1为本发明特征选择方法一实施例的流程示意图;Fig. 1 is a schematic flow chart of an embodiment of the feature selection method of the present invention;

图2为本发明特征选择装置一实施例的结构示意图;2 is a schematic structural view of an embodiment of the feature selection device of the present invention;

图3为本发明实施例提供的一种电子设备的实体结构示意图。FIG. 3 is a schematic diagram of a physical structure of an electronic device provided by an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明实施例保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are the Some, but not all, embodiments are invented. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the embodiments of the present invention.

参看图1,本实施例公开一种特征选择方法,包括:Referring to Fig. 1, the present embodiment discloses a feature selection method, including:

S1、获取样品的光谱数据集,其中,所述样品的光谱数据集包括特定数量的样品的光谱数据,以及所述样品中目标物的含量;S1. Obtain the spectral data set of the sample, wherein the spectral data set of the sample includes spectral data of a specific number of samples, and the content of the target substance in the sample;

S2、对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征(即波长)进行重要性排序;S2. Sampling the spectral data set for the first number of times to obtain the first number of sample spaces. For each sample space, use the sample space to construct a partial least squares quantitative analysis model, and based on the partial minimum The square quantitative analysis model ranks the importance of the features (ie wavelength) corresponding to the sample space;

本实施例中,利用该样本空间构建一个偏最小二乘定量分析模型,具体过程为:选取该样本空间中的一部分样本作为训练集对偏最小二乘定量分析模型进行训练,选择该样本空间中的剩余样本作为测试集对训练得到的偏最小二乘定量分析模型进行测试,并基于测试结果计算偏最小二乘定量分析模型的均方根误差;重复前述过程,直至得到的偏最小二乘定量分析模型的均方根误差在预设的范围内,此时该均方根误差对应的偏最小二乘定量分析模型即为要构建的偏最小二乘定量分析模型。In this embodiment, the sample space is used to build a partial least squares quantitative analysis model. The specific process is: select a part of the samples in the sample space as the training set to train the partial least squares quantitative analysis model, and select the partial least squares quantitative analysis model in the sample space. The remaining samples of the partial least squares quantitative analysis model are used as the test set to test the trained partial least squares quantitative analysis model, and the root mean square error of the partial least squares quantitative analysis model is calculated based on the test results; the above process is repeated until the partial least squares quantitative analysis model is obtained. The root mean square error of the analysis model is within a preset range, and the partial least squares quantitative analysis model corresponding to the root mean square error is the partial least squares quantitative analysis model to be constructed.

S3、根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。S3. Sorting the features corresponding to the first number of sample spaces according to the importance ranking results of the features corresponding to the first number of sample spaces, obtaining a feature ranking result, and determining the number of feature selections based on the feature ranking result, And according to the feature sorting result, select the number of features mentioned above as the target features.

本发明实施例提供的特征选择方法,首先获取样品的光谱数据集;然后对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;最后根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征,相较于单特征选择方法,本方案当数据集发生局部少量变异时,所选特征变化不大,稳定性较好,即能实现用于光谱无损检测中目标物测定的特征的选择,具有较好的鲁棒性和稳定性,另外,以偏最小二乘法为定量分析模型,通过实验发现在实验数据集上,使用本方法进行特征选择后所构建模型精度优于利用全部特征所构建模型精度。In the feature selection method provided by the embodiment of the present invention, the spectral data set of the sample is first obtained; then the spectral data set is sampled a first number of times to obtain the first number of sample spaces, and for each sample space, the Constructing a partial least squares quantitative analysis model in the sample space, and sorting the importance of the features corresponding to the sample space based on the partial least squares quantitative analysis model; finally according to the importance of the features corresponding to the first number of sample spaces sort the features corresponding to the first number of sample spaces to obtain a feature sorting result, determine the number of feature selections based on the feature sorting results, and select the previously described feature selection number of features according to the feature sorting results As the target feature, compared with the single feature selection method, when the data set has a small amount of local variation, the selected feature does not change much and the stability is better, that is, it can realize the feature used for the determination of the target object in the spectral non-destructive testing. selection, has good robustness and stability. In addition, using the partial least squares method as the quantitative analysis model, it is found through experiments that on the experimental data set, the accuracy of the model constructed by using this method for feature selection is better than that of using all features. The accuracy of the built model.

在前述方法实施例的基础上,所述采样可以使用bootstrap重采样方法,每次采样为放回采样,每个样本空间的容量均与所述光谱数据集的容量相同。On the basis of the foregoing method embodiments, the sampling may use a bootstrap resampling method, each sampling is a replacement sampling, and the capacity of each sample space is the same as that of the spectral data set.

在前述方法实施例的基础上,所述基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序,可以包括:On the basis of the foregoing method embodiments, the importance ranking of the features corresponding to the sample space based on the partial least squares quantitative analysis model may include:

获取该偏最小二乘定量分析模型中每个特征所对应的系数,通过对所述系数的绝对值按照从大到小的顺序进行排序,获得该样本空间下的特征排序向量。The coefficient corresponding to each feature in the partial least squares quantitative analysis model is obtained, and the feature ranking vector in the sample space is obtained by sorting the absolute values of the coefficients in descending order.

本实施例中,该样本空间下的特征排序向量由该样本空间下的各个特征的重要性表征量组成,每个特征的重要性表征量为该特征所对应的系数的绝对值。In this embodiment, the feature ranking vector in the sample space is composed of the importance characterization of each feature in the sample space, and the importance characterization of each feature is the absolute value of the coefficient corresponding to the feature.

在前述方法实施例的基础上,所述根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,可以包括:On the basis of the foregoing method embodiments, the ranking of the features corresponding to the first number of sample spaces according to the importance ranking results of the features corresponding to the first number of sample spaces to obtain the feature ranking results may include :

利用线性加权求和方法集成各个样本空间下的特征排序向量,得到所述特征排序结果。A linear weighted sum method is used to integrate the feature ranking vectors in each sample space to obtain the feature ranking result.

本实施例中,所述的利用线性加权求和方法集成各个样本空间下的特征排序向量,得到所述特征排序结果的具体过程为:对于每一个特征,对该特征在所有特征排序向量中对应的值进行加权求和,其中,该特征在每个特征排序向量中对应的值的权重基于由该特征排序向量对应的样本空间确定的偏最小二乘定量分析模型得到,具体计算过程为:首先,分别利用各个样本空间构建偏最小二乘定量分析模型,计算各个样本空间构建得到的偏最小二乘定量分析模型的均方根误差值,第i个样本空间构建得到的偏最小二乘定量分析模型的均方根误差值记为Ei,该值越小表示模型正确率越高;其次,计算Ei的倒数,记为Oi;再次,对这些Oi求和,得到SUM;最后,将每个Oi除以SUM,得到权重Wi,即为各个特征在第i个样本空间下的特征排序向量中对应的值的权重;按照前一步得到的各个特征对应的求和结果的大小,对各个特征进行排序,得到所述特征排序结果,所述特征排序结果中对应的求和结果越大的特征排在越靠前的位置。In this embodiment, the specific process of using the linear weighted summation method to integrate the feature ranking vectors under each sample space to obtain the feature ranking result is: for each feature, corresponding to the feature in all feature ranking vectors The weighted summation of the values of the features, where the weight of the corresponding value of the feature in each feature sorting vector is obtained based on the partial least squares quantitative analysis model determined by the sample space corresponding to the feature sorting vector. The specific calculation process is: first , using each sample space to construct a partial least squares quantitative analysis model, calculate the root mean square error value of the partial least squares quantitative analysis model constructed by each sample space, and the partial least squares quantitative analysis model obtained by the ith sample space The root mean square error value of the model is recorded as E i , the smaller the value, the higher the correct rate of the model; secondly, calculate the reciprocal of E i and record it as O i ; thirdly, sum these O i to get SUM; finally, Divide each O i by SUM to get the weight W i , which is the weight of the corresponding value of each feature in the feature sorting vector under the i-th sample space; according to the size of the summation result corresponding to each feature obtained in the previous step , sort each feature to obtain the feature sorting result, and the feature with a larger summation result in the feature sorting result is ranked at a higher position.

在前述方法实施例的基础上,所述基于所述特征排序结果确定特征选择数量,可以包括:On the basis of the foregoing method embodiments, the determination of the number of feature selections based on the feature ranking results may include:

通过设置所述特征选择数量为不同的值,并基于每次设置的值,依据所述特征排序结果在所述光谱数据集上选择该次设置的值个特征,基于特征选择结果构建偏最小二乘定量分析模型;By setting the number of feature selections to different values, and based on the values set each time, select the set values and features on the spectral data set according to the feature sorting results, and construct a partial least squares based on the feature selection results. Multiply the quantitative analysis model;

选取前一步得到的各个最小二乘定量分析模型中对应的均方根误差最小的最小二乘定量分析模型所对应的特征选择数量为目的值。Select the number of feature selections corresponding to the least squares quantitative analysis model with the smallest root mean square error among the least squares quantitative analysis models obtained in the previous step as the target value.

本实施例中,选取所述目的值可以采用交叉验证方法。In this embodiment, a cross-validation method may be used for selecting the target value.

在前述方法实施例的基础上,所述不同的值中每两个相邻的值相差可以为1。Based on the foregoing method embodiments, the difference between every two adjacent values among the different values may be 1.

本实施例中,所述不同的值的数量可以与所述光谱数据集的容量相同。In this embodiment, the number of different values may be the same as the capacity of the spectral data set.

参看图2,本实施例公开一种特征选择装置,包括:Referring to Figure 2, this embodiment discloses a feature selection device, including:

获取单元1,用于获取样品的光谱数据集,其中,所述样品的光谱数据集包括特定数量的样品的光谱数据,以及所述样品中目标物的含量;An acquisition unit 1, configured to acquire a spectral data set of a sample, wherein the spectral data set of the sample includes spectral data of a specific number of samples, and the content of the target substance in the sample;

排序单元2,用于对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;A sorting unit 2, configured to sample the spectral data set a first number of times to obtain the first number of sample spaces, and for each sample space, use the sample space to construct a partial least squares quantitative analysis model, and Based on the partial least squares quantitative analysis model, the importance of the features corresponding to the sample space is sorted;

选择单元3,用于根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。The selection unit 3 is configured to sort the features corresponding to the first number of sample spaces according to the importance ranking results of the features corresponding to the first number of sample spaces, to obtain a feature ranking result, and determine based on the feature ranking result Select the number of features, and select the features of the aforementioned feature selection number as the target features according to the sorting result of the features.

具体地,所述获取单元1获取样品的光谱数据集,其中,所述样品的光谱数据集包括特定数量的样品的光谱数据,以及所述样品中目标物的含量;所述排序单元2对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;所述选择单元3根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。Specifically, the acquisition unit 1 acquires the spectral data set of the sample, wherein the spectral data set of the sample includes the spectral data of a specific number of samples, and the content of the target substance in the sample; the sorting unit 2 sorts the The spectral data set is sampled for a first number of times to obtain the first number of sample spaces. For each sample space, a partial least squares quantitative analysis model is constructed using the sample space, and based on the partial least squares quantitative analysis The model sorts the importance of the features corresponding to the sample space; the selection unit 3 sorts the features corresponding to the first number of sample spaces according to the importance ranking results of the features corresponding to the first number of sample spaces , to obtain the feature ranking result, determine the number of feature selections based on the feature ranking result, and select the features of the aforementioned feature selection number as the target features according to the feature ranking result.

本发明实施例提供的特征选择装置,首先获取样品的光谱数据集;然后对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;最后根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征,相较于单特征选择方法,本方案当数据集发生局部少量变异时,所选特征变化不大,稳定性较好,即能实现用于光谱无损检测中目标物测定的特征的选择,具有较好的鲁棒性和稳定性,另外,以偏最小二乘法为定量分析模型,通过实验发现在实验数据集上,使用本方法进行特征选择后所构建模型精度优于利用全部特征所构建模型精度。The feature selection device provided by the embodiment of the present invention first acquires the spectral data set of the sample; then performs a first number of samples on the spectral data set to obtain the first number of sample spaces, and for each sample space, use the Constructing a partial least squares quantitative analysis model in the sample space, and sorting the importance of the features corresponding to the sample space based on the partial least squares quantitative analysis model; finally according to the importance of the features corresponding to the first number of sample spaces sort the features corresponding to the first number of sample spaces to obtain a feature sorting result, determine the number of feature selections based on the feature sorting results, and select the previously described feature selection number of features according to the feature sorting results As the target feature, compared with the single feature selection method, when the data set has a small amount of local variation, the selected feature does not change much and the stability is better, that is, it can realize the feature used for the determination of the target object in the spectral non-destructive testing. selection, has good robustness and stability. In addition, using the partial least squares method as the quantitative analysis model, it is found through experiments that on the experimental data set, the accuracy of the model constructed by using this method for feature selection is better than that of using all features. The accuracy of the built model.

在前述装置实施例的基础上,所述选择单元,具体可以用于:On the basis of the foregoing device embodiments, the selection unit may specifically be used for:

通过设置所述特征选择数量为不同的值,并基于每次设置的值,依据所述特征排序结果在所述光谱数据集上选择该次设置的值个特征,基于特征选择结果构建偏最小二乘定量分析模型;By setting the number of feature selections to different values, and based on the values set each time, select the set values and features on the spectral data set according to the feature sorting results, and construct a partial least squares based on the feature selection results. Multiply the quantitative analysis model;

选取前一步得到的各个最小二乘定量分析模型中对应的均方根误差最小的最小二乘定量分析模型所对应的特征选择数量为目的值。Select the number of feature selections corresponding to the least squares quantitative analysis model with the smallest root mean square error among the least squares quantitative analysis models obtained in the previous step as the target value.

本实施例的特征选择装置,可以用于执行前述方法实施例的技术方案,其实现原理和技术效果类似,此处不再赘述。The feature selection device of this embodiment can be used to implement the technical solutions of the foregoing method embodiments, and its implementation principles and technical effects are similar, and will not be repeated here.

本发明的优点是:将集成学习思想引入到特征选择领域,应用于样品光谱定量分析中特征波长的选取,本方法基于偏最小二乘算法,收敛速度快,可并行,稳定性高,且基于选择后的波长建立模型要优于全波段预测效果,为光谱波长选择提供了一种新方法,具有较高的实用价值。The advantages of the present invention are: the idea of integrated learning is introduced into the field of feature selection, and it is applied to the selection of characteristic wavelengths in the quantitative analysis of sample spectra. The established model of the selected wavelength is better than the prediction effect of the whole band, which provides a new method for spectral wavelength selection and has high practical value.

图3示出了本发明实施例提供的一种电子设备的实体结构示意图,如图3所示,该电子设备可以包括:处理器11、存储器12、总线13及存储在存储器12上并可在处理器11上运行的计算机程序;FIG. 3 shows a schematic diagram of the physical structure of an electronic device provided by an embodiment of the present invention. As shown in FIG. 3 , the electronic device may include: a processor 11, a memory 12, a bus 13, and a A computer program running on the processor 11;

其中,所述处理器11,存储器12通过所述总线13完成相互间的通信;Wherein, the processor 11 and the memory 12 complete mutual communication through the bus 13;

所述处理器11执行所述计算机程序时实现上述各方法实施例所提供的方法,例如包括:获取样品的光谱数据集;对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。When the processor 11 executes the computer program, it implements the methods provided in the above method embodiments, for example, including: acquiring a spectral data set of a sample; sampling the spectral data set a first number of times to obtain the first number of sample spaces, for each sample space, use the sample space to construct a partial least squares quantitative analysis model, and based on the partial least squares quantitative analysis model, sort the importance of the features corresponding to the sample space; according to the The importance sorting results of the features corresponding to the first number of sample spaces sort the features corresponding to the first number of sample spaces to obtain the feature sorting results, determine the number of feature selections based on the feature sorting results, and according to the described Feature sorting result selection The aforementioned feature selection quantity features are used as target features.

本发明实施例提供一种非暂态计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现上述各方法实施例所提供的方法,例如包括:获取样品的光谱数据集;对所述光谱数据集进行第一数量次采样,获得所述第一数量个样本空间,对于每一个样本空间,利用该样本空间构建一个偏最小二乘定量分析模型,并基于该偏最小二乘定量分析模型对该样本空间所对应的特征进行重要性排序;根据所述第一数量个样本空间对应的特征的重要性排序结果对所述第一数量个样本空间对应的特征进行排序,得到特征排序结果,基于所述特征排序结果确定特征选择数量,并按照所述特征排序结果选择前所述特征选择数量个特征作为目标特征。An embodiment of the present invention provides a non-transitory computer-readable storage medium, on which a computer program is stored. When the computer program is executed by a processor, the methods provided in the above-mentioned method embodiments are implemented, for example, including: acquiring spectral data of a sample set; the spectral data set is sampled for the first number of times to obtain the first number of sample spaces, and for each sample space, a partial least squares quantitative analysis model is constructed using the sample space, and based on the partial minimum The quadratic quantitative analysis model sorts the importance of the features corresponding to the sample space; sorts the features corresponding to the first number of sample spaces according to the importance sorting results of the features corresponding to the first number of sample spaces, Obtain the feature sorting result, determine the number of feature selections based on the feature sorting result, and select the features of the aforementioned feature selection number as target features according to the feature sorting result.

本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present application may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.

本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to flowcharts and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present application. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions may be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing equipment produce a An apparatus for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means, the instructions The device realizes the function specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, thereby The instructions provide steps for implementing the functions specified in the flow chart or blocks of the flowchart and/or the block or blocks of the block diagrams.

需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。术语“上”、“下”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。除非另有明确的规定和限定,术语“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的连通。对于本领域的普通技术人员而言,可以根据具体情况理解上述术语在本发明中的具体含义。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that there is a relationship between these entities or operations. There is no such actual relationship or order between them. Furthermore, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, but also includes elements not expressly listed. other elements of or also include elements inherent in such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element. The orientation or positional relationship indicated by the terms "upper", "lower", etc. is based on the orientation or positional relationship shown in the drawings, and is only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the referred device or element must Having a particular orientation, being constructed and operating in a particular orientation, and therefore not to be construed as limiting the invention. Unless otherwise clearly specified and limited, the terms "installation", "connection" and "connection" should be interpreted in a broad sense, for example, it may be a fixed connection, a detachable connection, or an integral connection; it may be a mechanical connection, It can also be an electrical connection; it can be a direct connection, or an indirect connection through an intermediary, or an internal communication between two components. Those of ordinary skill in the art can understand the specific meanings of the above terms in the present invention according to specific situations.

本发明的说明书中,说明了大量具体细节。然而能够理解的是,本发明的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。类似地,应当理解,为了精简本发明公开并帮助理解各个发明方面中的一个或多个,在上面对本发明的示例性实施例的描述中,本发明的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释呈反映如下意图:即所要求保护的本发明要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施方式的权利要求书由此明确地并入该具体实施方式,其中每个权利要求本身都作为本发明的单独实施例。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。本发明并不局限于任何单一的方面,也不局限于任何单一的实施例,也不局限于这些方面和/或实施例的任意组合和/或置换。而且,可以单独使用本发明的每个方面和/或实施例或者与一个或更多其他方面和/或其实施例结合使用。In the description of the invention, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure the understanding of this description. Similarly, it should be appreciated that in the above description of exemplary embodiments of the invention, in order to streamline the present disclosure and to facilitate understanding of one or more of the various inventive aspects, various features of the invention are sometimes grouped together into a single embodiment , figure, or description of it. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the Detailed Description are hereby expressly incorporated into this Detailed Description, with each claim standing on its own as a separate embodiment of this invention. It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention is not limited to any single aspect, nor to any single embodiment, nor to any combination and/or permutation of these aspects and/or embodiments. Furthermore, each aspect and/or embodiment of the invention may be used alone or in combination with one or more other aspects and/or embodiments thereof.

最后应说明的是:以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围,其均应涵盖在本发明的权利要求和说明书的范围当中。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present invention, rather than limiting them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: It is still possible to modify the technical solutions described in the foregoing embodiments, or perform equivalent replacements for some or all of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the various embodiments of the present invention. All of them should be covered by the scope of the claims and description of the present invention.

Claims (10)

1. a kind of feature selection approach, which is characterized in that including:
S1, the spectroscopic data collection for obtaining sample, wherein the spectroscopic data collection of the sample includes the spectrum of certain amount of sample The content of object in data and the sample;
S2, the first quantity time sampling is carried out to the spectroscopic data collection, the first quantity sample space is obtained, for each A sample space builds an offset minimum binary Quantitative Analysis Model using the sample space, and fixed based on the offset minimum binary It measures analysis model and importance ranking is carried out to the feature corresponding to the sample space;
S3, according to the importance ranking result of the corresponding feature of the first quantity sample space to the first quantity sample The corresponding feature in this space is ranked up, and obtains feature ordering as a result, determining feature selecting number based on the feature ordering result Amount, and select the preceding feature selecting quantity feature as target signature according to the feature ordering result.
2. according to the method described in claim 1, it is characterized in that, the sampling uses bootstrap method for resampling, every time It is sampled as putting back to sampling, the capacity of each sample space is identical as the capacity of spectroscopic data collection.
3. according to the method described in claim 2, it is characterized in that, the described offset minimum binary Quantitative Analysis Model that is based on is to this Feature corresponding to sample space carries out importance ranking, including:
The coefficient corresponding to each feature in the offset minimum binary Quantitative Analysis Model is obtained, the absolute value to the coefficient is passed through It is ranked up according to sequence from big to small, obtains the feature ordering vector under the sample space.
4. according to the method described in claim 3, it is characterized in that, described corresponding according to the first quantity sample space The importance ranking result of feature is ranked up the corresponding feature of the first quantity sample space, obtains feature ordering knot Fruit, including:
The feature ordering summed under each sample space of method integration using linear weighted function is vectorial, obtains the feature ordering knot Fruit.
5. according to the method described in claim 1, it is characterized in that, described determine feature selecting based on the feature ordering result Quantity, including:
It is different values by the way that the feature selecting quantity is arranged, and based on the value being arranged every time, according to the feature ordering knot Fruit selects the value feature of the secondary setting on the spectroscopic data collection, and it is quantitative that feature based selection result builds offset minimum binary Analysis model;
Choose the least square of corresponding root-mean-square error minimum in each least square Quantitative Analysis Model that back obtains It is worth for the purpose of feature selecting quantity corresponding to Quantitative Analysis Model.
6. according to the method described in claim 1, it is characterized in that, each two adjacent value differs 1 in the different value.
7. a kind of feature selecting device, which is characterized in that including:
Acquiring unit, the spectroscopic data collection for obtaining sample, wherein the spectroscopic data collection of the sample includes certain amount of The content of object in the spectroscopic data of sample and the sample;
It is empty to obtain the first quantity sample for carrying out the first quantity time sampling to the spectroscopic data collection for sequencing unit Between, for each sample space, an offset minimum binary Quantitative Analysis Model is built using the sample space, and partially based on this Least square Quantitative Analysis Model carries out importance ranking to the feature corresponding to the sample space;
Selecting unit, for according to the importance ranking result of the corresponding feature of the first quantity sample space to described the The corresponding feature of one quantity sample space is ranked up, and obtains feature ordering as a result, being determined based on the feature ordering result Feature selecting quantity, and select the preceding feature selecting quantity feature as target signature according to the feature ordering result.
8. device according to claim 7, which is characterized in that the selecting unit is specifically used for:
It is different values by the way that the feature selecting quantity is arranged, and based on the value being arranged every time, according to the feature ordering knot Fruit selects the value feature of the secondary setting on the spectroscopic data collection, and it is quantitative that feature based selection result builds offset minimum binary Analysis model;
Choose the least square of corresponding root-mean-square error minimum in each least square Quantitative Analysis Model that back obtains It is worth for the purpose of feature selecting quantity corresponding to Quantitative Analysis Model.
9. a kind of electronic equipment, which is characterized in that including:Processor, memory, bus and storage on a memory and can located The computer program run on reason device;
Wherein, the processor, memory complete mutual communication by the bus;
The processor realizes the method as described in any one of claim 1-6 when executing the computer program.
10. a kind of non-transient computer readable storage medium, which is characterized in that be stored with computer journey on the storage medium Sequence realizes the method as described in any one of claim 1-6 when the computer program is executed by processor.
CN201810401774.3A 2018-04-28 2018-04-28 Feature selection method and device Expired - Fee Related CN108681697B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810401774.3A CN108681697B (en) 2018-04-28 2018-04-28 Feature selection method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810401774.3A CN108681697B (en) 2018-04-28 2018-04-28 Feature selection method and device

Publications (2)

Publication Number Publication Date
CN108681697A true CN108681697A (en) 2018-10-19
CN108681697B CN108681697B (en) 2021-03-23

Family

ID=63802758

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810401774.3A Expired - Fee Related CN108681697B (en) 2018-04-28 2018-04-28 Feature selection method and device

Country Status (1)

Country Link
CN (1) CN108681697B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109856060A (en) * 2019-03-15 2019-06-07 首都师范大学 The detection method and system of synthetic dyestuff concentration in assembled alcoholic drinks
CN111175243A (en) * 2019-12-31 2020-05-19 汉谷云智(武汉)科技有限公司 Method and system for quickly selecting spectral interval
CN112819062A (en) * 2021-01-26 2021-05-18 淮阴工学院 Fluorescence spectrum quadratic characteristic selection method based on mixed particle swarm and continuous projection
CN113567375A (en) * 2021-07-29 2021-10-29 中南大学 Self-adaptive multi-metal ion concentration regression prediction method and system based on linear feature separation
CN113919510A (en) * 2021-11-01 2022-01-11 上海勃池信息技术有限公司 Sample feature selection method, device, equipment and medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103472013A (en) * 2013-06-09 2013-12-25 温州大学 Visible-near infrared spectrum PLS-DA modeling method combining Adaboost algorithm
CN103528990B (en) * 2013-10-31 2017-07-28 天津工业大学 A kind of multi-model Modeling Method of near infrared spectrum
CN105630743B (en) * 2015-12-24 2018-05-01 浙江大学 A kind of system of selection of spectrum wave number
CN106053429B (en) * 2016-05-27 2019-04-05 福建师范大学 Urine modified nucleoside determination method based on surface enhanced resonance raman spectra
CN106770007B (en) * 2016-11-29 2019-06-21 中国石油大学(华东) A Method for Selecting Feature Wavelengths of Near Infrared Spectrum Based on Support Vector Machine Model
CN106644983B (en) * 2016-12-28 2019-12-31 浙江大学 A Spectral Wavelength Selection Method Based on PLS-VIP-ACO Algorithm

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109856060A (en) * 2019-03-15 2019-06-07 首都师范大学 The detection method and system of synthetic dyestuff concentration in assembled alcoholic drinks
CN111175243A (en) * 2019-12-31 2020-05-19 汉谷云智(武汉)科技有限公司 Method and system for quickly selecting spectral interval
CN112819062A (en) * 2021-01-26 2021-05-18 淮阴工学院 Fluorescence spectrum quadratic characteristic selection method based on mixed particle swarm and continuous projection
CN113567375A (en) * 2021-07-29 2021-10-29 中南大学 Self-adaptive multi-metal ion concentration regression prediction method and system based on linear feature separation
CN113567375B (en) * 2021-07-29 2022-05-10 中南大学 An adaptive multi-metal ion concentration regression prediction method and system based on linear feature separation
CN113919510A (en) * 2021-11-01 2022-01-11 上海勃池信息技术有限公司 Sample feature selection method, device, equipment and medium

Also Published As

Publication number Publication date
CN108681697B (en) 2021-03-23

Similar Documents

Publication Publication Date Title
CN110687072B (en) A Selection and Modeling Method of Calibration Set and Validation Set Based on Spectral Similarity
CN108681697A (en) Feature selection approach and device
Maillet et al. COMMET: comparing and combining multiple metagenomic datasets
CN107478580A (en) Heavy metal content in soil evaluation method and device based on high-spectrum remote-sensing
JP6602818B2 (en) Method for predicting chemical shift values of NMR spin systems in fluid class samples, especially biological fluid samples
CN109902411B (en) Soil heavy metal content detection modeling method and device, and detection method and device
JP7482782B2 (en) Sequence-based protein structure and properties determination
JP7063389B2 (en) Processing equipment, processing methods, and programs
CN108830253B (en) Screening model establishing method, spectrum screening device and method
CN117789038B (en) A training method for data processing and recognition model based on machine learning
US20210157824A1 (en) Preprocessing for a Classification Algorithm
US9400868B2 (en) Method computer program and system to analyze mass spectra
CN119863694A (en) Performance optimization integrated learning soil salinity prediction precision improvement method
CN117314914B (en) A defect identification method and related equipment for engineering non-destructive inspection images
CN110032762A (en) Heavy metal content in soil detects modeling method and device, detection method and device
CN116150678B (en) A method and device for classifying topaz color based on chromaticity coordinates
CN117079741A (en) A method, device and medium for predicting molecular insulation strength based on neural network
CN113035363B (en) A probability density weighted hybrid sampling method for genetic metabolic disease screening data
EP4356153A1 (en) Systems and methods for provisioning training data to enable neural networks to analyze signals in nmr measurements
CN113130073B (en) Method for selecting influence indexes by utilizing automation mechanism and electronic device
CN111398238A (en) Laser-induced fluorescence spectrum identification method for edible oil doped with castor oil
CN109145887B (en) A Threshold Analysis Method Based on Spectral Latent Variable Confusion Discrimination
JP7056747B2 (en) Information processing equipment, processing equipment, information processing method, processing method, determination method, and program
CN119721298A (en) A rapid estimation method for lithium content in clay
Enshaei et al. Feature selection with targeted projection pursuit

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20220808

Address after: 100097 No. 9 middle garden, Shuguang garden, Beijing, Haidian District

Patentee after: BEIJING ACADEMY OF AGRICULTURE AND FORESTRY SCIENCES

Address before: 1011, seed building, Beijing Academy of agriculture and Forestry Sciences, No. 9, Shuguang garden middle road, Haidian District, Beijing 100097

Patentee before: BEIJING RESEARCH CENTER FOR AGRICULTURAL STANDARDS AND TESTING

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210323