CN107958091A - A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping - Google Patents

A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping Download PDF

Info

Publication number
CN107958091A
CN107958091A CN201711459522.8A CN201711459522A CN107958091A CN 107958091 A CN107958091 A CN 107958091A CN 201711459522 A CN201711459522 A CN 201711459522A CN 107958091 A CN107958091 A CN 107958091A
Authority
CN
China
Prior art keywords
financial
questions
nlp
information
entities
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711459522.8A
Other languages
Chinese (zh)
Inventor
马天平
侯玥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Beta Technology Co Ltd
Original Assignee
Beijing Beta Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Beta Technology Co Ltd filed Critical Beijing Beta Technology Co Ltd
Priority to CN201711459522.8A priority Critical patent/CN107958091A/en
Publication of CN107958091A publication Critical patent/CN107958091A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/06Asset management; Financial planning or analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Operations Research (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Animal Behavior & Ethology (AREA)
  • Game Theory and Decision Science (AREA)
  • Human Resources & Organizations (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping, pass through the financial vertical collection of illustrative plates of foundation, it is associated with NLP, that is, natural language processing, so as to establish a set of perfect intelligent finance problem interactive system, with traditional search, question answering system is different by dynamic response, such a method can carry out active rhetorical question, recommend, excavate the problem of user's deep layer is wanted to ask, in flow, the problem of such a method receives user, first pass through NLP technologies, then semanteme is analyzed, retrieved by semanteme into financial vertical collection of illustrative plates, draw a circle to approve the scene domain of question answering, choose most suitable answer and be pushed to client, the problem of client can so being proposed, problem is better understood from reference to knowledge mapping, it can more accurately retrieve answer at the same time.

Description

NLP artificial intelligence method and interaction system based on financial vertical knowledge graph
Technical Field
The invention relates to the field of artificial intelligence dialogue systems and robot languages.
Background
At present, financial managers generally play a role of shopping guide in the purchasing process of financial products, and customers need continuous communication companions in the whole thinking decision process. Unlike traditional product sales, for financial products, purchasing is only a start and the subsequent process requires more communication to serve as a care companion. However, it is obviously impossible to provide communication in all weather by using manual service to ensure high-quality and efficient service. The rise of AI intelligence makes it possible to technically realize all-weather efficient intelligent service for customers.
In the prior art, the technologies relied on by the man-machine conversation system are mainly divided into the following two types:
the method is based on a question-answer library technology, namely, keywords in sentences are extracted to be matched with a large amount of sorted questions and answers. Many intelligent customer service adopt this kind of method to replace the cost of some manual customer service, and this kind of method disadvantage is that there is no correlation among the massive data, in addition if some sentence word orders are changed, the meaning is totally different, but the matched answer is probably unanimous;
the technology based on the search engine is to screen and answer the questions of the user by using the results returned by the search engine, and on one hand, the technology obtains low-quality responses, and on the other hand, the obtained answers are not controllable.
It can be seen that there are at least the following problems in the prior art:
1. the problems and corresponding answers of the traditional method are relatively independently stored, and all the problems are not related, so that the user problems cannot be effectively presumed through semantics;
2. the traditional method is more lateral to solving common user problems and problems before sale, and most personalized problems after sale cannot be answered accurately;
3. the traditional method is based on template matching, the problem richness is achieved by generalization of artificial mass problems, various questions need to be thought of in the same problem, and a large amount of time and energy are consumed.
Disclosure of Invention
Aiming at the problems in the prior art, the application provides a brand-new solution.
The application discloses an NLP (natural language processing) artificial intelligence method based on a financial vertical knowledge graph to realize intelligent financial problem interaction between a machine and a client, which specifically comprises the following steps:
s1, asking questions, and the client asks financial related questions;
s2, processing the natural language NLP, and processing the problems proposed by the customers through the NLP technology;
s3, semantic analysis and understanding, selecting proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
s4, information retrieval, namely retrieving information obtained through semantic analysis and understanding in the financial vertical map;
and S5, outputting the result, generating answer output according to the search result to answer the question of the client, and guiding and mining the question.
The financial vertical map is produced by combing and storing financial domain knowledge and the relationship between the financial domain knowledge through a database by utilizing a machine learning technology.
Further, the making of the financial vertical map comprises the following steps:
1) preparing related learning materials of the financial field;
2) learning the relevant learning data of the financial field in a semi-supervised mode by means of a machine learning technology;
3) combing the learned financial domain knowledge entities and the relations between the financial domain knowledge entities and storing the learned financial domain knowledge entities in a database to obtain a database;
4) semi-supervised maintenance is carried out on the relationship between the financial field knowledge entities, and synonyms representing the same relationship are aggregated in the entity range related to the relationship;
5) the picture description is added to common entities and used as a retrieval item for selection, extraction and use in the subsequent result output, so that the diversification of interactive answer forms can be improved.
When each entity concept is stored, other existing entities are associated, and finally, a financial vertical map with mutual relations among the entities is formed.
Further, the NLP natural language processing includes the following steps in chronological order:
s2.1, processing data, wherein the processing data comprises the processes of complex and simple conversion, Chinese word segmentation, part of speech tagging, data cleaning, syntax analysis, entity recognition and/or voice character conversion;
s2.2, classifying the data processed problems by the classification model from the three aspects of problem type, user behavior and emotion recognition;
s2.3, extracting information, wherein the information extraction is sequentially carried out according to the steps of extraction based on part of speech tagging, extraction based on semantic analysis and extraction based on speech piece analysis;
the extraction based on Part-of-Speech tagging is called PoS (Part-of-Speech) extraction for short, namely, the extraction is selected according to tagged nouns, verbs, adjectives or other parts-of-Speech;
the extraction based on semantic analysis selects a mode based on industry keywords or an entity list to extract and call information, and the calling method is a common statistical method known in the industry, such as chi-square, information gain, mutual information technology and the like.
And the sentence analysis is used for extracting and calling information in a dependency tree mode, wherein the dependency tree represents the dependency relationship among words in a sentence.
S2.4, completing information, detecting the deletion of sentence structures in the questions proposed by the customers, and completing the missing part;
s2.5 information queue: putting the extracted key information into a queue, removing the key information from the queue after more than 5 times of conversation, and simultaneously resolving and paraphrasing the indicative pronouns in the sentences.
In which conversation and queue specifically means that the key information of the latest conversation spoken by the user is put into the information queue, and the key information entered earlier (for example, 5 rounds ahead) is moved out of the queue. The queue information is used for controlling the semantic range of information retrieval and providing a basis for multi-round problem reply.
Further, the information retrieval comprises the following steps in sequence:
s4.1, classifying the questions, and defining the scene range of the answer of the questions according to the information obtained by semantic analysis and understanding, wherein the types of the questions comprise: special scene type questions, pre-or post-engagement questions, QA (question and answer type) questions.
And S4.2, searching the questions, searching the information obtained by semantic analysis and understanding in the financial vertical map within the range of the circled scene after the question types are determined to obtain required entities and/or relations, and generating answers according to word co-occurrence similarity, sentence pattern matching strategies and/or word sequence consideration modes for sending the answers to the clients.
Word co-occurrence similarity is a statistical-based model, and in a segment of language expression, several words often co-occur (i.e., co-occur) in the same sentence or the same paragraph, so that the words are considered to be related in meaning. The similarity calculation is carried out by adopting the method, and the entity searched by the semantics is calculated by adopting the method, so that the accuracy of the subsequent output result is improved.
The sentence pattern matching strategy has the effect that relevant entities required by the response are extracted through the information retrieval step, but the relevant entities are not a complete sentence. Through the sentence pattern matching strategy, the answer sentence pattern can be correspondingly selected through the known question sentence pattern, and the blank slot filling of the 'information completion' part is multiplexed to generate the answer.
The word order consideration is a supplement of the co-occurrence similarity, after the entities are selected, a group of similar entities is actually selected, the priority ranking of the group of entities is determined according to the matching degree of the searched group of entities and the questions, the higher priority is presented in the answers, and the lower priority is the alternative answers.
Further, the result output comprises outputting the generated answer, the guided and excavated question to the customer, wherein the guided and excavated question is further guided and excavated according to the entity associated with the answer in the financial vertical map, and forms a new question to be output to the customer for the customer to answer or confirm, so that a more real and effective dialogue interaction state can be realized.
In addition, the application also discloses a system for carrying out intelligent financial problem interaction by using the NLP artificial intelligence method based on the financial vertical knowledge graph, which specifically comprises an input module, an NLP natural language processing module, a semantic analysis and understanding module, an information retrieval module and a result output module;
wherein,
the input module is used for receiving financial related questions put forward by a customer and sending the questions to the NLP natural language processing module for processing;
the NLP natural language processing module processes the problems proposed by the customers through an NLP technology;
the semantic analysis and understanding module selects proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
the information retrieval module retrieves the information obtained by semantic analysis and understanding in the financial vertical map;
and the result output module generates answer output according to the retrieval result so as to answer the question of the client, and guides and mines the question.
And the established financial vertical map is associated with NLP (natural language processing), so that a set of complete intelligent financial problem interaction system is established. Different from the passive response of the traditional searching and question-answering system, the method can actively ask a question, recommend and mine questions deeply asked by the user. In the process, the method receives the questions of the user, firstly, the NLP technology is used, then, the semantics are analyzed, the financial vertical map is searched through the semantics, the scene range of question answers is defined, and the most suitable answers are selected and pushed to the client.
By doing so, on one hand, the questions proposed by the customers can be better understood by combining the knowledge graph, and on the other hand, the answers can be more accurately retrieved. A plurality of knowledge points exist in the whole financial activity, the relation of the related knowledge points is combed and appropriately stored by a graph database for subsequent reading and retrieval of an NLP system.
Compared with the prior art, the application has the advantages that:
1. the established financial knowledge map can continuously perfect and enrich entity/relation data in the database, when each entity concept is stored, other existing entities can be intelligently associated, and a financial vertical map with complex relation is finally formed, so that complex personalized problems of customers can be solved, and meanwhile, the diversification of interactive answer forms can be improved by storing different types of data (such as picture information) and a special information retrieval and answer generation output mechanism.
2. Active question asking can be performed, questions deeply asked by the user can be recommended and mined, and the more real and effective conversation interaction state is realized.
3. The client problems can be better and more efficiently understood, the most appropriate answers are accurately retrieved, and the conversation communication efficiency is greatly improved.
Drawings
FIG. 1 is an illustration of an example of a converged knowledge entity in a financial vertical graph stored in a graph database;
FIG. 2 is a general flow diagram of an intelligent dialog;
FIG. 3 is a detailed flow diagram of a financial vertical knowledge-graph based NLP artificial intelligence method to achieve intelligent financial problem interaction between machines and customers.
Detailed Description
For the purpose of full disclosure, the present invention will be described in further detail with reference to the following examples. It should be understood that the specific examples described below are only for illustrating the present invention and are not intended to limit the scope of the present invention.
The application specifically discloses an NLP artificial intelligence method based on a financial vertical knowledge graph to realize intelligent financial problem interaction between a machine and a client, and the method specifically comprises the following steps:
s1, asking questions, and the client asks financial related questions;
s2, processing the natural language NLP, and processing the problems proposed by the customers through the NLP technology;
s3, semantic analysis and understanding, selecting proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
s4, information retrieval, namely retrieving information obtained through semantic analysis and understanding in the financial vertical map;
and S5, outputting the result, generating answer output according to the search result to answer the question of the client, and guiding and mining the question.
The financial vertical map is produced by combing and storing financial domain knowledge and the relationship between the financial domain knowledge through a database by utilizing a machine learning technology.
An example of a financial knowledge entity in a financial vertical as shown in FIG. 1, the production of the financial vertical comprising:
1) preparing financial field-related learning materials, which are materials of bank-related knowledge in the example shown in fig. 1;
2) learning the relevant learning materials of the financial field by using a semi-supervised mode by means of a machine learning technology, wherein in the example, the learning materials contain information of four major lines of China and relevant relations thereof;
3) the learned financial field knowledge entities and the relations between the financial field knowledge entities are combed and stored in a database to obtain a graph database, in the example, learning that 'four major banks in China have China Bank, China Industrial Bank, China agricultural Bank and China construction Bank', 5 entities and 4 groups of relations are learned through the sentence, namely four major banks in China-having-China Bank, China Industrial Bank in China-having-China, four major banks in China-having-China agricultural Bank in China-having-China Bank in China and four major banks in China-having-China construction Bank in China;
4) and performing semi-supervised maintenance on the relationship between the knowledge entities in the financial field, and aggregating synonyms representing the same relationship in the entity range related to the relationship, for example, receiving the financial knowledge again, wherein the four major banks in China comprise a Chinese bank, a Chinese industrial and commercial bank, a Chinese agricultural bank and a Chinese construction bank. New relations including appear among 5 existing entities again, and two semantically existing relations including and including are consistent, can be combined into a relation, and records close relations to the entities, so that the accuracy of machine learning is improved;
5) adding picture description to common entities, using the picture description as a retrieval item for subsequent selection and extraction, and importing related pictures to the common entities for selection and extraction during reply in order to improve the diversity of interactive answer forms by storing the pictures.
When each entity concept is stored, other existing entities are associated, and finally, a financial vertical map with mutual relations among the entities is formed.
As shown in fig. 3, fig. 3 shows a detailed flow of NLP artificial intelligence method based on financial vertical knowledge graph to realize intelligent financial problem interaction between machine and customer,
the NLP natural language processing in step S2 includes the following steps in order:
s2.1, processing data, wherein the processing data comprises the processes of complex and simple conversion, Chinese word segmentation, part of speech tagging, data cleaning, syntax analysis, entity recognition and/or voice character conversion;
the mathematical expression of the method adopted by Chinese word segmentation is as follows:
(T)=(W1)P(W2|W1)P(W3|W2)…P(Wn|Wn-1)
p represents the probability of occurrence, where W1Represents the result of the first word region recognition, so P (W)1) Representing the probability of the first word occurring. P (W)2|W1) Representing the probability of a second word appearing if the first word appears, W can be seen1W2Whether it is a common word. Thus, a sentence with divided words is generated until the end of the sentence. Such as: china | four great | banks | which |?
S2.2, classifying the data processed problems by the classification model from the three aspects of problem type, user behavior and emotion recognition;
the question type is mainly the question to distinguish the user is the 5w type question (where, what, while, why, how), the regular question or the non-question type communication.
User behavior refers to the question that the user makes on which page, under what circumstances.
And the emotion recognition can extract the related emotion words according to the conversation process of the user.
S2.3, extracting information, wherein the information extraction is sequentially carried out according to the steps of extraction based on part of speech tagging, extraction based on semantic analysis and extraction based on speech piece analysis;
the extraction based on Part-of-Speech tagging is called PoS (Part-of-Speech) extraction for short, namely, the extraction is selected according to tagged nouns, verbs, adjectives or other parts-of-Speech;
the extraction based on semantic analysis selects a mode based on industry keywords or an entity list to extract and call information, and the calling method is a common statistical method known in the industry, such as chi-square, information gain, mutual information technology and the like.
And the sentence analysis is used for extracting and calling information in a dependency tree mode, wherein the dependency tree represents the dependency relationship among words in a sentence.
More specifically, the mathematical expression in which information extraction is performed by the chi-square:
the feature extraction algorithm is divided into two categories of feature selection and feature extraction. The chi-square test belongs to the better algorithm in the feature selection algorithm.
t and c are two random variables, chi 2 represents chi-square value for checking the correlation of data, t represents a word, and c represents a category. For example, t may represent a flower and c may represent a plant.
When we find N articles, t is classified as containing and not containing, and c is classified as belonging and not belonging. So that the following tables can be generated, corresponding to the letters in the above formulas, respectively.
Feature selection c1. Belonging to the genus of "plants" c2. Not belonging to "plants" Total of
t1. comprising "flowers" A B A+B
t2. not containing "flowers" C D C+D
Total number of A+C B+D N
The mathematical expression in which information extraction is performed by information gain:
in the case where the feature is that Y is fixed, the conditional entropy of X is H (X | Y), and P (X | Y) is the probability of occurrence. The colloquial explanation is how much the uncertainty of X is reduced after knowing Y, compared to when it is not. The two variables X, Y of the information gain differ in position, which is a means to consider Y as reducing X uncertainty. While the two variables of mutual information are in the same place.
The mathematical expression for extracting information through mutual information is as follows:
g(D,A)=H(D)-H(D|A)
h (D) represents the self information quantity, i.e. the sent information, H (D | A) represents the condition information quantity, and g (D, A) represents the mutual information quantity. The mutual information amount is self information amount-condition information amount. Information extraction is carried out through mutual information, and the internal bonding strength of the character strings is calculated by utilizing the mutual information, so that the extracted information is more complete and is not the state of the keyword. Such as natural language processing, which finds these words as a fixed academic vocabulary through mutual information, and therefore does not break down into three words for understanding.
S2.4, completing information, namely detecting the deletion of a sentence structure in a question proposed by a client (also called slot blank detection), and completing the missing part (also called blank slot filling);
s2.5 information queue: putting the extracted key information into a queue, removing the key information from the queue after more than 5 times of conversation, and simultaneously resolving and paraphrasing the indicative pronouns in the sentences.
In which conversation and queue specifically means that the key information of the latest conversation spoken by the user is put into the information queue, and the key information entered earlier (for example, 5 rounds ahead) is moved out of the queue. The queue information is used for controlling the semantic range of information retrieval and providing a basis for multi-round problem reply.
Wherein the information retrieval in step S4 includes the following steps in order of sequence:
s4.1, classifying the questions, and defining the scene range of the answer of the questions according to the information obtained by semantic analysis and understanding, wherein the types of the questions comprise: special scene type questions, pre-or post-engagement questions, QA (question and answer type) questions.
And S4.2, searching the questions, searching the information obtained by semantic analysis and understanding in the financial vertical map within the range of the circled scene after the question types are determined to obtain required entities and/or relations, and generating answers according to word co-occurrence similarity, sentence pattern matching strategies and/or word sequence consideration modes for sending the answers to the clients.
Co-occurrence similarity calculation mathematical expression:
Similarity(s1,s2)=αSDMG(s1,s2)+βSDMG(s1,s2)+γSDMG(s1,s2)
s1, s2 are two words, α, β, γ are co-occurrence correlations in three respective spatial vectors.
Wherein the outputting of the result of step S4 includes outputting the generated answer, guided and mined question to the customer, wherein the guided and mined question is further guided and mined according to the entity associated with the answer in the financial vertical map to form a new question to be output to the customer for the customer to answer or confirm, so that a more real and effective dialogue interaction state can be realized. For example:
the end user asks: which are the four banks in china?
And (3) dialog answers: the four banks in China all have China Bank, China Industrial and commercial Bank, China agricultural Bank, China construction Bank, do you know? Wherein the bank of the chinese industry and commerce was established in 1984.
The embodiment can be specifically realized by a system for intelligent financial problem interaction by using an NLP artificial intelligence method based on a financial vertical knowledge graph, wherein the system comprises an input module, an NLP natural language processing module, a semantic analysis and understanding module, an information retrieval module and a result output module;
wherein,
the input module is used for receiving financial related questions put forward by a customer and sending the questions to the NLP natural language processing module for processing;
the NLP natural language processing module processes the problems proposed by the customers through an NLP technology;
the semantic analysis and understanding module selects proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
the information retrieval module retrieves the information obtained by semantic analysis and understanding in the financial vertical map;
and the result output module generates answer output according to the retrieval result so as to answer the question of the client, and guides and mines the question.
The above-described example embodiments merely represent embodiments of the present patent, and the description thereof should not be construed as limiting the scope of the patent. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the patent idea, which falls within the protection scope of the present invention. Therefore, the protection scope of this patent shall be subject to the appended claims. The above-mentioned embodiments only express the embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (6)

1. An NLP artificial intelligence method based on a financial vertical knowledge graph is characterized by comprising the following steps:
s1, asking questions, and the client asks financial related questions;
s2, processing the natural language NLP, and processing the problems proposed by the customers through the NLP technology;
s3, semantic analysis and understanding, wherein phrases and keywords representing entities/relations are selected according to the processed information for subsequent retrieval;
s4, information retrieval, namely retrieving information obtained through semantic analysis and understanding in the financial vertical map;
s5, outputting results, generating answers according to the search results to output answers to the questions of the clients, and guiding and mining the questions;
the financial vertical map is produced by combing and storing financial domain knowledge and the relationship between the financial domain knowledge through a database by utilizing a machine learning technology.
2. The NLP artificial intelligence method based on financial vertical knowledge-graph as claimed in claim 1, wherein the making of the financial vertical knowledge-graph comprises:
1) preparing related learning materials of the financial field;
2) learning the relevant learning data of the financial field in a semi-supervised mode by means of a machine learning technology;
3) combing the learned financial domain knowledge entities and the relations between the financial domain knowledge entities and storing the learned financial domain knowledge entities in a database to obtain a database;
4) semi-supervised maintenance is carried out on the relationship between the financial field knowledge entities, and synonyms representing the same relationship are aggregated in the entity range related to the relationship;
5) adding picture description to common entities, and using the picture description as a retrieval item for selection, extraction and use when outputting subsequent results, thereby improving the diversification of interactive answer forms;
when each entity concept is stored, other existing entities are associated, and finally, a financial vertical map with mutual relations among the entities is formed.
3. The NLP artificial intelligence method based on financial vertical knowledge-graph as claimed in claim 1, wherein the NLP natural language processing comprises the following steps in order of precedence:
s2.1, processing data, wherein the processing data comprises the processes of complex and simple conversion, Chinese word segmentation, part of speech tagging, data cleaning, syntax analysis, entity recognition and/or voice character conversion;
s2.2, classifying the data processed problems by the classification model from the three aspects of problem type, user behavior and emotion recognition;
s2.3, extracting information, wherein the information extraction is sequentially carried out according to the steps of extraction based on part of speech tagging, extraction based on semantic analysis and extraction based on speech piece analysis;
s2.4, completing information, detecting the deletion of sentence structures in the questions proposed by the customers, and completing the missing part;
s2.5 information queue: putting the extracted key information into a queue, removing the key information from the queue after more than 5 times of conversation, and simultaneously resolving and paraphrasing the indicative pronouns in the sentences.
4. The NLP artificial intelligence method based on financial vertical knowledge-graph as claimed in claim 1, wherein said information retrieval comprises the following steps in the order of precedence:
s4.1, classifying the questions, and defining the scene range of the answer of the questions according to the information obtained by semantic analysis and understanding, wherein the types of the questions comprise: special scene questions, pre-or post-engagement questions, QA (question and answer) questions;
and S4.2, searching the questions, searching the information obtained by semantic analysis and understanding in the financial vertical map within the range of the circled scene after the question types are determined to obtain required entities and/or relations, and generating answers according to word co-occurrence similarity, sentence pattern matching strategies and/or word sequence consideration modes for sending the answers to the clients.
5. The NLP artificial intelligence method based on financial vertical knowledge graph as claimed in claim 1, wherein the result output includes outputting the generated answers, guided and mined questions to the customer, wherein the guided and mined questions are further guided and mined according to the entity associated with the answers in the financial vertical knowledge graph to form new questions to be output to the customer for the customer to answer or confirm.
6. A system for intelligent financial problem interaction by using the NLP artificial intelligence method based on financial vertical knowledge graph according to any one of claims 1-5, wherein the system comprises an input module, an NLP natural language processing module, a semantic analysis and understanding module, an information retrieval module and a result output module;
wherein,
the input module is used for receiving financial related questions put forward by a customer and sending the questions to the NLP natural language processing module for processing;
the NLP natural language processing module processes the problems proposed by the customers through an NLP technology;
the semantic analysis and understanding module selects proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
the information retrieval module retrieves the information obtained by semantic analysis and understanding in the financial vertical map;
and the result output module generates answer output according to the retrieval result so as to answer the question of the client, and guides and mines the question.
CN201711459522.8A 2017-12-28 2017-12-28 A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping Pending CN107958091A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711459522.8A CN107958091A (en) 2017-12-28 2017-12-28 A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711459522.8A CN107958091A (en) 2017-12-28 2017-12-28 A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping

Publications (1)

Publication Number Publication Date
CN107958091A true CN107958091A (en) 2018-04-24

Family

ID=61957098

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711459522.8A Pending CN107958091A (en) 2017-12-28 2017-12-28 A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping

Country Status (1)

Country Link
CN (1) CN107958091A (en)

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108733654A (en) * 2018-05-21 2018-11-02 宁波薄言信息技术有限公司 A kind of information processing method
CN108920603A (en) * 2018-06-28 2018-11-30 厦门快商通信息技术有限公司 A kind of customer service bootstrap technique based on customer service machine mould
CN108932340A (en) * 2018-07-13 2018-12-04 华融融通(北京)科技有限公司 The construction method of financial knowledge mapping under a kind of non-performing asset operation field
CN109344174A (en) * 2018-09-13 2019-02-15 深圳易投云智能科技有限公司 Financial analysis method and system
CN109559144A (en) * 2018-09-20 2019-04-02 欧阳江 A kind of personalization securities industry customer service system and method
CN109658271A (en) * 2018-12-19 2019-04-19 前海企保科技(深圳)有限公司 A kind of intelligent customer service system and method based on the professional scene of insurance
CN109829041A (en) * 2018-12-25 2019-05-31 出门问问信息科技有限公司 Question processing method and device, computer equipment and computer readable storage medium
CN109918489A (en) * 2019-02-28 2019-06-21 上海乐言信息科技有限公司 A kind of knowledge question answering method and system of more strategy fusions
CN110019742A (en) * 2018-06-19 2019-07-16 北京京东尚科信息技术有限公司 Method and apparatus for handling information
CN110019687A (en) * 2019-04-11 2019-07-16 宁波深擎信息科技有限公司 A kind of more intention assessment systems, method, equipment and the medium of knowledge based map
CN110347894A (en) * 2019-05-31 2019-10-18 平安科技(深圳)有限公司 Knowledge mapping processing method, device, computer equipment and storage medium based on crawler
CN110443382A (en) * 2019-07-09 2019-11-12 青岛聚好联科技有限公司 A kind of user interaction approach based on intelligent customer service platform
CN110543546A (en) * 2019-07-18 2019-12-06 华瑞新智科技(北京)有限公司 Method and device for inquiring hidden danger problems in industrial equipment
CN110597952A (en) * 2019-08-20 2019-12-20 深圳壹账通智能科技有限公司 Information processing method, server, and computer storage medium
CN110727773A (en) * 2019-10-11 2020-01-24 沈阳民航东北凯亚有限公司 Information providing method and device
CN110807091A (en) * 2019-03-01 2020-02-18 王涵 A hotel intelligent question answering recommendation and decision support analysis method and system
CN110990526A (en) * 2019-11-21 2020-04-10 腾讯科技(深圳)有限公司 Query statement display method and related equipment
CN111091454A (en) * 2019-11-05 2020-05-01 新华智云科技有限公司 Financial public opinion recommendation method based on knowledge graph
WO2020114269A1 (en) * 2018-12-05 2020-06-11 中兴通讯股份有限公司 Robo-advisor implementation method and system
CN111309888A (en) * 2020-02-25 2020-06-19 百度在线网络技术(北京)有限公司 Man-machine conversation method, device, electronic equipment and storage medium
CN111739541A (en) * 2019-03-19 2020-10-02 上海云思智慧信息技术有限公司 A voice-based conference assistance method and system, storage medium and terminal
CN111753021A (en) * 2020-06-17 2020-10-09 第四范式(北京)技术有限公司 Method, apparatus, device and readable storage medium for constructing knowledge graph
WO2020221142A1 (en) * 2019-04-28 2020-11-05 华为技术有限公司 Picture book-based question and answer interaction method and electronic device
CN112015919A (en) * 2020-09-15 2020-12-01 重庆广播电视大学重庆工商职业学院 Dialogue management method based on learning auxiliary knowledge graph
CN112328808A (en) * 2020-11-03 2021-02-05 四川长虹电器股份有限公司 Knowledge graph-based question and answer method and device, electronic equipment and storage medium
CN113051934A (en) * 2021-03-26 2021-06-29 国网河北省电力有限公司信息通信分公司 User question and sentence mixing system for processing semantic similarity and word order similarity
CN113312489A (en) * 2021-04-13 2021-08-27 武汉烽火众智数字技术有限责任公司 Panorama retrieval system and method based on NLP and graph database
CN113326367A (en) * 2021-06-30 2021-08-31 四川启睿克科技有限公司 Task type dialogue method and system based on end-to-end text generation
CN113434645A (en) * 2021-05-26 2021-09-24 上海声通信息科技股份有限公司 Financial product intelligence customer service system
CN114706991A (en) * 2022-01-27 2022-07-05 清华大学 Knowledge network construction method, device, equipment and storage medium
CN115114453A (en) * 2022-06-29 2022-09-27 中国银行股份有限公司 Intelligent customer service implementation method and device based on knowledge graph
CN117157635A (en) * 2021-04-14 2023-12-01 三菱电机株式会社 Learning device, reasoning device, program, learning method, and reasoning method
CN117151122A (en) * 2023-10-30 2023-12-01 湖南三湘银行股份有限公司 A method and system for bank customer service conversation question and answer processing based on natural language processing
CN119623619A (en) * 2024-08-16 2025-03-14 中国科学院深圳先进技术研究院 Intellectual property and academic assistant system and implementation method, device, electronic device and storage medium thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104050256A (en) * 2014-06-13 2014-09-17 西安蒜泥电子科技有限责任公司 Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method
CN105868313A (en) * 2016-03-25 2016-08-17 浙江大学 Mapping knowledge domain questioning and answering system and method based on template matching technique
CN106776711A (en) * 2016-11-14 2017-05-31 浙江大学 A kind of Chinese medical knowledge mapping construction method based on deep learning
CN106919655A (en) * 2017-01-24 2017-07-04 网易(杭州)网络有限公司 A kind of answer provides method and apparatus
CN107341215A (en) * 2017-06-07 2017-11-10 北京航空航天大学 A kind of vertical knowledge mapping classification ensemble querying method of multi-source based on Distributed Computing Platform

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104050256A (en) * 2014-06-13 2014-09-17 西安蒜泥电子科技有限责任公司 Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method
CN105868313A (en) * 2016-03-25 2016-08-17 浙江大学 Mapping knowledge domain questioning and answering system and method based on template matching technique
CN106776711A (en) * 2016-11-14 2017-05-31 浙江大学 A kind of Chinese medical knowledge mapping construction method based on deep learning
CN106919655A (en) * 2017-01-24 2017-07-04 网易(杭州)网络有限公司 A kind of answer provides method and apparatus
CN107341215A (en) * 2017-06-07 2017-11-10 北京航空航天大学 A kind of vertical knowledge mapping classification ensemble querying method of multi-source based on Distributed Computing Platform

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
公安部第三研究所: "多摄像机协同关注目标检测跟踪技术", 《多摄像机协同关注目标检测跟踪技术 *
陆刚等: "机电工程网络信息资源检索与利用", 《机电工程网络信息资源检索与利用 *

Cited By (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108733654A (en) * 2018-05-21 2018-11-02 宁波薄言信息技术有限公司 A kind of information processing method
CN110019742A (en) * 2018-06-19 2019-07-16 北京京东尚科信息技术有限公司 Method and apparatus for handling information
CN110019742B (en) * 2018-06-19 2024-05-17 北京京东尚科信息技术有限公司 Method and device for processing information
CN108920603A (en) * 2018-06-28 2018-11-30 厦门快商通信息技术有限公司 A kind of customer service bootstrap technique based on customer service machine mould
CN108920603B (en) * 2018-06-28 2021-12-21 厦门快商通信息技术有限公司 Customer service guiding method based on customer service machine model
CN108932340A (en) * 2018-07-13 2018-12-04 华融融通(北京)科技有限公司 The construction method of financial knowledge mapping under a kind of non-performing asset operation field
CN109344174A (en) * 2018-09-13 2019-02-15 深圳易投云智能科技有限公司 Financial analysis method and system
CN109559144A (en) * 2018-09-20 2019-04-02 欧阳江 A kind of personalization securities industry customer service system and method
CN111353013A (en) * 2018-12-05 2020-06-30 中兴通讯股份有限公司 Method and system for realizing intelligent delivery and reception
WO2020114269A1 (en) * 2018-12-05 2020-06-11 中兴通讯股份有限公司 Robo-advisor implementation method and system
CN109658271A (en) * 2018-12-19 2019-04-19 前海企保科技(深圳)有限公司 A kind of intelligent customer service system and method based on the professional scene of insurance
CN109829041A (en) * 2018-12-25 2019-05-31 出门问问信息科技有限公司 Question processing method and device, computer equipment and computer readable storage medium
CN109918489A (en) * 2019-02-28 2019-06-21 上海乐言信息科技有限公司 A kind of knowledge question answering method and system of more strategy fusions
CN110807091A (en) * 2019-03-01 2020-02-18 王涵 A hotel intelligent question answering recommendation and decision support analysis method and system
CN110807091B (en) * 2019-03-01 2023-08-18 王涵 Hotel intelligent question-answer recommendation and decision support analysis method and system
CN111739541A (en) * 2019-03-19 2020-10-02 上海云思智慧信息技术有限公司 A voice-based conference assistance method and system, storage medium and terminal
CN110019687B (en) * 2019-04-11 2021-03-23 宁波深擎信息科技有限公司 Multi-intention recognition system, method, equipment and medium based on knowledge graph
CN110019687A (en) * 2019-04-11 2019-07-16 宁波深擎信息科技有限公司 A kind of more intention assessment systems, method, equipment and the medium of knowledge based map
WO2020221142A1 (en) * 2019-04-28 2020-11-05 华为技术有限公司 Picture book-based question and answer interaction method and electronic device
CN110347894A (en) * 2019-05-31 2019-10-18 平安科技(深圳)有限公司 Knowledge mapping processing method, device, computer equipment and storage medium based on crawler
CN110443382A (en) * 2019-07-09 2019-11-12 青岛聚好联科技有限公司 A kind of user interaction approach based on intelligent customer service platform
CN110543546B (en) * 2019-07-18 2022-03-22 华瑞新智科技(北京)有限公司 Method and device for inquiring hidden danger problems in industrial equipment
CN110543546A (en) * 2019-07-18 2019-12-06 华瑞新智科技(北京)有限公司 Method and device for inquiring hidden danger problems in industrial equipment
CN110597952A (en) * 2019-08-20 2019-12-20 深圳壹账通智能科技有限公司 Information processing method, server, and computer storage medium
CN110727773B (en) * 2019-10-11 2022-02-01 沈阳民航东北凯亚有限公司 Information providing method and device
CN110727773A (en) * 2019-10-11 2020-01-24 沈阳民航东北凯亚有限公司 Information providing method and device
CN111091454A (en) * 2019-11-05 2020-05-01 新华智云科技有限公司 Financial public opinion recommendation method based on knowledge graph
CN110990526B (en) * 2019-11-21 2024-01-30 腾讯科技(深圳)有限公司 Query statement display method and related equipment
CN110990526A (en) * 2019-11-21 2020-04-10 腾讯科技(深圳)有限公司 Query statement display method and related equipment
CN111309888A (en) * 2020-02-25 2020-06-19 百度在线网络技术(北京)有限公司 Man-machine conversation method, device, electronic equipment and storage medium
CN111309888B (en) * 2020-02-25 2023-10-24 百度在线网络技术(北京)有限公司 Man-machine conversation method and device, electronic equipment and storage medium
CN111753021A (en) * 2020-06-17 2020-10-09 第四范式(北京)技术有限公司 Method, apparatus, device and readable storage medium for constructing knowledge graph
CN112015919A (en) * 2020-09-15 2020-12-01 重庆广播电视大学重庆工商职业学院 Dialogue management method based on learning auxiliary knowledge graph
CN112328808A (en) * 2020-11-03 2021-02-05 四川长虹电器股份有限公司 Knowledge graph-based question and answer method and device, electronic equipment and storage medium
CN113051934A (en) * 2021-03-26 2021-06-29 国网河北省电力有限公司信息通信分公司 User question and sentence mixing system for processing semantic similarity and word order similarity
CN113312489A (en) * 2021-04-13 2021-08-27 武汉烽火众智数字技术有限责任公司 Panorama retrieval system and method based on NLP and graph database
CN117157635A (en) * 2021-04-14 2023-12-01 三菱电机株式会社 Learning device, reasoning device, program, learning method, and reasoning method
CN113434645A (en) * 2021-05-26 2021-09-24 上海声通信息科技股份有限公司 Financial product intelligence customer service system
CN113326367B (en) * 2021-06-30 2023-06-16 四川启睿克科技有限公司 Task type dialogue method and system based on end-to-end text generation
CN113326367A (en) * 2021-06-30 2021-08-31 四川启睿克科技有限公司 Task type dialogue method and system based on end-to-end text generation
CN114706991A (en) * 2022-01-27 2022-07-05 清华大学 Knowledge network construction method, device, equipment and storage medium
CN115114453A (en) * 2022-06-29 2022-09-27 中国银行股份有限公司 Intelligent customer service implementation method and device based on knowledge graph
CN117151122B (en) * 2023-10-30 2024-03-22 湖南三湘银行股份有限公司 Bank customer service session question-answering processing method and system based on natural language processing
CN117151122A (en) * 2023-10-30 2023-12-01 湖南三湘银行股份有限公司 A method and system for bank customer service conversation question and answer processing based on natural language processing
CN119623619A (en) * 2024-08-16 2025-03-14 中国科学院深圳先进技术研究院 Intellectual property and academic assistant system and implementation method, device, electronic device and storage medium thereof

Similar Documents

Publication Publication Date Title
CN107958091A (en) A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping
CN109271537B (en) Text-to-image generation method and system based on distillation learning
CN106570708B (en) Management method and system of intelligent customer service knowledge base
US11017177B2 (en) Neural network systems and methods for target identification from text
Hammad et al. An approach for detecting spam in Arabic opinion reviews
CN103049433B (en) The method of automatic question-answering method, automatically request-answering system and structure question and answer case library
KR102240910B1 (en) Korean Customer Service Associate Assist System based on Machine Learning
CN109918560A (en) A kind of answering method and device based on search engine
CN108595696A (en) A kind of human-computer interaction intelligent answering method and system based on cloud platform
CN113722443B (en) Label recommendation method and system integrating text similarity and collaborative filtering
CN111159363A (en) Knowledge base-based question answer determination method and device
CN116881426B (en) AIGC-based self-explanatory question-answering system
Nanda et al. A hindi question answering system using machine learning approach
CN111737558A (en) Information recommendation method and device and computer readable storage medium
CN110909116B (en) Entity set expansion method and system for social media
CN109460462B (en) Chinese similarity problem generation system and method
CN118467845B (en) Method for constructing intelligent interactive service system, website intelligent interactive method and device
CN106951470A (en) A kind of intelligent Answer System retrieved based on professional knowledge figure
Singh et al. Sentiment analysis of twitter user data on Punjab legislative assembly election, 2017
CN110347833B (en) A Classification Method for Multi-round Dialogue
CN115168615A (en) Knowledge graph big data processing method and system combining data visualization
CN114281972B (en) Dialogue control method, system storage medium and server based on subject object tracking and cognitive reasoning
CN116383354A (en) A graph visualization automatic question answering method based on knowledge graph
Walha et al. A Lexicon approach to multidimensional analysis of tweets opinion
Alqaraleh Turkish sentiment analysis system via ensemble learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180424