CN107958091A - A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping - Google Patents
A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping Download PDFInfo
- Publication number
- CN107958091A CN107958091A CN201711459522.8A CN201711459522A CN107958091A CN 107958091 A CN107958091 A CN 107958091A CN 201711459522 A CN201711459522 A CN 201711459522A CN 107958091 A CN107958091 A CN 107958091A
- Authority
- CN
- China
- Prior art keywords
- financial
- questions
- nlp
- information
- entities
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/06—Asset management; Financial planning or analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Artificial Intelligence (AREA)
- Databases & Information Systems (AREA)
- Finance (AREA)
- Development Economics (AREA)
- Operations Research (AREA)
- Economics (AREA)
- Entrepreneurship & Innovation (AREA)
- Animal Behavior & Ethology (AREA)
- Game Theory and Decision Science (AREA)
- Human Resources & Organizations (AREA)
- Life Sciences & Earth Sciences (AREA)
- Human Computer Interaction (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Technology Law (AREA)
- General Business, Economics & Management (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping, pass through the financial vertical collection of illustrative plates of foundation, it is associated with NLP, that is, natural language processing, so as to establish a set of perfect intelligent finance problem interactive system, with traditional search, question answering system is different by dynamic response, such a method can carry out active rhetorical question, recommend, excavate the problem of user's deep layer is wanted to ask, in flow, the problem of such a method receives user, first pass through NLP technologies, then semanteme is analyzed, retrieved by semanteme into financial vertical collection of illustrative plates, draw a circle to approve the scene domain of question answering, choose most suitable answer and be pushed to client, the problem of client can so being proposed, problem is better understood from reference to knowledge mapping, it can more accurately retrieve answer at the same time.
Description
Technical Field
The invention relates to the field of artificial intelligence dialogue systems and robot languages.
Background
At present, financial managers generally play a role of shopping guide in the purchasing process of financial products, and customers need continuous communication companions in the whole thinking decision process. Unlike traditional product sales, for financial products, purchasing is only a start and the subsequent process requires more communication to serve as a care companion. However, it is obviously impossible to provide communication in all weather by using manual service to ensure high-quality and efficient service. The rise of AI intelligence makes it possible to technically realize all-weather efficient intelligent service for customers.
In the prior art, the technologies relied on by the man-machine conversation system are mainly divided into the following two types:
the method is based on a question-answer library technology, namely, keywords in sentences are extracted to be matched with a large amount of sorted questions and answers. Many intelligent customer service adopt this kind of method to replace the cost of some manual customer service, and this kind of method disadvantage is that there is no correlation among the massive data, in addition if some sentence word orders are changed, the meaning is totally different, but the matched answer is probably unanimous;
the technology based on the search engine is to screen and answer the questions of the user by using the results returned by the search engine, and on one hand, the technology obtains low-quality responses, and on the other hand, the obtained answers are not controllable.
It can be seen that there are at least the following problems in the prior art:
1. the problems and corresponding answers of the traditional method are relatively independently stored, and all the problems are not related, so that the user problems cannot be effectively presumed through semantics;
2. the traditional method is more lateral to solving common user problems and problems before sale, and most personalized problems after sale cannot be answered accurately;
3. the traditional method is based on template matching, the problem richness is achieved by generalization of artificial mass problems, various questions need to be thought of in the same problem, and a large amount of time and energy are consumed.
Disclosure of Invention
Aiming at the problems in the prior art, the application provides a brand-new solution.
The application discloses an NLP (natural language processing) artificial intelligence method based on a financial vertical knowledge graph to realize intelligent financial problem interaction between a machine and a client, which specifically comprises the following steps:
s1, asking questions, and the client asks financial related questions;
s2, processing the natural language NLP, and processing the problems proposed by the customers through the NLP technology;
s3, semantic analysis and understanding, selecting proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
s4, information retrieval, namely retrieving information obtained through semantic analysis and understanding in the financial vertical map;
and S5, outputting the result, generating answer output according to the search result to answer the question of the client, and guiding and mining the question.
The financial vertical map is produced by combing and storing financial domain knowledge and the relationship between the financial domain knowledge through a database by utilizing a machine learning technology.
Further, the making of the financial vertical map comprises the following steps:
1) preparing related learning materials of the financial field;
2) learning the relevant learning data of the financial field in a semi-supervised mode by means of a machine learning technology;
3) combing the learned financial domain knowledge entities and the relations between the financial domain knowledge entities and storing the learned financial domain knowledge entities in a database to obtain a database;
4) semi-supervised maintenance is carried out on the relationship between the financial field knowledge entities, and synonyms representing the same relationship are aggregated in the entity range related to the relationship;
5) the picture description is added to common entities and used as a retrieval item for selection, extraction and use in the subsequent result output, so that the diversification of interactive answer forms can be improved.
When each entity concept is stored, other existing entities are associated, and finally, a financial vertical map with mutual relations among the entities is formed.
Further, the NLP natural language processing includes the following steps in chronological order:
s2.1, processing data, wherein the processing data comprises the processes of complex and simple conversion, Chinese word segmentation, part of speech tagging, data cleaning, syntax analysis, entity recognition and/or voice character conversion;
s2.2, classifying the data processed problems by the classification model from the three aspects of problem type, user behavior and emotion recognition;
s2.3, extracting information, wherein the information extraction is sequentially carried out according to the steps of extraction based on part of speech tagging, extraction based on semantic analysis and extraction based on speech piece analysis;
the extraction based on Part-of-Speech tagging is called PoS (Part-of-Speech) extraction for short, namely, the extraction is selected according to tagged nouns, verbs, adjectives or other parts-of-Speech;
the extraction based on semantic analysis selects a mode based on industry keywords or an entity list to extract and call information, and the calling method is a common statistical method known in the industry, such as chi-square, information gain, mutual information technology and the like.
And the sentence analysis is used for extracting and calling information in a dependency tree mode, wherein the dependency tree represents the dependency relationship among words in a sentence.
S2.4, completing information, detecting the deletion of sentence structures in the questions proposed by the customers, and completing the missing part;
s2.5 information queue: putting the extracted key information into a queue, removing the key information from the queue after more than 5 times of conversation, and simultaneously resolving and paraphrasing the indicative pronouns in the sentences.
In which conversation and queue specifically means that the key information of the latest conversation spoken by the user is put into the information queue, and the key information entered earlier (for example, 5 rounds ahead) is moved out of the queue. The queue information is used for controlling the semantic range of information retrieval and providing a basis for multi-round problem reply.
Further, the information retrieval comprises the following steps in sequence:
s4.1, classifying the questions, and defining the scene range of the answer of the questions according to the information obtained by semantic analysis and understanding, wherein the types of the questions comprise: special scene type questions, pre-or post-engagement questions, QA (question and answer type) questions.
And S4.2, searching the questions, searching the information obtained by semantic analysis and understanding in the financial vertical map within the range of the circled scene after the question types are determined to obtain required entities and/or relations, and generating answers according to word co-occurrence similarity, sentence pattern matching strategies and/or word sequence consideration modes for sending the answers to the clients.
Word co-occurrence similarity is a statistical-based model, and in a segment of language expression, several words often co-occur (i.e., co-occur) in the same sentence or the same paragraph, so that the words are considered to be related in meaning. The similarity calculation is carried out by adopting the method, and the entity searched by the semantics is calculated by adopting the method, so that the accuracy of the subsequent output result is improved.
The sentence pattern matching strategy has the effect that relevant entities required by the response are extracted through the information retrieval step, but the relevant entities are not a complete sentence. Through the sentence pattern matching strategy, the answer sentence pattern can be correspondingly selected through the known question sentence pattern, and the blank slot filling of the 'information completion' part is multiplexed to generate the answer.
The word order consideration is a supplement of the co-occurrence similarity, after the entities are selected, a group of similar entities is actually selected, the priority ranking of the group of entities is determined according to the matching degree of the searched group of entities and the questions, the higher priority is presented in the answers, and the lower priority is the alternative answers.
Further, the result output comprises outputting the generated answer, the guided and excavated question to the customer, wherein the guided and excavated question is further guided and excavated according to the entity associated with the answer in the financial vertical map, and forms a new question to be output to the customer for the customer to answer or confirm, so that a more real and effective dialogue interaction state can be realized.
In addition, the application also discloses a system for carrying out intelligent financial problem interaction by using the NLP artificial intelligence method based on the financial vertical knowledge graph, which specifically comprises an input module, an NLP natural language processing module, a semantic analysis and understanding module, an information retrieval module and a result output module;
wherein,
the input module is used for receiving financial related questions put forward by a customer and sending the questions to the NLP natural language processing module for processing;
the NLP natural language processing module processes the problems proposed by the customers through an NLP technology;
the semantic analysis and understanding module selects proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
the information retrieval module retrieves the information obtained by semantic analysis and understanding in the financial vertical map;
and the result output module generates answer output according to the retrieval result so as to answer the question of the client, and guides and mines the question.
And the established financial vertical map is associated with NLP (natural language processing), so that a set of complete intelligent financial problem interaction system is established. Different from the passive response of the traditional searching and question-answering system, the method can actively ask a question, recommend and mine questions deeply asked by the user. In the process, the method receives the questions of the user, firstly, the NLP technology is used, then, the semantics are analyzed, the financial vertical map is searched through the semantics, the scene range of question answers is defined, and the most suitable answers are selected and pushed to the client.
By doing so, on one hand, the questions proposed by the customers can be better understood by combining the knowledge graph, and on the other hand, the answers can be more accurately retrieved. A plurality of knowledge points exist in the whole financial activity, the relation of the related knowledge points is combed and appropriately stored by a graph database for subsequent reading and retrieval of an NLP system.
Compared with the prior art, the application has the advantages that:
1. the established financial knowledge map can continuously perfect and enrich entity/relation data in the database, when each entity concept is stored, other existing entities can be intelligently associated, and a financial vertical map with complex relation is finally formed, so that complex personalized problems of customers can be solved, and meanwhile, the diversification of interactive answer forms can be improved by storing different types of data (such as picture information) and a special information retrieval and answer generation output mechanism.
2. Active question asking can be performed, questions deeply asked by the user can be recommended and mined, and the more real and effective conversation interaction state is realized.
3. The client problems can be better and more efficiently understood, the most appropriate answers are accurately retrieved, and the conversation communication efficiency is greatly improved.
Drawings
FIG. 1 is an illustration of an example of a converged knowledge entity in a financial vertical graph stored in a graph database;
FIG. 2 is a general flow diagram of an intelligent dialog;
FIG. 3 is a detailed flow diagram of a financial vertical knowledge-graph based NLP artificial intelligence method to achieve intelligent financial problem interaction between machines and customers.
Detailed Description
For the purpose of full disclosure, the present invention will be described in further detail with reference to the following examples. It should be understood that the specific examples described below are only for illustrating the present invention and are not intended to limit the scope of the present invention.
The application specifically discloses an NLP artificial intelligence method based on a financial vertical knowledge graph to realize intelligent financial problem interaction between a machine and a client, and the method specifically comprises the following steps:
s1, asking questions, and the client asks financial related questions;
s2, processing the natural language NLP, and processing the problems proposed by the customers through the NLP technology;
s3, semantic analysis and understanding, selecting proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
s4, information retrieval, namely retrieving information obtained through semantic analysis and understanding in the financial vertical map;
and S5, outputting the result, generating answer output according to the search result to answer the question of the client, and guiding and mining the question.
The financial vertical map is produced by combing and storing financial domain knowledge and the relationship between the financial domain knowledge through a database by utilizing a machine learning technology.
An example of a financial knowledge entity in a financial vertical as shown in FIG. 1, the production of the financial vertical comprising:
1) preparing financial field-related learning materials, which are materials of bank-related knowledge in the example shown in fig. 1;
2) learning the relevant learning materials of the financial field by using a semi-supervised mode by means of a machine learning technology, wherein in the example, the learning materials contain information of four major lines of China and relevant relations thereof;
3) the learned financial field knowledge entities and the relations between the financial field knowledge entities are combed and stored in a database to obtain a graph database, in the example, learning that 'four major banks in China have China Bank, China Industrial Bank, China agricultural Bank and China construction Bank', 5 entities and 4 groups of relations are learned through the sentence, namely four major banks in China-having-China Bank, China Industrial Bank in China-having-China, four major banks in China-having-China agricultural Bank in China-having-China Bank in China and four major banks in China-having-China construction Bank in China;
4) and performing semi-supervised maintenance on the relationship between the knowledge entities in the financial field, and aggregating synonyms representing the same relationship in the entity range related to the relationship, for example, receiving the financial knowledge again, wherein the four major banks in China comprise a Chinese bank, a Chinese industrial and commercial bank, a Chinese agricultural bank and a Chinese construction bank. New relations including appear among 5 existing entities again, and two semantically existing relations including and including are consistent, can be combined into a relation, and records close relations to the entities, so that the accuracy of machine learning is improved;
5) adding picture description to common entities, using the picture description as a retrieval item for subsequent selection and extraction, and importing related pictures to the common entities for selection and extraction during reply in order to improve the diversity of interactive answer forms by storing the pictures.
When each entity concept is stored, other existing entities are associated, and finally, a financial vertical map with mutual relations among the entities is formed.
As shown in fig. 3, fig. 3 shows a detailed flow of NLP artificial intelligence method based on financial vertical knowledge graph to realize intelligent financial problem interaction between machine and customer,
the NLP natural language processing in step S2 includes the following steps in order:
s2.1, processing data, wherein the processing data comprises the processes of complex and simple conversion, Chinese word segmentation, part of speech tagging, data cleaning, syntax analysis, entity recognition and/or voice character conversion;
the mathematical expression of the method adopted by Chinese word segmentation is as follows:
(T)=(W1)P(W2|W1)P(W3|W2)…P(Wn|Wn-1)
p represents the probability of occurrence, where W1Represents the result of the first word region recognition, so P (W)1) Representing the probability of the first word occurring. P (W)2|W1) Representing the probability of a second word appearing if the first word appears, W can be seen1W2Whether it is a common word. Thus, a sentence with divided words is generated until the end of the sentence. Such as: china | four great | banks | which |?
S2.2, classifying the data processed problems by the classification model from the three aspects of problem type, user behavior and emotion recognition;
the question type is mainly the question to distinguish the user is the 5w type question (where, what, while, why, how), the regular question or the non-question type communication.
User behavior refers to the question that the user makes on which page, under what circumstances.
And the emotion recognition can extract the related emotion words according to the conversation process of the user.
S2.3, extracting information, wherein the information extraction is sequentially carried out according to the steps of extraction based on part of speech tagging, extraction based on semantic analysis and extraction based on speech piece analysis;
the extraction based on Part-of-Speech tagging is called PoS (Part-of-Speech) extraction for short, namely, the extraction is selected according to tagged nouns, verbs, adjectives or other parts-of-Speech;
the extraction based on semantic analysis selects a mode based on industry keywords or an entity list to extract and call information, and the calling method is a common statistical method known in the industry, such as chi-square, information gain, mutual information technology and the like.
And the sentence analysis is used for extracting and calling information in a dependency tree mode, wherein the dependency tree represents the dependency relationship among words in a sentence.
More specifically, the mathematical expression in which information extraction is performed by the chi-square:
the feature extraction algorithm is divided into two categories of feature selection and feature extraction. The chi-square test belongs to the better algorithm in the feature selection algorithm.
t and c are two random variables, chi 2 represents chi-square value for checking the correlation of data, t represents a word, and c represents a category. For example, t may represent a flower and c may represent a plant.
When we find N articles, t is classified as containing and not containing, and c is classified as belonging and not belonging. So that the following tables can be generated, corresponding to the letters in the above formulas, respectively.
| Feature selection | c1. Belonging to the genus of "plants" | c2. Not belonging to "plants" | Total of |
| t1. comprising "flowers" | A | B | A+B |
| t2. not containing "flowers" | C | D | C+D |
| Total number of | A+C | B+D | N |
The mathematical expression in which information extraction is performed by information gain:
in the case where the feature is that Y is fixed, the conditional entropy of X is H (X | Y), and P (X | Y) is the probability of occurrence. The colloquial explanation is how much the uncertainty of X is reduced after knowing Y, compared to when it is not. The two variables X, Y of the information gain differ in position, which is a means to consider Y as reducing X uncertainty. While the two variables of mutual information are in the same place.
The mathematical expression for extracting information through mutual information is as follows:
g(D,A)=H(D)-H(D|A)
h (D) represents the self information quantity, i.e. the sent information, H (D | A) represents the condition information quantity, and g (D, A) represents the mutual information quantity. The mutual information amount is self information amount-condition information amount. Information extraction is carried out through mutual information, and the internal bonding strength of the character strings is calculated by utilizing the mutual information, so that the extracted information is more complete and is not the state of the keyword. Such as natural language processing, which finds these words as a fixed academic vocabulary through mutual information, and therefore does not break down into three words for understanding.
S2.4, completing information, namely detecting the deletion of a sentence structure in a question proposed by a client (also called slot blank detection), and completing the missing part (also called blank slot filling);
s2.5 information queue: putting the extracted key information into a queue, removing the key information from the queue after more than 5 times of conversation, and simultaneously resolving and paraphrasing the indicative pronouns in the sentences.
In which conversation and queue specifically means that the key information of the latest conversation spoken by the user is put into the information queue, and the key information entered earlier (for example, 5 rounds ahead) is moved out of the queue. The queue information is used for controlling the semantic range of information retrieval and providing a basis for multi-round problem reply.
Wherein the information retrieval in step S4 includes the following steps in order of sequence:
s4.1, classifying the questions, and defining the scene range of the answer of the questions according to the information obtained by semantic analysis and understanding, wherein the types of the questions comprise: special scene type questions, pre-or post-engagement questions, QA (question and answer type) questions.
And S4.2, searching the questions, searching the information obtained by semantic analysis and understanding in the financial vertical map within the range of the circled scene after the question types are determined to obtain required entities and/or relations, and generating answers according to word co-occurrence similarity, sentence pattern matching strategies and/or word sequence consideration modes for sending the answers to the clients.
Co-occurrence similarity calculation mathematical expression:
Similarity(s1,s2)=αSDMG(s1,s2)+βSDMG(s1,s2)+γSDMG(s1,s2)
s1, s2 are two words, α, β, γ are co-occurrence correlations in three respective spatial vectors.
Wherein the outputting of the result of step S4 includes outputting the generated answer, guided and mined question to the customer, wherein the guided and mined question is further guided and mined according to the entity associated with the answer in the financial vertical map to form a new question to be output to the customer for the customer to answer or confirm, so that a more real and effective dialogue interaction state can be realized. For example:
the end user asks: which are the four banks in china?
And (3) dialog answers: the four banks in China all have China Bank, China Industrial and commercial Bank, China agricultural Bank, China construction Bank, do you know? Wherein the bank of the chinese industry and commerce was established in 1984.
The embodiment can be specifically realized by a system for intelligent financial problem interaction by using an NLP artificial intelligence method based on a financial vertical knowledge graph, wherein the system comprises an input module, an NLP natural language processing module, a semantic analysis and understanding module, an information retrieval module and a result output module;
wherein,
the input module is used for receiving financial related questions put forward by a customer and sending the questions to the NLP natural language processing module for processing;
the NLP natural language processing module processes the problems proposed by the customers through an NLP technology;
the semantic analysis and understanding module selects proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
the information retrieval module retrieves the information obtained by semantic analysis and understanding in the financial vertical map;
and the result output module generates answer output according to the retrieval result so as to answer the question of the client, and guides and mines the question.
The above-described example embodiments merely represent embodiments of the present patent, and the description thereof should not be construed as limiting the scope of the patent. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the patent idea, which falls within the protection scope of the present invention. Therefore, the protection scope of this patent shall be subject to the appended claims. The above-mentioned embodiments only express the embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.
Claims (6)
1. An NLP artificial intelligence method based on a financial vertical knowledge graph is characterized by comprising the following steps:
s1, asking questions, and the client asks financial related questions;
s2, processing the natural language NLP, and processing the problems proposed by the customers through the NLP technology;
s3, semantic analysis and understanding, wherein phrases and keywords representing entities/relations are selected according to the processed information for subsequent retrieval;
s4, information retrieval, namely retrieving information obtained through semantic analysis and understanding in the financial vertical map;
s5, outputting results, generating answers according to the search results to output answers to the questions of the clients, and guiding and mining the questions;
the financial vertical map is produced by combing and storing financial domain knowledge and the relationship between the financial domain knowledge through a database by utilizing a machine learning technology.
2. The NLP artificial intelligence method based on financial vertical knowledge-graph as claimed in claim 1, wherein the making of the financial vertical knowledge-graph comprises:
1) preparing related learning materials of the financial field;
2) learning the relevant learning data of the financial field in a semi-supervised mode by means of a machine learning technology;
3) combing the learned financial domain knowledge entities and the relations between the financial domain knowledge entities and storing the learned financial domain knowledge entities in a database to obtain a database;
4) semi-supervised maintenance is carried out on the relationship between the financial field knowledge entities, and synonyms representing the same relationship are aggregated in the entity range related to the relationship;
5) adding picture description to common entities, and using the picture description as a retrieval item for selection, extraction and use when outputting subsequent results, thereby improving the diversification of interactive answer forms;
when each entity concept is stored, other existing entities are associated, and finally, a financial vertical map with mutual relations among the entities is formed.
3. The NLP artificial intelligence method based on financial vertical knowledge-graph as claimed in claim 1, wherein the NLP natural language processing comprises the following steps in order of precedence:
s2.1, processing data, wherein the processing data comprises the processes of complex and simple conversion, Chinese word segmentation, part of speech tagging, data cleaning, syntax analysis, entity recognition and/or voice character conversion;
s2.2, classifying the data processed problems by the classification model from the three aspects of problem type, user behavior and emotion recognition;
s2.3, extracting information, wherein the information extraction is sequentially carried out according to the steps of extraction based on part of speech tagging, extraction based on semantic analysis and extraction based on speech piece analysis;
s2.4, completing information, detecting the deletion of sentence structures in the questions proposed by the customers, and completing the missing part;
s2.5 information queue: putting the extracted key information into a queue, removing the key information from the queue after more than 5 times of conversation, and simultaneously resolving and paraphrasing the indicative pronouns in the sentences.
4. The NLP artificial intelligence method based on financial vertical knowledge-graph as claimed in claim 1, wherein said information retrieval comprises the following steps in the order of precedence:
s4.1, classifying the questions, and defining the scene range of the answer of the questions according to the information obtained by semantic analysis and understanding, wherein the types of the questions comprise: special scene questions, pre-or post-engagement questions, QA (question and answer) questions;
and S4.2, searching the questions, searching the information obtained by semantic analysis and understanding in the financial vertical map within the range of the circled scene after the question types are determined to obtain required entities and/or relations, and generating answers according to word co-occurrence similarity, sentence pattern matching strategies and/or word sequence consideration modes for sending the answers to the clients.
5. The NLP artificial intelligence method based on financial vertical knowledge graph as claimed in claim 1, wherein the result output includes outputting the generated answers, guided and mined questions to the customer, wherein the guided and mined questions are further guided and mined according to the entity associated with the answers in the financial vertical knowledge graph to form new questions to be output to the customer for the customer to answer or confirm.
6. A system for intelligent financial problem interaction by using the NLP artificial intelligence method based on financial vertical knowledge graph according to any one of claims 1-5, wherein the system comprises an input module, an NLP natural language processing module, a semantic analysis and understanding module, an information retrieval module and a result output module;
wherein,
the input module is used for receiving financial related questions put forward by a customer and sending the questions to the NLP natural language processing module for processing;
the NLP natural language processing module processes the problems proposed by the customers through an NLP technology;
the semantic analysis and understanding module selects proper phrases and keywords representing entities/relations according to the processed information for subsequent retrieval;
the information retrieval module retrieves the information obtained by semantic analysis and understanding in the financial vertical map;
and the result output module generates answer output according to the retrieval result so as to answer the question of the client, and guides and mines the question.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201711459522.8A CN107958091A (en) | 2017-12-28 | 2017-12-28 | A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201711459522.8A CN107958091A (en) | 2017-12-28 | 2017-12-28 | A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN107958091A true CN107958091A (en) | 2018-04-24 |
Family
ID=61957098
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201711459522.8A Pending CN107958091A (en) | 2017-12-28 | 2017-12-28 | A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN107958091A (en) |
Cited By (34)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108733654A (en) * | 2018-05-21 | 2018-11-02 | 宁波薄言信息技术有限公司 | A kind of information processing method |
| CN108920603A (en) * | 2018-06-28 | 2018-11-30 | 厦门快商通信息技术有限公司 | A kind of customer service bootstrap technique based on customer service machine mould |
| CN108932340A (en) * | 2018-07-13 | 2018-12-04 | 华融融通(北京)科技有限公司 | The construction method of financial knowledge mapping under a kind of non-performing asset operation field |
| CN109344174A (en) * | 2018-09-13 | 2019-02-15 | 深圳易投云智能科技有限公司 | Financial analysis method and system |
| CN109559144A (en) * | 2018-09-20 | 2019-04-02 | 欧阳江 | A kind of personalization securities industry customer service system and method |
| CN109658271A (en) * | 2018-12-19 | 2019-04-19 | 前海企保科技(深圳)有限公司 | A kind of intelligent customer service system and method based on the professional scene of insurance |
| CN109829041A (en) * | 2018-12-25 | 2019-05-31 | 出门问问信息科技有限公司 | Question processing method and device, computer equipment and computer readable storage medium |
| CN109918489A (en) * | 2019-02-28 | 2019-06-21 | 上海乐言信息科技有限公司 | A kind of knowledge question answering method and system of more strategy fusions |
| CN110019742A (en) * | 2018-06-19 | 2019-07-16 | 北京京东尚科信息技术有限公司 | Method and apparatus for handling information |
| CN110019687A (en) * | 2019-04-11 | 2019-07-16 | 宁波深擎信息科技有限公司 | A kind of more intention assessment systems, method, equipment and the medium of knowledge based map |
| CN110347894A (en) * | 2019-05-31 | 2019-10-18 | 平安科技(深圳)有限公司 | Knowledge mapping processing method, device, computer equipment and storage medium based on crawler |
| CN110443382A (en) * | 2019-07-09 | 2019-11-12 | 青岛聚好联科技有限公司 | A kind of user interaction approach based on intelligent customer service platform |
| CN110543546A (en) * | 2019-07-18 | 2019-12-06 | 华瑞新智科技(北京)有限公司 | Method and device for inquiring hidden danger problems in industrial equipment |
| CN110597952A (en) * | 2019-08-20 | 2019-12-20 | 深圳壹账通智能科技有限公司 | Information processing method, server, and computer storage medium |
| CN110727773A (en) * | 2019-10-11 | 2020-01-24 | 沈阳民航东北凯亚有限公司 | Information providing method and device |
| CN110807091A (en) * | 2019-03-01 | 2020-02-18 | 王涵 | A hotel intelligent question answering recommendation and decision support analysis method and system |
| CN110990526A (en) * | 2019-11-21 | 2020-04-10 | 腾讯科技(深圳)有限公司 | Query statement display method and related equipment |
| CN111091454A (en) * | 2019-11-05 | 2020-05-01 | 新华智云科技有限公司 | Financial public opinion recommendation method based on knowledge graph |
| WO2020114269A1 (en) * | 2018-12-05 | 2020-06-11 | 中兴通讯股份有限公司 | Robo-advisor implementation method and system |
| CN111309888A (en) * | 2020-02-25 | 2020-06-19 | 百度在线网络技术(北京)有限公司 | Man-machine conversation method, device, electronic equipment and storage medium |
| CN111739541A (en) * | 2019-03-19 | 2020-10-02 | 上海云思智慧信息技术有限公司 | A voice-based conference assistance method and system, storage medium and terminal |
| CN111753021A (en) * | 2020-06-17 | 2020-10-09 | 第四范式(北京)技术有限公司 | Method, apparatus, device and readable storage medium for constructing knowledge graph |
| WO2020221142A1 (en) * | 2019-04-28 | 2020-11-05 | 华为技术有限公司 | Picture book-based question and answer interaction method and electronic device |
| CN112015919A (en) * | 2020-09-15 | 2020-12-01 | 重庆广播电视大学重庆工商职业学院 | Dialogue management method based on learning auxiliary knowledge graph |
| CN112328808A (en) * | 2020-11-03 | 2021-02-05 | 四川长虹电器股份有限公司 | Knowledge graph-based question and answer method and device, electronic equipment and storage medium |
| CN113051934A (en) * | 2021-03-26 | 2021-06-29 | 国网河北省电力有限公司信息通信分公司 | User question and sentence mixing system for processing semantic similarity and word order similarity |
| CN113312489A (en) * | 2021-04-13 | 2021-08-27 | 武汉烽火众智数字技术有限责任公司 | Panorama retrieval system and method based on NLP and graph database |
| CN113326367A (en) * | 2021-06-30 | 2021-08-31 | 四川启睿克科技有限公司 | Task type dialogue method and system based on end-to-end text generation |
| CN113434645A (en) * | 2021-05-26 | 2021-09-24 | 上海声通信息科技股份有限公司 | Financial product intelligence customer service system |
| CN114706991A (en) * | 2022-01-27 | 2022-07-05 | 清华大学 | Knowledge network construction method, device, equipment and storage medium |
| CN115114453A (en) * | 2022-06-29 | 2022-09-27 | 中国银行股份有限公司 | Intelligent customer service implementation method and device based on knowledge graph |
| CN117157635A (en) * | 2021-04-14 | 2023-12-01 | 三菱电机株式会社 | Learning device, reasoning device, program, learning method, and reasoning method |
| CN117151122A (en) * | 2023-10-30 | 2023-12-01 | 湖南三湘银行股份有限公司 | A method and system for bank customer service conversation question and answer processing based on natural language processing |
| CN119623619A (en) * | 2024-08-16 | 2025-03-14 | 中国科学院深圳先进技术研究院 | Intellectual property and academic assistant system and implementation method, device, electronic device and storage medium thereof |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104050256A (en) * | 2014-06-13 | 2014-09-17 | 西安蒜泥电子科技有限责任公司 | Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method |
| CN105868313A (en) * | 2016-03-25 | 2016-08-17 | 浙江大学 | Mapping knowledge domain questioning and answering system and method based on template matching technique |
| CN106776711A (en) * | 2016-11-14 | 2017-05-31 | 浙江大学 | A kind of Chinese medical knowledge mapping construction method based on deep learning |
| CN106919655A (en) * | 2017-01-24 | 2017-07-04 | 网易(杭州)网络有限公司 | A kind of answer provides method and apparatus |
| CN107341215A (en) * | 2017-06-07 | 2017-11-10 | 北京航空航天大学 | A kind of vertical knowledge mapping classification ensemble querying method of multi-source based on Distributed Computing Platform |
-
2017
- 2017-12-28 CN CN201711459522.8A patent/CN107958091A/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104050256A (en) * | 2014-06-13 | 2014-09-17 | 西安蒜泥电子科技有限责任公司 | Initiative study-based questioning and answering method and questioning and answering system adopting initiative study-based questioning and answering method |
| CN105868313A (en) * | 2016-03-25 | 2016-08-17 | 浙江大学 | Mapping knowledge domain questioning and answering system and method based on template matching technique |
| CN106776711A (en) * | 2016-11-14 | 2017-05-31 | 浙江大学 | A kind of Chinese medical knowledge mapping construction method based on deep learning |
| CN106919655A (en) * | 2017-01-24 | 2017-07-04 | 网易(杭州)网络有限公司 | A kind of answer provides method and apparatus |
| CN107341215A (en) * | 2017-06-07 | 2017-11-10 | 北京航空航天大学 | A kind of vertical knowledge mapping classification ensemble querying method of multi-source based on Distributed Computing Platform |
Non-Patent Citations (2)
| Title |
|---|
| 公安部第三研究所: "多摄像机协同关注目标检测跟踪技术", 《多摄像机协同关注目标检测跟踪技术》 * |
| 陆刚等: "机电工程网络信息资源检索与利用", 《机电工程网络信息资源检索与利用》 * |
Cited By (45)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108733654A (en) * | 2018-05-21 | 2018-11-02 | 宁波薄言信息技术有限公司 | A kind of information processing method |
| CN110019742A (en) * | 2018-06-19 | 2019-07-16 | 北京京东尚科信息技术有限公司 | Method and apparatus for handling information |
| CN110019742B (en) * | 2018-06-19 | 2024-05-17 | 北京京东尚科信息技术有限公司 | Method and device for processing information |
| CN108920603A (en) * | 2018-06-28 | 2018-11-30 | 厦门快商通信息技术有限公司 | A kind of customer service bootstrap technique based on customer service machine mould |
| CN108920603B (en) * | 2018-06-28 | 2021-12-21 | 厦门快商通信息技术有限公司 | Customer service guiding method based on customer service machine model |
| CN108932340A (en) * | 2018-07-13 | 2018-12-04 | 华融融通(北京)科技有限公司 | The construction method of financial knowledge mapping under a kind of non-performing asset operation field |
| CN109344174A (en) * | 2018-09-13 | 2019-02-15 | 深圳易投云智能科技有限公司 | Financial analysis method and system |
| CN109559144A (en) * | 2018-09-20 | 2019-04-02 | 欧阳江 | A kind of personalization securities industry customer service system and method |
| CN111353013A (en) * | 2018-12-05 | 2020-06-30 | 中兴通讯股份有限公司 | Method and system for realizing intelligent delivery and reception |
| WO2020114269A1 (en) * | 2018-12-05 | 2020-06-11 | 中兴通讯股份有限公司 | Robo-advisor implementation method and system |
| CN109658271A (en) * | 2018-12-19 | 2019-04-19 | 前海企保科技(深圳)有限公司 | A kind of intelligent customer service system and method based on the professional scene of insurance |
| CN109829041A (en) * | 2018-12-25 | 2019-05-31 | 出门问问信息科技有限公司 | Question processing method and device, computer equipment and computer readable storage medium |
| CN109918489A (en) * | 2019-02-28 | 2019-06-21 | 上海乐言信息科技有限公司 | A kind of knowledge question answering method and system of more strategy fusions |
| CN110807091A (en) * | 2019-03-01 | 2020-02-18 | 王涵 | A hotel intelligent question answering recommendation and decision support analysis method and system |
| CN110807091B (en) * | 2019-03-01 | 2023-08-18 | 王涵 | Hotel intelligent question-answer recommendation and decision support analysis method and system |
| CN111739541A (en) * | 2019-03-19 | 2020-10-02 | 上海云思智慧信息技术有限公司 | A voice-based conference assistance method and system, storage medium and terminal |
| CN110019687B (en) * | 2019-04-11 | 2021-03-23 | 宁波深擎信息科技有限公司 | Multi-intention recognition system, method, equipment and medium based on knowledge graph |
| CN110019687A (en) * | 2019-04-11 | 2019-07-16 | 宁波深擎信息科技有限公司 | A kind of more intention assessment systems, method, equipment and the medium of knowledge based map |
| WO2020221142A1 (en) * | 2019-04-28 | 2020-11-05 | 华为技术有限公司 | Picture book-based question and answer interaction method and electronic device |
| CN110347894A (en) * | 2019-05-31 | 2019-10-18 | 平安科技(深圳)有限公司 | Knowledge mapping processing method, device, computer equipment and storage medium based on crawler |
| CN110443382A (en) * | 2019-07-09 | 2019-11-12 | 青岛聚好联科技有限公司 | A kind of user interaction approach based on intelligent customer service platform |
| CN110543546B (en) * | 2019-07-18 | 2022-03-22 | 华瑞新智科技(北京)有限公司 | Method and device for inquiring hidden danger problems in industrial equipment |
| CN110543546A (en) * | 2019-07-18 | 2019-12-06 | 华瑞新智科技(北京)有限公司 | Method and device for inquiring hidden danger problems in industrial equipment |
| CN110597952A (en) * | 2019-08-20 | 2019-12-20 | 深圳壹账通智能科技有限公司 | Information processing method, server, and computer storage medium |
| CN110727773B (en) * | 2019-10-11 | 2022-02-01 | 沈阳民航东北凯亚有限公司 | Information providing method and device |
| CN110727773A (en) * | 2019-10-11 | 2020-01-24 | 沈阳民航东北凯亚有限公司 | Information providing method and device |
| CN111091454A (en) * | 2019-11-05 | 2020-05-01 | 新华智云科技有限公司 | Financial public opinion recommendation method based on knowledge graph |
| CN110990526B (en) * | 2019-11-21 | 2024-01-30 | 腾讯科技(深圳)有限公司 | Query statement display method and related equipment |
| CN110990526A (en) * | 2019-11-21 | 2020-04-10 | 腾讯科技(深圳)有限公司 | Query statement display method and related equipment |
| CN111309888A (en) * | 2020-02-25 | 2020-06-19 | 百度在线网络技术(北京)有限公司 | Man-machine conversation method, device, electronic equipment and storage medium |
| CN111309888B (en) * | 2020-02-25 | 2023-10-24 | 百度在线网络技术(北京)有限公司 | Man-machine conversation method and device, electronic equipment and storage medium |
| CN111753021A (en) * | 2020-06-17 | 2020-10-09 | 第四范式(北京)技术有限公司 | Method, apparatus, device and readable storage medium for constructing knowledge graph |
| CN112015919A (en) * | 2020-09-15 | 2020-12-01 | 重庆广播电视大学重庆工商职业学院 | Dialogue management method based on learning auxiliary knowledge graph |
| CN112328808A (en) * | 2020-11-03 | 2021-02-05 | 四川长虹电器股份有限公司 | Knowledge graph-based question and answer method and device, electronic equipment and storage medium |
| CN113051934A (en) * | 2021-03-26 | 2021-06-29 | 国网河北省电力有限公司信息通信分公司 | User question and sentence mixing system for processing semantic similarity and word order similarity |
| CN113312489A (en) * | 2021-04-13 | 2021-08-27 | 武汉烽火众智数字技术有限责任公司 | Panorama retrieval system and method based on NLP and graph database |
| CN117157635A (en) * | 2021-04-14 | 2023-12-01 | 三菱电机株式会社 | Learning device, reasoning device, program, learning method, and reasoning method |
| CN113434645A (en) * | 2021-05-26 | 2021-09-24 | 上海声通信息科技股份有限公司 | Financial product intelligence customer service system |
| CN113326367B (en) * | 2021-06-30 | 2023-06-16 | 四川启睿克科技有限公司 | Task type dialogue method and system based on end-to-end text generation |
| CN113326367A (en) * | 2021-06-30 | 2021-08-31 | 四川启睿克科技有限公司 | Task type dialogue method and system based on end-to-end text generation |
| CN114706991A (en) * | 2022-01-27 | 2022-07-05 | 清华大学 | Knowledge network construction method, device, equipment and storage medium |
| CN115114453A (en) * | 2022-06-29 | 2022-09-27 | 中国银行股份有限公司 | Intelligent customer service implementation method and device based on knowledge graph |
| CN117151122B (en) * | 2023-10-30 | 2024-03-22 | 湖南三湘银行股份有限公司 | Bank customer service session question-answering processing method and system based on natural language processing |
| CN117151122A (en) * | 2023-10-30 | 2023-12-01 | 湖南三湘银行股份有限公司 | A method and system for bank customer service conversation question and answer processing based on natural language processing |
| CN119623619A (en) * | 2024-08-16 | 2025-03-14 | 中国科学院深圳先进技术研究院 | Intellectual property and academic assistant system and implementation method, device, electronic device and storage medium thereof |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN107958091A (en) | A kind of NLP artificial intelligence approaches and interactive system based on financial vertical knowledge mapping | |
| CN109271537B (en) | Text-to-image generation method and system based on distillation learning | |
| CN106570708B (en) | Management method and system of intelligent customer service knowledge base | |
| US11017177B2 (en) | Neural network systems and methods for target identification from text | |
| Hammad et al. | An approach for detecting spam in Arabic opinion reviews | |
| CN103049433B (en) | The method of automatic question-answering method, automatically request-answering system and structure question and answer case library | |
| KR102240910B1 (en) | Korean Customer Service Associate Assist System based on Machine Learning | |
| CN109918560A (en) | A kind of answering method and device based on search engine | |
| CN108595696A (en) | A kind of human-computer interaction intelligent answering method and system based on cloud platform | |
| CN113722443B (en) | Label recommendation method and system integrating text similarity and collaborative filtering | |
| CN111159363A (en) | Knowledge base-based question answer determination method and device | |
| CN116881426B (en) | AIGC-based self-explanatory question-answering system | |
| Nanda et al. | A hindi question answering system using machine learning approach | |
| CN111737558A (en) | Information recommendation method and device and computer readable storage medium | |
| CN110909116B (en) | Entity set expansion method and system for social media | |
| CN109460462B (en) | Chinese similarity problem generation system and method | |
| CN118467845B (en) | Method for constructing intelligent interactive service system, website intelligent interactive method and device | |
| CN106951470A (en) | A kind of intelligent Answer System retrieved based on professional knowledge figure | |
| Singh et al. | Sentiment analysis of twitter user data on Punjab legislative assembly election, 2017 | |
| CN110347833B (en) | A Classification Method for Multi-round Dialogue | |
| CN115168615A (en) | Knowledge graph big data processing method and system combining data visualization | |
| CN114281972B (en) | Dialogue control method, system storage medium and server based on subject object tracking and cognitive reasoning | |
| CN116383354A (en) | A graph visualization automatic question answering method based on knowledge graph | |
| Walha et al. | A Lexicon approach to multidimensional analysis of tweets opinion | |
| Alqaraleh | Turkish sentiment analysis system via ensemble learning |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180424 |