Safari et al., 2024 - Google Patents
Data augmentation and preparation process of PerInfEx: a Persian chatbot with the ability of information extractionSafari et al., 2024
View PDF- Document ID
- 5497521393423090027
- Author
- Safari P
- Shamsfard M
- Publication year
- Publication venue
- IEEE Access
External Links
Snippet
In this paper, we describe data preparation for our proposed chatbot PerInfEx (Persian Information Extraction chatbot). It aims to interactively chit-chat with users in Persian and by asking the least number of direct questions, extract as much personal information as …
- 238000002360 preparation method 0 title abstract description 29
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G06F17/30684—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G06F17/2881—Natural language generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/274—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
- G09B19/06—Foreign languages
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Nasution et al. | Chatgpt label: Comparing the quality of human-generated and llm-generated annotations in low-resource language nlp tasks | |
| Elnagar et al. | Systematic literature review of dialectal Arabic: identification and detection | |
| Bashir et al. | Arabic natural language processing for Qur’anic research: a systematic review | |
| Hu et al. | Ocnli: Original chinese natural language inference | |
| Zitouni | Natural language processing of semitic languages | |
| Nayak et al. | To Plan or not to Plan? Discourse Planning in Slot-Value Informed Sequence to Sequence Models for Language Generation. | |
| Banea et al. | Sense-level subjectivity in a multilingual setting | |
| Blinova et al. | A hybrid model of complexity estimation: Evidence from Russian legal texts | |
| JP2006505076A (en) | Method and apparatus for knowledge system | |
| Malmi et al. | Automatic prediction of discourse connectives | |
| Safari et al. | Data augmentation and preparation process of PerInfEx: a Persian chatbot with the ability of information extraction | |
| Hayashibe | Japanese realistic textual entailment corpus | |
| Bade et al. | Hope Speech in Social Media Texts using Transformer. | |
| Cardenas et al. | ‘Don’t Get Too Technical with Me’: A Discourse Structure-Based Framework for Automatic Science Journalism | |
| Imperial et al. | Application of lexical features towards improvement of Filipino readability identification of children's literature | |
| Fahad et al. | Answer agnostic question generation in Bangla Language | |
| Pho | Linguistic realizations of rhetorical structure: A corpus-based study of research article abstracts and introductions in applied linguistics and educational technology | |
| Bonilla | Spoken Spanish PoS tagging: gold standard dataset | |
| Asif et al. | Bidirectional encoder approach for abstractive text summarization of Urdu language | |
| Amezian et al. | Training an LSTM-based Seq2Seq model on a Moroccan biscript lexicon | |
| Scholman et al. | DiscoNaija: a discourse-annotated parallel Nigerian Pidgin-English corpus | |
| Marfani et al. | Analysis of learners’ sentiments on MOOC forums using natural language processing techniques | |
| Behera | Odia parts of speech tagging corpora: suitability of statistical models | |
| Su et al. | Metaphor generation based on noval evaluation method | |
| Sabty | Computational approaches to Arabic-English code-switching |