Song et al., 2006

Speaker attention system for mobile robots using microphone array and face tracking

Document ID: 17960305091201117657
Author: Song K; Hu J; Tsai C; Chou C; Cheng C; Liu W; Yang C
Publication year: 2006
Publication venue: Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006.

Snippet

This paper presents a real-time human-robot interface system (HRIS), which processes both speech and vision information to improve the quality of communication between a human and an autonomous mobile robot. The HRIS contains a real-time speech attention system and a …

Classifications

- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
        - G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
          - G10L21/0208—Noise filtering
            - G10L21/0216—Noise filtering characterised by the method used for estimating noise
              - G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
                - G10L2021/02166—Microphone arrays; Beamforming
      - G10L15/00—Speech recognition
        - G10L15/08—Speech classification or search

Similar Documents

Publication	Publication Date	Title
US7536029B2 (en)	2009-05-19	Apparatus and method performing audio-video sensor fusion for object localization, tracking, and separation
CN111370014B (en)	2024-05-28	System and method for multi-stream object-speech detection and channel fusion
Okuno et al.	2015	Robot audition: Its rise and perspectives
Sasaki et al.	2006	Multiple sound source mapping for a mobile robot by self-motion triangulation
EP1643769B1 (en)	2009-12-23	Apparatus and method performing audio-video sensor fusion for object localization, tracking and separation
Chung et al.	2019	Who said that?: Audio-visual speaker diarisation of real-world meetings
US8538751B2 (en)	2013-09-17	Speech recognition system and speech recognizing method
Yamamoto et al.	2005	Enhanced robot speech recognition based on microphone array source separation and missing feature theory
Grondin et al.	2019	Sound event localization and detection using CRNN on pairs of microphones
Nakamura et al.	2011	Intelligent sound source localization and its application to multimodal human tracking
Ince et al.	2009	Ego noise suppression of a robot using template subtraction
Yamamoto et al.	2004	Improvement of robot audition by interfacing sound source separation and automatic speech recognition with missing feature theory
WO2007138503A1 (en)	2007-12-06	Method of driving a speech recognition system
Song et al.	2006	Speaker attention system for mobile robots using microphone array and face tracking
KR100822880B1 (en)	2008-04-17	Speaker Recognition System and Method through Audio-Video Based Sound Tracking in Intelligent Robot Environment
JP2006251266A (en)	2006-09-21	Audiovisual linkage recognition method and apparatus
Ince et al.	2011	Assessment of general applicability of ego noise estimation
Brueckmann et al.	2007	Adaptive noise reduction and voice activity detection for improved verbal human-robot interaction using binaural data
CN110992971A (en)	2020-04-10	Method for determining voice enhancement direction, electronic equipment and storage medium
KR20190059381A (en)	2019-05-31	Method for Device Control and Media Editing Based on Automatic Speech/Gesture Recognition
Kim et al.	2007	Auditory and visual integration based localization and tracking of humans in daily-life environments
Abutalebi et al.	2011	Performance improvement of TDOA-based speaker localization in joint noisy and reverberant conditions
Kim et al.	2024	A real-time sound source localization system for robotic vacuum cleaners with a microphone array
Wang et al.	2001	Real-time automated video and audio capture with multiple cameras and microphones
Petsatodis et al.	2009	Voice activity detection using audio-visual information