fr zh_cn

Research

1   Previous research themes

1.1   Postdoctorate

Speaker Vérification

Participation to the NIST SRE 2008 evaluation campaign Based on the Alizé library developped in the Avignon Laboratory (LIA).

Description of a baseline speaker verification system based on GMM for NIST SRE campaigns : Speaker Verification

Speaker recognition and tracking in audiovisual documents


Attention, in this example, the on screen information is limited to Jacques Chancel as he is the main researched speaker.

Automatic dialogue in real and virtual worlds

  • Job interview
  • Distress expression

Involved in several projects

  • InfoM@gic : pôle de compétitivité Iles-de-France Cap-Digital project, SP5
  • K-Space : international project, Excellence Network
  • Myblog3D : ANR project, COST 2102
  • CompanionAble : international project

Participation to a plenary session of the e-forensic 2009 conference.

Supervisor

Gérard Chollet (DR CNRS)

1.2   Ph.D. Thesis

Thesis title

Local and frame-synchronous confidence measures for automatic speech recognition

Phd. Supervisors

Jean-Paul Haton (Professor) and Odile Mella (Associate Professor)

Date of defense

October 9th 2007

Thesis

la these

Abstract

In automatic speech recognition, confidence measures aim at estimating the confidence we can give to a result (phone, word, sentence) provided by the speech recognition engine; for example, the contribution of the confidence measure allows to highlight the misrecognized or out-of-vocabulary words.

In this thesis, we propose several confidence measures which are able to provide this estimation for applications using large vocabulary and on-the-fly recognition, as keyword indexation, broadcast news transcription, and live teaching class transcription for hard of hearing children.

In this framework, we have defined two types of confidence measures. The first, based on likelihood ratio, are frame-synchronous measures which can be computed simultaneously with the recognition process of the sentence. The second ones are based on an estimation of the posterior probability limited to a local neighborhood of the considered word, and need only a short delay before being computed on the sub word graph extracted from the recognition process.

These measures were assessed and compared to a state-of-the-art one, which is also based on posterior probability but which requires the recognition of the whole sentence. Two evaluations were performed on a real broadcast news corpus provided by the ESTER campaign. The first one used the Equal Error Rate criterion in an automatic transcription task. The second evaluation was performed in a keyword spotting task. We achieved performance close to our reference measure with our local measures and a delay of less than one second.

We also integrated one of our frame-synchronous measures in the decoding process of the recognition engine in order to improve the solution provided by the system and then to decrease the word error rate. We achieved to decrease the word error rate of around 1%.

Moreover, one of our confidence measure acheived to improve comprehension of hard of hearing children.



Thesis committee

  • Jean-François Bonastre - Pr. Avignon University - LIA/CERI
  • Gérard Chollet - DR CNRS - ENST/TSI Paris
  • Laurant Besacier - Joseph Fourier University - CLIPS Grenoble
  • René Schott - Pr. Henri Poincaré Nancy I University- LORIA/IECN
  • Jean-Paul Haton - Pr. Henri Poincaré Nancy I University - LORIA
  • Odile Mella - Henri Poincaré Nancy I University- LORIA


Host laboratory

SPEECH team (http://parole.loria.fr) - LORIA INRIA Lorraine) - Henri Poincaré Nancy I university, France

"Logo du Loria" "Logo de l'INRIA" "Logo de l'université de Nancy 1" "Logo de la région Lorraine"




1.3   Master Thesis

Title

Speech/Music segmentation for radio broadcast programs.

Mémoire de DEA

Supervisor

Nathalie Parlangeau-Vallès


Host laboratory

SPEECH Team (http://parole.loria.fr) - LORIA ( INRIA Lorraine) - Nancy I university, France

"Logo du Loria" "Logo de l'université de Nancy 1"