
© European Union, 2002-2013 | http://europass.cedefop.europa.eu Page 1 / 6

PERSONAL INFORMATION Maurizio Omologo

Via Trento 3, 38049 – Altopiano della Vigolana (Trento), Italy

+39 0461 848870 / +39 0461 314563 +39 334 6209861 / +39 347 7299630

[email protected]

http://shine.fbk.eu Link to Google Scholar Link to LinkedIn

Sex Male | Date of birth July 20th, 1959 | Nationality Italian

RESEARCH INTERESTS AND AREAS OF EXPERTISE

Multi-microphone signal processing

Microphone arrays and distributed microphone networks: Adaptive beamforming; Source separation and extraction; Acoustic echo cancellation; Analysis and simulation of room acoustics.

Acoustic scene analysis: Acoustic event detection and classification; Speaker localization and tracking; Distant-speaker identification.

Multimodality: Audio-video based person tracking.

Automatic Speech Recognition

Front-end processing: Speech enhancement; Speech activity detection; Acoustic features; Pitch extraction.

ASR robustness: Distant-speech interaction; Acoustic modelling; Model adaptation.

Corpora: Design, collection, and annotation of audio and speech material; Automatic segmentation and labelling; Contamination-based simulations; Multi-lingual issues.

ASR applications

Home automation; Robotics; Voice-controlled TV; Meeting transcription; Teleconferencing; Automotive; etc.

Music signal processing

Music information retrieval: Automatic chord segmentation; Beat detection; Singing voice analysis; Cover song identification.

Audio and speech coding

Low and medium bit-rate coding: Analysis-by-synthesis LPC techniques (multi-pulse, CELP, etc.); Sub-band coding; Adaptive predictive coding.

Automatic speech generation

Text-to-speech: Unit-concatenation based speech synthesis; Morphological analysis.

Other digital signal processing application fields

Theory of digital signal processing, and some experience in acoustics, vibrations, and ultrasound.

WORK EXPERIENCE - RESEARCH POSITIONS

March 2008 – Present Senior Researcher Fondazione Bruno Kessler (FBK) - Centro Information Technology - Trento. Head of the SHINE (Speech-acoustic scene analysis and interpretation) research unit (see http://shine.fbk.eu).

Project Manager of the EC – DIRHA consortium (see http://dirha.fbk.eu), from 2012 to 2014: microphone network and distant-speech interaction for smart-home applications. At the final review, the EC rated the project "excellent".

Contribution to an innovation action (FESR – DOMHOS project) that involved two local companies: distant-speech recognition technologies for smart-home and surgery-room contexts, from 2012 to 2014.

Contribution to research activities under the FET Open EC - SCENIC project on environment-aware multi-channel audio processing, from 2009 to 2011.


Research on music signal processing: development of an audio chord estimation system; top performance in the MIREX benchmarks during the last 6 years (see http://www.music-ir.org/mirex/wiki/2013:Audio_Chord_Estimation_Results_Billboard_2013 and http://www.music-ir.org/mirex/results/2015/mirex_2015_poster.pdf).

Project Manager of the EC – DICIT consortium (see http://dicit.fbk.eu) from 2006 to 2009: distant-speech interaction to control a TV, or a set-top box, equipped with a microphone array. At the final review, the EC rated the project "excellent".

Advisor of 15 master theses. Advisor and Tutor of the following PhD students: Federico Flego (now Engineer at Apple, UK); Alessio Brutti (now Researcher at FBK); Maksim Khadkevich (now Software Engineer at Facebook, Menlo Park, US); Francesco Nesta (now Director of Audio and DSP R&D at Synaptics Incorporated, Irvine, US); Abdul Waheed Mohammed (now Postdoc in Hyderabad, India); Mahmoud Fakhry (now Postdoc at Aalborg Universitet, Denmark); Cristina Guerrero Flores (now Software Engineer at University of Trento); Georgina Tryfou (now Mid-senior speech scientist at Omilia, Cyprus); Mirco Ravanelli (currently PhD student at University of Trento). Co-advisor of Xinyuan Qian (PhD student at Queen Mary University, London, UK) and mentor of Luca Giulio Brayda (Team leader at IIT, Genova, Italy).

August 1991 – February 2008, January 1988 – January 1991 Researcher Istituto Trentino di Cultura (ITC) - Istituto per la Ricerca Scientifica e Tecnologica (IRST), Trento.

Leading and management activities

Other relevant contributions

Head of different research units from 1992 to 1997, and of the SHINE unit since 1998.

Project Manager of the EC – DIRHA (Distant-speech Interaction for Robust Home Applications) consortium, from 2012 to 2014.

Project Manager of the EC – DICIT (Distant-talking Interfaces for Control of Interactive TV) consortium, from 2006 to 2009.

Workpackage Leader in ACube - Grandi Progetti PAT, in 2008: sensing algorithms for multi-modal systems.

Workpackage Leader in PAT-Fondo Unico PEACH Project, from 2001 to 2004: acoustic technologies for cultural heritage fruition.

Principal Investigator in EC Project HIWIRE, from 2004 to 2006: speech recognition under very noisy environmental conditions.

Principal Investigator in EC Project CHIL (Computer in the Human Interaction Loop), from 2004 to 2006: microphone networks for acoustic scene analysis in meetings and lectures.

Principal Investigator in PAT-Fondo Unico Project DIPLODOC (DIstributed Processing of LOcal Data for On-line Car services) from 2002 to 2005: spoken dialogue management and speech communication between a car and a remote center.

Principal Investigator in EC Project VICO (Virtual Intelligent Codriver), from 2001 to 2004: speech recognition, understanding and spoken dialogue management for automotive applications.

Contribution to a project supported by a local company (GST - S3, SUR projects Legge 6 PAT), from 2002 to 2006: development of a prototype for automatic dictation based on microphone arrays.

Principal Investigator in a technology-transfer project funded by Atlantis-Vox-Car, from 1998 to 2000: development of an in-car speech recognition system.

Principal Investigator in EC Project SpeechDatCar, from 1998 to 2001: design, collection and annotation of a multi-lingual speech corpus for automotive applications.

Coordinator of a technology-transfer action supported by AETHRA (Ancona, Italy), from 1998 to 2000: development of a real-time speaker localization component for videoconferencing.

Principal Investigator in EC Project VODIS-II, from 1998 to 1999: design, collection and annotation of a multi-lingual speech corpus for automotive applications.

Contribution to a project supported by PNB-PST ELBA, from 1997 to 2000: development of an embedded system for speech recognition in noisy environments.

Principal Investigator in a technology-transfer action funded by CSELT (Torino, Italy), from 1994 to 1995: development of a tool for automatic segmentation and labeling for the design of a text-to-speech system based on the concatenation of units.

Principal Investigator in EC Project DIMUS, from 1991 to 1994: development of a microphone array-based system for surveillance of an underground station.

Research for MAIA Project from 1988 to 1993: speech recognition and synthesis applied to robotics.

January 1991 – July 1991 Internship McGill University, Montreal, Canada. Visiting Scientist at the Faculty of Computer Science: acoustic features and continuous-density HMMs for speaker-independent continuous speech recognition.


July 1984 – December 1987 Junior Researcher CSELT (Centro Studi e Laboratori Telecomunicazioni), Torino, Italy. Research on low-medium bit-rate speech coding for GSM 900 MHz mobile communications, under the COST 207 and the ESPRIT SPIN projects. Participation in CCITT meetings for international standardization.

TEACHING EXPERIENCE

September 2001 - September 2014 Contract Professor at the University of Trento. Taught the “Audio Signal Processing” course (6 credits) in the Master of Science in Telecommunications Engineering (Laurea Magistrale).

INTERNATIONAL/EU PATENTS

Speaker localization and tracking

“Method for location of a speaker and acquisition of a voice message, and related system” - U.S. Patent 5,465,302.

Audio and speech coding

“Method of and device for speech signal coding and decoding by means of a multipulse excitation” - European Patent EP0361432.

“A system for coding audio-band signals” - European Patent EP0396121.

SERVICES FOR SCIENTIFIC JOURNALS, CONFERENCES AND BOARDS

Organization of scientific international events

Special Session Chairman of European Signal Processing Conference - EUSIPCO Conference (Kos, Greece), August 2017.

Technical Area Chairman of IEEE Automatic Speech Recognition and Understanding – ASRU Workshop (Scottsdale, USA), December 2015.

Technical Area Chairman of Analysis of Speech, Audio Signals, Speech Coding, Speech Enhancement - ISCA Interspeech Conference (Lyon, France), August 2013.

Tutorial Chairman - ISCA Interspeech Conference (Florence, Italy), August 2011.

Technical Area Chairman of ASR Robustness and adaptation - ISCA Interspeech Conference (Florence, Italy), August 2011.

Local Chairman - IEEE Automatic Speech Recognition and Understanding – ASRU Workshop (Merano, Italy), December 2009.

General Co-Chairman - IEEE Hands-free Speech Communication and Microphone Arrays - HSCMA Workshop (Trento, Italy), May 2008.

Technical Area Chairman of Audio and Electroacoustics – European Signal Processing Conference - EUSIPCO Conference (Lausanne, Switzerland), August 2008.

Demo Chairman – International Conference on Multimodal Interfaces - ICMI (Trento, Italy), October 2005.

General Co-Chairman - IEEE Automatic Speech Recognition and Understanding – ASRU Workshop (Madonna di Campiglio, Italy), December 2001.

Committee and board memberships

Member of IEEE James L. Flanagan Speech and Audio Processing Award Committee, since 2017.

Elected member of IEEE SPS Speech-Language Technical Committee, from 2014 to 2016.

Member of the steering committee of Associazione Italiana Scienze Vocali (AISV) from 2006 to 2009.

Member of IEEE SPS Speech Technical Committee, from 2003 to 2005.

Editorial activities

Guest Editor of a special issue on Computational Acoustic Scene Analysis for Applied Science - Open Access Journal, 2017-2018.

Editorial Board Member of Acoustics, since 2017.

Associate Editor of IEEE Transactions on Speech and Audio Processing, from 2003 to 2005.

Guest Editor of a special issue on Speech Processing for Natural Interaction with Intelligent Environments, IEEE Journal on Selected Topics in Signal Processing, 2010.

Editor of Language Resources and Evaluation journal – Springer - from 2012 to 2016.


Other activities

Member of commissions for the research assessment of EC projects, international institutions, doctorate schools, and national bodies (more details available if required).

Referee service for many international conferences, workshops, and journals (e.g., IEEE Transactions on Speech and Audio Processing, IEEE Transactions on Signal Processing, Speech Communication, Computer Speech and Language, Journal of the Acoustical Society of America).

Member of program committees, and session chair at many conferences and workshops.

Member of the ICT Doctoral Committee of University of Trento, from 2008 to 2015.

Member of the Doctoral Committee in Electronics, Communications, and Information Technology at University of Bologna, since 2015.

KEYNOTES AND INVITED TALKS

Invited lecture at EAA Winter School 2013, Hot Topics in Acoustics - Cutting Edge in Spatial Audio, AIA-DAGA conference, "Acoustic scene analysis using distributed microphone array networks".

Invited lecture at Human Activity and Vision Summer School (HAVSS), INRIA Sophia Antipolis (2012), "Multi-microphone signal processing for distant-speech interaction".

Plenary Talk at conference "Acoustic scene analysis & distant-talking interfaces for smart indoor environments" - EUSIPCO (Glasgow, Scotland), 2009.

Plenary Talk at conference "Speaker location and acoustic event detection given a distributed microphone network" - IWAENC (Paris, France), 2006.

Tutorial at conference on "Audio-video based person tracking" - IEEE ICASSP (Toulouse, France), 2006.

Keynote Speaker at ESCA-NATO workshop on Robust Speech Recognition for Unknown Communication Channels (Nancy, France), 1997.

Several other invited talks at international events and invited seminars.

EDUCATION AND TRAINING

October 1978 - March 1984 Laurea Degree in Electronic Engineering, 110/110 magna cum laude

University of Padova, Italy. Title of the thesis: “Un analizzatore morfologico a transizioni aumentate” (An augmented-transition morphological analyzer), March 1984.

PERSONAL SKILLS

Mother tongue Italian

Other languages (self-assessment*)

            UNDERSTANDING           SPEAKING                                  WRITING
            Listening   Reading     Spoken interaction   Spoken production
English     C1          C1          C1                   C1                   C1
French      B1          B1          A2                   A2                   A2

Levels: A1/2: Basic user - B1/2: Independent user - C1/2: Proficient user
(*) Common European Framework of Reference for Languages

Computer skills: computer programming, operating systems, numerical analysis, and editing

C, C++, Perl, UNIX Shell scripting. Linux, MacOSX, Windows, Arduino. Competent with most Microsoft Office programmes. Good command of MATLAB and Octave tools. Experience with several software tools for audio and speech editing, annotation, and processing (e.g., Audacity, Praat, SFSWin, Transcriber). Good command of LaTeX.

Other skills and personal interests

Music: playing guitar, bass, and singing. Music production: experience with Logic Pro, Band-in-a-box and other software for professional audio recording, editing, composition.

Sport: basketball, skiing, and jogging.

Driving licence B


SELECTED LIST OF RECENT, OR MORE CITED, PUBLICATIONS

Journals

Ravanelli M., Brakel P., Omologo M., Bengio Y. (2017), Light Gated Recurrent Units for speech recognition, in «IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE», accepted for publication.

Guerrero C., Tryfou G., Omologo M. (2018), Cepstral distance based channel selection for distant speech recognition, in «COMPUTER SPEECH AND LANGUAGE», vol. 47, January 2018, pp. 314-332.

Fakhry M., Svaizer P., Omologo M. (2017), Audio Source Separation in Reverberant Environments Using β-Divergence-Based Nonnegative Factorization, in «IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING», vol. 25, n. 7, 2017, pp. 1462-1476.

Khadkevich M., Omologo M. (2013). Reassigned spectrum-based feature extraction for GMM-based automatic chord recognition, in «EURASIP JOURNAL ON AUDIO, SPEECH AND MUSIC PROCESSING», 2013, pp. 1-12.

A. Brutti, M. Omologo, P. Svaizer (2013). An environment aware ML estimation of acoustic radiation pattern with distributed microphone pairs, in «SIGNAL PROCESSING», vol. 93, n. 4, 2013, pp. 784-796.

Nesta F, Omologo M (2012). Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources. IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, vol. 20, p. 246-260, ISSN: 1558-7916.

F. Nesta, P. Svaizer, M. Omologo (2011). Convolutive BSS of short mixtures by ICA recursively regularized across frequencies. IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, vol. 19, p. 624-639, ISSN: 1558-7916, doi: 10.1109/TASL.2010.2053027.

A. Brutti, L. Cristoforetti, W. Kellermann, L. Marquardt, M. Omologo (2010). WOZ acoustic data collection for interactive TV. LANGUAGE RESOURCES AND EVALUATION, vol. 44, p. 205-219, ISSN: 1574-020X, doi: 10.1007/s10579-010-9116-x.

A. Brutti, M. Omologo, P. Svaizer (2010). Multiple Source Localization Based on Acoustic Map De-Emphasis. EURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSING, vol. 2010, ISSN: 1687-4714, doi: 10.1155/2010/147495.

Matassoni M, Omologo M, Giuliani D, Svaizer P (2002). Hidden Markov model training with contaminated speech material for distant-talking speech recognition. COMPUTER SPEECH AND LANGUAGE, vol. 16, p. 205-223, ISSN: 0885-2308.

Van Den Heuvel H, Boves L, Moreno A, Omologo M, Richard G, Sanders E (2001). Annotation in the SpeechDat Projects. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, vol. 4, p. 127-143, ISSN: 1381-2416.

Omologo M, Svaizer P, Matassoni M (1998). Environmental conditions and acoustic transduction in hands-free speech recognition. SPEECH COMMUNICATION, vol. 25, p. 75-95, ISSN: 0167-6393.

Omologo M, Svaizer P (1997). Use of the crosspower-spectrum phase in acoustic event location. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, vol. 5, p. 288-292, ISSN: 1063-6676.

Brugnara F, Falavigna D, Omologo M (1993). Automatic segmentation and labeling of speech based on Hidden Markov Models. SPEECH COMMUNICATION, vol. 12, p. 357-370, ISSN: 0167-6393.

Recent peer-reviewed international conferences and book chapters

Pertilä P., Brutti A., Svaizer P., Omologo M., Multichannel source activity detection, localization, and tracking - Chapter 4 of the Book: Vincent, E., Virtanen, T., & Gannot, S. (Eds.). (2017). Audio Source Separation and Speech Enhancement. Wiley.

Tryfou G., Omologo M., A reassigned front-end for speech recognition, Proc. of EUSIPCO 2017.

Ravanelli M., Brakel P., Omologo M., Bengio Y., Improving speech recognition by revising gated recurrent units, Proc. of Interspeech 2017.

Ravanelli M., Brakel P., Omologo M., Bengio Y., A network of deep neural networks for distant speech recognition, ICASSP 2017, BEST IBM STUDENT PAPER AWARD.

Qian X., Brutti A., Omologo M., Cavallaro A., 3D audio-visual speaker tracking with an adaptive particle filter, ICASSP 2017.

Fakhry M., Svaizer P., Omologo M., Estimation of the spatial information in Gaussian model based audio source separation using weighted spectral bases, Proceedings of EUSIPCO 2016.

Ravanelli M., Svaizer P., Omologo M., Realistic Multi-Microphone Data Simulation for Distant Speech Recognition, Proceedings of INTERSPEECH 2016.

Guerrero C., Tryfou G., Omologo M., Channel Selection for Distant Speech Recognition Exploiting Cepstral Distance, Proceedings of INTERSPEECH 2016, pp. 1986-1990.

Ravanelli M., Brakel P., Omologo M., Bengio Y., Batch-normalized joint training for DNN-based distant speech recognition, IEEE Workshop on Spoken Language Technology 2016.

M. Fakhry, P. Svaizer, M. Omologo. Audio source separation using a redundant library of source spectral bases for nonnegative tensor factorization, Proceedings of ICASSP 2015, pp. 251-255.

E. Zwyssig, M. Ravanelli, P. Svaizer, M. Omologo. A multi-channel corpus for distant-speech interaction in presence of known interferences, Proceedings of ICASSP 2015, pp. 4480-4484.

M. Ravanelli, M. Omologo. Contaminated speech training methods for robust DNN-HMM distant speech recognition, Proceedings of INTERSPEECH 2015, pp. 756-760.

M. Ravanelli, L. Cristoforetti, R. Gretter, M. Pellin, A. Sosi, M. Omologo. The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments, Proceedings of IEEE-ASRU, 2015, pp. 275-282.

S. Jalalvand, D. Falavigna, M. Matassoni, P. Svaizer, M. Omologo. Boosted acoustic model learning and hypotheses rescoring on the CHiME-3 task, Proceedings of IEEE-ASRU, 2015, pp. 409-415.

M. Ravanelli, M. Omologo. On the selection of the impulse responses for distant-speech recognition based on contaminated speech training, Proceedings of INTERSPEECH 2014, pp. 1028-1032.

L. Cristoforetti, M. Ravanelli, M. Omologo, A. Sosi, A. Abad, M. Hagmueller, P. Maragos. The DIRHA simulated corpus, Proceedings of LREC 2014, pp. 2629-2634.

A. Brutti, M. Ravanelli, P. Svaizer, M. Omologo. A speech event detection and localization task for multiroom environments, Proceedings of 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014, pp. 157-161.


I authorise the handling of my personal data pursuant to the Personal Data Protection Code – Legislative Decree n. 196/2003.

Trento, December 6th, 2017 Maurizio Omologo

Recent (cont)

Selected list of conference papers

C. Guerrero, M. Omologo. Word boundary agreement to combine multi-microphone hypotheses in distant speech recognition, Proceedings of HSCMA, 2014.

C. Guerrero, M. Omologo. Exploiting inter-microphone agreement for hypothesis combination in distant speech recognition, Proceedings of EUSIPCO, 2014, pp. 2385-2389.

G. Tryfou, M. Pellin, M. Omologo. Time-Frequency Reassigned Cepstral Coefficients for Phone-Level Speech Segmentation, Proceedings of EUSIPCO, 2014, pp. 2060-2064.

M. Khadkevich, M. Omologo. Large scale cover song identification based on chord profiles. In: Proc. of 14th International Society for Music Information Retrieval Conference, November 2013.

A. Brutti, M. Omologo. Geometric contamination for GMM/UBM speaker verification in reverberant environments. In: Proc. of Interspeech, 2013, Lyon, France, pp. 791-794.

A.W. Mohammed, M. Matassoni, H.K. Maganti, M. Omologo. Semi-Blind Model Adaptation using Piece-wise Energy Decay Curve for Large Reverberant Environments. In: Proc. of Interspeech, 2012, Portland, USA.

P. Svaizer, A. Brutti, M. Omologo. Environment-aware estimation of the orientation of acoustic sources using a line array. In: EUSIPCO 2012, Bucharest, Romania, pp. 1024-1028.

A.W. Mohammed, M. Matassoni, H.K. Maganti, M. Omologo. Acoustic Model Adaptation Using Piece-wise Energy Decay Curve for Reverberant Environments. In: EUSIPCO 2012, Bucharest, Romania, pp. 365-369.

F. Nesta, M. Omologo. Convolutive underdetermined source separation through weighted interleaved ICA and spatio-temporal source correlation. In: LVA/ICA'12 - 10th International Conference on Latent Variable Analysis and Signal Separation, Tel Aviv, Israel, pp. 222-230.

F. Nesta, M. Omologo. Enhanced multidimensional spatial functions for unambiguous localization of multiple sparse acoustic sources. In: ICASSP 2012, Kyoto, Japan, pp. 213-216.

A. Brutti, M. Omologo, P. Svaizer. Maximum a posteriori trajectory estimation for acoustic tracking. In: IWAENC 2012, Aachen, Germany.

M. Ravanelli, A. Sosi, P. Svaizer, M. Omologo. Impulse response estimation for robust speech recognition in a reverberant environment. In: EUSIPCO 2012, Bucharest, Romania, pp. 1668-1672.

F. Nesta, M. Omologo. Approximated kernel density estimation for multiple TDOA detection. In: ICASSP 2011, Prague, Czech Republic.

P. Svaizer, A. Brutti, M. Omologo. Use of reflected wavefronts for acoustic source localization with a line array. In: HSCMA 2011, Edinburgh, Scotland.

M. Matassoni, H. K. Maganti, M. Omologo. Non-linear Spectro-temporal Modulations for Reverberant Speech Recognition. In: HSCMA 2011, Edinburgh, Scotland, pp. 115-120.

A. Brutti, M. Omologo, P. Svaizer. Inference of acoustic source directivity using environment awareness. In: EUSIPCO 2011, Barcelona, Spain, pp. 151 – 155.

Khadkevich M, Omologo M. Time frequency reassigned features for automatic chord recognition. In: ICASSP, 2011, p. 181-184.

A. Temko, C. Nadeu, D. Macho, R. Malkin, C. Zieger, M. Omologo. Acoustic event detection and classification. In: Computers in the Human Interaction Loop, Springer London, pp. 61-73.

A. Brutti, M. Omologo, P. Svaizer. Comparison between different sound source localization techniques based on a real data collection. In: HSCMA 2008, pp. 69-72.

Temko A, Malkin R, Zieger C, Macho D, Omologo M. CLEAR evaluation of acoustic event detection and classification systems. In: Multimodal Technologies for Perception of Humans. LECTURE NOTES IN COMPUTER SCIENCE, vol. LNCS 4122, p. 311-322, ISSN: 0302-9743.

A. Brutti, M. Omologo, P. Svaizer. Oriented Global coherence field for the estimation of the head orientation in smart rooms equipped with distributed microphone arrays. In: Interspeech 2005, Lisbon, Portugal, p. 2337-2340.

Giuliani D, Matassoni M, Omologo M, Svaizer P. Training of HMM with filtered speech material for hands-free recognition. In: ICASSP 1999, vol. 1, p. 449-45.

Omologo M, Svaizer P, De Mori R. Chapter 2 - Acoustic transduction. In: Spoken Dialogues with Computers, p. 23-67, London: Academic Press, 1998.

Svaizer P, Matassoni M, Omologo M. Acoustic source location in a three-dimensional space using crosspower spectrum phase. In: ICASSP 1997, vol. 1, p. 231-234.

Omologo M, Matassoni M, Svaizer P, Giuliani D. Microphone array based speech recognition with different talker-array positions. In: ICASSP 1997, vol. 1, p. 227-230.

Omologo M, Svaizer P. Acoustic source location in noisy and reverberant environment using CSP analysis. In: ICASSP 1996, vol. 2, p. 921-924.

Omologo M, Svaizer P. Acoustic event localization using a crosspower-spectrum phase based technique. In: ICASSP 1994, vol. 2, p. II-273-II-276.

Angelini B, Brugnara F, Falavigna D, Giuliani D, Gretter R, Omologo M. Speaker independent continuous speech recognition using an acoustic-phonetic Italian corpus. In: ICSLP 1994, p. 1391-1394.

Giuliani D, Omologo M, Svaizer P. Talker Localization and Speech Recognition using a Microphone Array and a Cross-Power Spectrum Phase Analysis. In: Proceedings of International Conference on Spoken Language Processing, ICSLP-94, p. 1243-1246.

Cosi P, Falavigna D, Omologo M. A preliminary statistical evaluation of manual and automatic segmentation discrepancies. In: Eurospeech 1991.