This paper presents a method to reconstruct the 3D structure of generic convex rooms from sound signals. Differently from most of the previous approaches, the method is fully uncalibrated in the sense that no knowledge about the microphones and sources position is needed. Moreover, we demonstrate that it is possible to bypass the well known echo labelling problem, allowing to reconstruct the room shape in a reasonable computation time without the need of additional hypotheses on the echoes order of arrival. Finally, the method is intrinsically robust to outliers and missing data in the echoes detection, allowing to work also in low SNR conditions. The proposed pipeline formalizes the problem in different steps such as TOA estimation, microphones and sources localization and walls estimation. After providing a solution to these different problems, we present a global optimization approach that links together all the problems in a single optimization function. The accuracy and robustness of the method is assessed on a wide set of simulated setups and in a challenging real scenario. Compared to previous approaches, the method shows relevant improvements in terms of precision and scalability. Moreover, we make freely available for a challenging dataset for 3D room reconstruction with accurate ground truth in a real scenario.

Uncalibrated 3D room geometry estimation from sound impulse responses

Trucco, Andrea;
2017

Abstract

This paper presents a method to reconstruct the 3D structure of generic convex rooms from sound signals. Differently from most of the previous approaches, the method is fully uncalibrated in the sense that no knowledge about the microphones and sources position is needed. Moreover, we demonstrate that it is possible to bypass the well known echo labelling problem, allowing to reconstruct the room shape in a reasonable computation time without the need of additional hypotheses on the echoes order of arrival. Finally, the method is intrinsically robust to outliers and missing data in the echoes detection, allowing to work also in low SNR conditions. The proposed pipeline formalizes the problem in different steps such as TOA estimation, microphones and sources localization and walls estimation. After providing a solution to these different problems, we present a global optimization approach that links together all the problems in a single optimization function. The accuracy and robustness of the method is assessed on a wide set of simulated setups and in a challenging real scenario. Compared to previous approaches, the method shows relevant improvements in terms of precision and scalability. Moreover, we make freely available for a challenging dataset for 3D room reconstruction with accurate ground truth in a real scenario.
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0016003217305458-main.pdf

accesso chiuso

Descrizione: Articolo definitivo pubblicato nella rivista
Tipologia: Documento in versione editoriale
Dimensione 2.88 MB
Formato Adobe PDF
2.88 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11567/889354
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 9
social impact