Paper at SIGIR 2008 — Term Clouds as Surrogates for User Generated Speech

Abstract¶

User generated spoken audio remains a challenge for Automatic Speech Recognition (ASR) technology and content-based audio surrogates derived from ASR-transcripts must be error robust. An investigation of the use of term clouds as surrogates for podcasts demonstrates that ASR term clouds closely approximate term clouds derived from human-generated transcripts across a range of cloud sizes. A user study confirms the conclusion that ASR-clouds are viable surrogates for depicting the content of podcasts.

References¶

[1] Manos Tsagkias, Martha Larson, and Maarten de Rijke. 2008. Term clouds as surrogates for user generated speech. In Proceedings of the 31^st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR ‘08). Association for Computing Machinery, New York, NY, USA, 773–774. ACM Link PDF

I took part in SIREN 2008, a research event in the Netherlands, presenting our work with Martha Larson and Maarten de Rijke, on Term Clouds as Surrogates for User Generated Speech [1]; see post

References

[1] Manos Tsagkias, Martha Larson, and Maarten de Rijke. 2008. Term clouds as surrogates for user generated speech. In Proceedings of the 31^st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR ‘08). Association for Computing Machinery, New York, NY, USA, 773–774. ACM Link PDF

65.00% similar — Talk at SIREN 2008 on Speech Term Clouds
References

[1] Marguerite Fuller, Manos Tsagkias, Eamonn Newman, Jana Besser, Martha Larson, Gareth J.F. Jones, and Maarten de Rijke. 2008. Using Term Clouds to Represent Segment-Level Semantic Content of Podcasts. In Proceedings of the 2^nd SIGIR Workshop on Searching Spontaneous Conversational Speech (SSCS 2008). UvA Link PDF

62.20% similar — Paper at SSCS 2008 — Using Term Clouds to Represent Segment-Level Semantic Content of Podcasts
Abstract

We focus on improving the effectiveness of a Virtual Assistant (VA) in recognizing emerging entities in spoken queries. We introduce a method that uses historical user interactions to forecast which entities will gain in popularity and become trending, and it subse- quently integrates the predictions within the Automated Speech Recognition (ASR) component of the VA. Experiments show that our proposed approach results in a 20% relative reduction in errors on emerging entity name utterances without degrading the overall recognition quality of the system.

Happy to share the news about my first joint pubication with the Siri Speech team at Apple. Our short paper Predicting Entity Popularity to Improve Spoken Entity Recognition by Virtual Assistants with Christophe van Gysel, myself, Ernie Pusateri, and Ilya Oparin, is accepted at SIGIR 2020.

32.39% similar — Paper at SIGIR 2020 — Predicting Entity Popularity to Improve Spoken Entity Recognition by Virtual Assistants
References

[1] Manos Tsagkias, Martha Larson, Wouter Weerkamp, and Maarten de Rijke. 2008. PodCred: a framework for analyzing podcast preference. In Proceedings of the 2^nd ACM Workshop on Information Credibility On the Web (WICOW ‘08). Association for Computing Machinery, New York, NY, USA, 67–74. ACM link PDF

28.52% similar — Paper at WICOW 2008 — PodCred: A Framework for Analyzing Podcast Preference
References

[1] Manos Tsagkias, Martha Larson, and Maarten Rijke. 2009. Exploiting Surface Features for the Prediction of Podcast Preference. In Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval (ECIR ‘09). Springer-Verlag, Berlin, Heidelberg, 473–484. ACM Link PDF

28.47% similar — Paper at ECIR 2009 — Exploiting Surface Features for the Prediction of Podcast Preference
Happy to share yet another publication with the Siri Speech team at Apple, this time led by Sashank Gondala, who interned with us last year. Our full paperError-driven Pruning of Language Models for Virtual Assistants is accepted at ICASSP 2021.

22.77% similar — Paper at ICASSP 2021 — Error-driven Pruning of Language Models for Virtual Assistants
I moved to Barcelona in the first week of September for a three-month internship at Yahoo! Research Labs. I’m very excited about it, and I’m looking forward to getting to know the people here and the problems they are working on.

Update: My work at Yahoo! Research Labs resulted in a publication at SIGIR 2012 [1]; see post for the abstract.

References

[1] Manos Tsagkias and Roi Blanco. 2012. Language intent models for inferring user browsing behavior. In Proceedings of the 35^th international ACM SIGIR conference on research and development in information retrieval (SIGIR ‘12). Association for Computing Machinery, New York, NY, USA, 335–344. ACM Link. PDF

19.15% similar — Internship at Yahoo! Research Labs

Paper at SIGIR 2008 — Term Clouds as Surrogates for User Generated Speech

Manos Tsagkias, Martha Larson, and Maarten de Rijke

University of Amsterdam

20 June 2008

Keywords: paper, speech, information retrieval, sigir

Abstract¶

References¶

Abstract¶

References¶

Related Posts

References

References

Abstract

References

References

References