Posted by Chris Newell on
This sprint the Discovery Team has been revisiting their earlier work on Nearest Neighbour recommendations, the Data Team has been optimising speaker diarization and identification whilst Experiences have been expanding their work on Audio Augmented Reality
The engine used an approach called k-Nearest Neighbors (KNN) where the recommendations are based on a correlation matrix which describes the similarity between pairs of items. The similarity can be measured in terms of the item attributes (metadata-based filtering) or in terms of the users who have consumed them (collaborative filtering). The idea is that if you consume an item then the engine will recommend the programmes with the highest similarity, which are called the Nearest Neighbours. If you consume multiple items then the engine aggregates the scores of the Nearest Neighbours and returns the items with the highest scores. The advantage of the KNN algorithm over some other approaches is that the recommender model is transparent and easily understood - you can browse the Nearest Neighbour model and see the similarity values.
In the Data Team, Ben has been evaluating different Speaker Diarization methods (which partition an audio stream into segments according to the speaker) to see if he can improve our Speaker Identification and Speech-to-Text (STT) tools. He has looked at LIUM, Sidekit 4 Diarization and Kaldi X-Vectors.
Meanwhile, Matt and Misa have retrained the voice activity detection module for our STT and Speaker ID systems by adding examples from BBC content. This made very little difference to the overall performance which was unexpected! They're now looking at the performance of the Voice Activity Detection on the STT and Speaker ID systems and trying to identify where improvements could be made.
This week the Tellybox project received press coverage in Broadcast, Digital Spy and the Star, following on from the earlier Times article. Libby and Alicia were able to demonstrate Tellybox to the BBC's Chief Technology and Product Officer, Matthew Postgate, when he visited our new building. Libby has also made a demonstration version of Tellybox that runs on a Raspberry Pi, to demonstrate how it might work as a 'Set Top Box'.
Nicky and Henry ran an excellent Audio Augmented Reality workshop with Sound Designers, Ben and Max Ringham where they kicked off their work on prototyping Audio AR experiences for Bose Frames. As we start to investigate Audio AR in depth, we published a series of blogs on the subject:
Out and About
Nicky and George were on BBC Radio 4’s Feedback programme, talking about voice assistants and the BBC.
Henry gave a talk at The Design Museum on the histories and myths of smart speakers.