A research project to measure and improve subtitle quality
This is a recent set of work reviewing the problems of subtitling and how they can be overcome on all our platforms
What we're doing
We are examining the issues which impact on the quality of live subtitles for our audience.
We have looked into ways in which we might use language models for individual programme topics to improve the performance of speech to text engines and to detect errors in existing subtitles. We have had some early success modelling weather forecast subtitles which suggests there may be some value in this approach, but it would appear that other topics will be less successful. See White Paper WHP 256: 'Candidate Techniques for Improving Live Subtitle Quality' for more details.
We have carried out a ground-breaking study into the relative impact of subtitle delay and subtitle accuracy. This work required the development of new test methodologies based on industry standards for measuring audio quality. A user study was carried out in December 2012 with a broad sample of people who regularly use subtitles when watching television (photo above). The results were presented at IBC2013 in September and are available as White Paper WHP259: 'The Development of a Methodology to Evaluate the Perceived Quality of Live TV Subtitles'.
We have started running quarterly audience surveys to provide background data on the level of use of subtitles and how people are using them and what issues they have. And we have also started building an automatic monitoring tool to allow us to track long term trends with issues that we can measure, such as position and reading rate, as originally outlined in White Paper WHP 255: 'Measurement of Subtitle Quality: an R&D Perspective'.
More recently we have been exploring ways to take the live broadcast subtitles and carry out automatic post-processing to remove the original delay and improve the formatting. Early results are promising and we have been talking to the iPlayer team about the potential for this work. We are also looking at how live subtitles could be realigned and reformatted during streaming.
Over the coming months we will also be doing research with an aim to developing guidelines for the display of subtitles on devices like tablets and mobile phones and where video is displayed within a web page.
Getting people excited about our user-centred approach to subtitles...
Handout produced for IBC 2013
An outline of the projects and papers that BBC Research & Development are taking to IBC 2013
Candidate Techniques for Improving Live Subtitle Quality
Measurement of Subtitle Quality: an R&D Perspective
More project info
Why it matters
At least 7 million people use subtitles regularly and mostly for reasons other than hearing difficulties. This is a large audience for subtitles. Whilst the quality of subtitling for pre-recorded programmes is very good, subtitling for live programmes faces problems of accuracy and delay.
The delay in the arrival of the subtitles is a particular problem for people in our audience with hearing difficulties as they are watching with the sound on and using the text to supplement their understanding. These people will often turn the subtitles off if they are late as they are too confusing.
For people watching without sound the delay isn't quite as bad, but because the subtitles are their only source of information the accuracy of the subtitles is most important.
We are aiming to contribute to improvements to subtitling quality and availability, for broadcast, on demand, streamed and web content over the coming years.
How it works
We are using speech recognition and language modelling tools to look at processes that can be used to realign and reformat subtitles for later streaming. We are also carrying out user research to measure the impact of various issues on the perceived quality of subtitles.
We have published papers on our work and are talking to colleagues in BBC Future Media about using our approach in iPlayer and other web video.
We demonstrated some of our work at IBC2013, 12th to 17th September in Amsterdam
An R&D White Paper based on the IBC paper is now online.