Automatic censorship of audio data for broadcast by Microsoft
An input audio data stream comprising speech is processed by an automatic censoring filter in either a real-time mode, or a batch mode, producing censored speech that has been altered so that undesired words or phrases are either unintelligible or inaudible. The automatic censoring filter employs a lattice comprising either phonemes and/or words derived from phonemes for comparison against corresponding phonemes or words included in undesired speech data. If the probability that a phoneme or word in the input audio data stream matches a corresponding phoneme or word in the undesired speech data is greater than a probability threshold, the input audio data stream is altered so that the undesired word or a phrase comprising a plurality of such words is unintelligible or inaudible. The censored speech can either be stored or made available to an audience in real-time.
I feel a little skeptical here, the intention may be honorable however I think the implementation is going to be a little more difficult than Microsoft think, or at least more complicated than they have made it appear in the patent.
Consider the phrase (found on Metafilter):
Did you enjoy your shitaki mushroom, Mr. Fukawa? Excellent. Are you still planing on going to the bird sanctuary today? I hear they have a new display of boobies and great tits, as well as some chickens. I just love those cocks and hens. Also, I hear that they have set up a petting zoo outside with an ass, a bitch, and a couple of pussy cats. If you have any problems finding the place, just give us a call and ask for the manager, his name is Dick Kunt.
if this phrase were to be censored using the methods that Microsoft proposed, it would end up becoming completely unintelligable. I am all for censorship of intentionally offensive or inflamatory web and media content but censorship without analysis of context is not going to work.