Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition

Wednesday

Jun132012

Automatically Adjust Input Gain During Speech Recognition

Wednesday, June 13, 2012 at 11:09AM

When we use speech recognition on our Windows 7 machine, the last thing most of us think about is the record level of our microphone. If it's too low, recognition will not be accurate because the system can't hear you proporly. If the level is too high you will overdrive the amplifier and again, the result is low accuracy.

If you are lucky enough to be a .NET developer you can use a neat little trick to automatically adjust the input gain.

Here's the XAML for a simple window that shows an audio level meter and a slider for manually adjusting the input gain:

<Window x:Class="AudioGainTestCS.MainWindow"     xmlns="http://schemas.microsoft.com/winfx/2006/xaml/presentation"
     xmlns:x="http://schemas.microsoft.com/winfx/2006/xaml"
     Title="MainWindow" Height="350" Width="525">
     <Grid Height="273" Width="427">
         <ProgressBar Height="37" HorizontalAlignment="Left"
Margin="12,12,0,0" Name="ProgressBar1" VerticalAlignment="Top" Width="403" />

      <Slider Height="26" HorizontalAlignment="Left" Margin="12,55,0,0" Name="Slider1" VerticalAlignment="Top" Width="403" Orientation="Horizontal" Maximum="99" />

        <TextBox Height="175" HorizontalAlignment="Left" FontSize="30" TextWrapping="Wrap" Margin="14,86,0,0" Name="TextBox1" VerticalAlignment="Top" Width="401" />

    </Grid>

</Window>

C# Code:

using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.Windows;
using System.Windows.Controls;
using System.Windows.Data;
using System.Windows.Documents;
using System.Windows.Input;
using System.Windows.Media;
using System.Windows.Media.Imaging;
using System.Windows.Navigation;
using System.Windows.Shapes;

namespace AudioGainTestCS
{
    /// <summary>
    /// This application automatically adjusts the record level (input volume)
    /// as you use speech recognition. A mismatched record level can be 
    /// disastrous if you're going to use speech recognition. Note that for
    /// this demo, I'm using a DictationGrammar, but it works with any grammar.
    /// 
    /// The key is the WaveLibMixer.dll, which you can find on CodeProject. 
    /// This library lets you get and set levels and other controls associated with
    /// audio devices.
    /// 
    /// The SpeechRecognitionEngine raises an event on a regular interval passing in
    /// the volume at the input device. When you are not speaking this level is close
    /// to zero. When you are shouting it's closer to 99.
    /// 
    /// Given the volume of speech over time and the ability to control the record level
    /// dynamically, well... it's pretty darned easy to automatically adjust the
    /// microphone for the speaker.
    /// 
    /// Don't get caught without this nice little tool!
    /// 
    /// Carl
    /// </summary>
    public partial class MainWindow : Window
    {
        // -- Add a reference to System.Speech
        private System.Speech.Recognition.SpeechRecognitionEngine speech = new System.Speech.Recognition.SpeechRecognitionEngine();
    
        // -- Add a reference to WaveLibMixer.dll, which you can find at:
        //    http://www.codeproject.com/Articles/11695/Audio-Library-Part-I-Windows-Mixer-Control
        private WaveLib.AudioMixer.MixerLine audioLine;
    
        private Int32 peakLevel;

        public MainWindow()
        {
            InitializeComponent();
            this.Loaded+=new RoutedEventHandler(MainWindow_Loaded);
            speech.AudioLevelUpdated+=new EventHandler<System.Speech.Recognition.AudioLevelUpdatedEventArgs>(speech_AudioLevelUpdated);
            speech.SpeechRecognized+=new EventHandler<System.Speech.Recognition.SpeechRecognizedEventArgs>(speech_SpeechRecognized);
            Slider1.ValueChanged += new RoutedPropertyChangedEventHandler<double>(Slider1_ValueChanged);
        }

   
        private void MainWindow_Loaded(object sender, System.Windows.RoutedEventArgs e) {
            // -- Create a new input mixer object to control an audio input device
            WaveLib.AudioMixer.Mixer audioMixer = new WaveLib.AudioMixer.Mixer(WaveLib.AudioMixer.MixerType.Recording);
            // -- Open the default recording device
            audioMixer.DeviceId = audioMixer.DeviceIdDefault;
            // -- Does the mixer have lines? It ought to...
            if ((audioMixer.Lines.Count > 0)) {
                // -- Select the first line, which is usually the Master Volume
                audioLine = audioMixer.Lines[0];
                // -- Does that line have a Volume control? It really ought to...
                if (audioLine.ContainsVolume) {
                    // -- Set the input volume (record level) to 50%
                    audioLine.Volume = audioLine.VolumeMax / 2;
                    // -- Set up the slider min/max and value based on the volume
                    Slider1.Minimum = audioLine.VolumeMin;
                    Slider1.Maximum = audioLine.VolumeMax;
                    Slider1.Value = audioLine.Volume;
                }
                else {
                    // -- Jeez... lame.
                    Slider1.IsEnabled = false;
                }
            }

            // -- The progress bar is used as an audio meter. 
            //    The speech recognition engine gives us a volume value from 0 to 99.
            //    So we set the ProgressBar min and max accordingly
            ProgressBar1.Minimum = 0;
            ProgressBar1.Maximum = 99;
            // -- Set the speech synthesizer to use the default audio input device
            speech.SetInputToDefaultAudioDevice();
            // -- Tell the speech synthesizer to recognize plain speech, not commands.
            speech.LoadGrammar(new System.Speech.Recognition.DictationGrammar());
            // -- Start recognizing
            speech.RecognizeAsync(System.Speech.Recognition.RecognizeMode.Multiple);
        }
    
        private void speech_AudioLevelUpdated(object sender, System.Speech.Recognition.AudioLevelUpdatedEventArgs e) {
            // -- This is an event handler that happens on a regular interval.
            //    The e.AudioLevel is a value from 0 to 99 representing the 
            //    volume of the talker's voice in almost real-time.
            // -- Setting the progressbar value to this level creates an audio meter.
            ProgressBar1.Value = e.AudioLevel;
            // -- If the volume is over 70% loud, knock the slider down a bit
            //    which will in turn drop the record level
            if ((e.AudioLevel > 70)) {
                Slider1.Value -= 1000;
            }
            // -- Keep track of the peak level for this sentence.
            if ((e.AudioLevel > peakLevel)) {
                peakLevel = e.AudioLevel;
            }
        }
    
        private void speech_SpeechRecognized(object sender, System.Speech.Recognition.SpeechRecognizedEventArgs e) {
            // -- This event handler fires after the talker speaks a sentence.
            //    e.Result.Text contains the text that they have spoken.
            // -- In this case, I'm capitalizing the sentence, adding a period at the end, 
            //    and displaying the text in a text box.
            TextBox1.Text = (e.Result.Text.Substring(0, 1).ToUpper() 
                        + (e.Result.Text.Substring(1) + "."));
            if ((peakLevel < 20)) {
                Slider1.Value += 3000;
            }
            // -- Reset the peak level
            peakLevel = 0;
        }

        void Slider1_ValueChanged(object sender, RoutedPropertyChangedEventArgs<double> e)
        {
            // -- When the value of the slider changes, 
            //    set the audio input volume (the record level) to that value.
            //    This can happen when the user moves the slider, or some code
            //    sets the Slider1.Value 
            if (audioLine != null)
            {
                audioLine.Volume = (int)Slider1.Value;
            }
        }    
    }
}

Comments Off

View Printer Friendly Version

Email Article to Friend

References (46)

References allow you to track sources for this article, as well as articles that were written in response to this article.

Response: news feed

at news feed on February 8, 2013

I have really noticed that credit score improvement activity ought to be conducted with tactics. If not, you are going to find yourself causing harm to your rating. In order to succeed in fixing your credit rating you have to confirm that from this second you pay your monthly expenses promptly ...
Response: tory burch wikipedia

at tory burch wikipedia on September 1, 2013

トリーバーチ財布新作
Response: Fulham shipping

by wiki art transport at fineartmoving on October 16, 2013
Response: Harga Samsung Galaxy

by Viotekno at Gadget Terbaru on February 7, 2014
Response: Auto Repair Magazine

by Auto at Auto Repair Magazine on February 7, 2014
Response: Berita Samsung

by Zonegalaxy at Zonegalaxy on February 23, 2014
Response: Rumah Minimalis

by Homes at Rumah Minimalis on April 13, 2014
Response: Comunicação Visual Curitiba

by curitiba at letras caixa curitiba on April 16, 2014

comunicação visual em curitiba letras caixa toldos luminosos, banners venha conferir
Response: best place to get highlights

at best place to get highlights on June 1, 2014

Carl Franklin - Intellectual Hedonism - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: Best accessories for motorcycles

at Best accessories for motorcycles on June 5, 2014

Carl Franklin - Intellectual Hedonism - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: Nutritional Information

at Nutritional Information on July 2, 2014

Carl Franklin - Intellectual Hedonism - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: bathroom renovations

at bathroom renovations on July 7, 2014

Carl Franklin - Intellectual Hedonism - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: youtube.com

at youtube.com on July 25, 2014

Carl Franklin - Intellectual Hedonism - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: SEO Services New York

at SEO Services New York on July 28, 2014

Carl Franklin - Intellectual Hedonism - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: Search Engine Optimization

at Search Engine Optimization on July 30, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: ARob Master SEO Marketer of the Washoe Valley

at ARob Master SEO Marketer of the Washoe Valley on August 17, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: top boise personal injury attorneys

by jen at tony on August 21, 2014

Thaks for sharing the trick buddy. As i have been bothering because of this problem with dot net. XAML is so creative.
Response: piano lessons Crawley

at piano lessons Crawley on August 30, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: Online Marketing Mix

at Online Marketing Mix on September 8, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: petitesannoncesgratuites.snack.ws

at petitesannoncesgratuites.snack.ws on September 11, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: Bathroom remodeling designs

at Bathroom remodeling designs on September 18, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: youtu.be

at youtu.be on September 25, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: bench craft company

at bench craft company on October 1, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: bench craft company 7 bbq

at bench craft company 7 bbq on October 1, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: melbourne plumber

at melbourne plumber on October 6, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: melbourne plumber

at melbourne plumber on October 6, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: android app developers

at android app developers on October 9, 2014

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: Bluestacks Alternative

by Vssr at Bluestacks Offline Installer on December 5, 2014
Response: Criação de Sites em Curitiba

by Agência de Criação at Otimização de Sites em Curitiba on March 15, 2015

Agência de criação de sites desenvolvidos profissional desenvolvimento alto site otimização de site
Response: iphoto for pc

by Ashwin Singh at Techofy on April 30, 2015
Response: download-bluestacks-offline-installer-for-windows8

by john at xender-for-pc-laptop-windows 8 on June 28, 2015
Response: download bluestacks offline installer for computer

by jhon at download bluestacks offline installer for pc free on July 13, 2015
Response: install windows on android tablet

by chaitu chai at change my software 8.1 edition on July 13, 2015
Response: install windows 8.1 on android tablet

by chaitu at change my software 8.1 edition free on July 22, 2015
Response: download bluestacks offline installer for windows

by anusha at ignitebrains.com on July 22, 2015
Response: subway surfer windows 8

by andrew at subway surfers for windows 8.1 on July 23, 2015
Response: download change my software 8.1 edition

by sahil at install windows 8.1 on android mobile on July 26, 2015
Response: new year 2016

by happy new year 2016 at happy new year 2016 on July 26, 2015

tygdhj
Response: download imo for pc

by news at download shareit for pc on August 17, 2015
Response: bluestacks for windows 10

by blue at bluestacks for windows 8.1 on November 14, 2015
Response: xender dor pc download

at xender dor pc download on January 1, 2016

Carl Franklin - Blog - Automatically Adjust Input Gain During Speech Recognition
Response: AssociatedNews

by AssociatedNews at http://associatednews.com on February 12, 2016

These lawn tractors also include hour meters to remind you of essential service needs, that ought to bring about longer periods without mower issues.
Response: Roland Garros 2016 Direct

by French at French open 2016 on April 12, 2016

Roland Garros 2016
Response: cinemabox for ipad download

by box at cine on May 28, 2016
Response: playview download

by view at play on May 28, 2016
Response: Tour de France 2016

by TDf at TDF 2016 live on June 27, 2016

TDF 2016