September 10, 2008 marks the most amazing day in my life’s Speech Signal Processing history!
And im not exaggerating… Fed and myself attended the talk of Dr. Alex Waibel, a professor from CMU (Pittsburgh) and University of Karlsruhe (Germany) - who’s a genius in speech signal processing!
Fed and I couldnt stop gushing about Dr. Waibel’s demos! They are just WOW! Seriously, im running out of words to describe them! I think the reason Fed and I are overwhelmed this much is because what he presented are something we’ve been trying to do in the UP DSP lab (and yes, we’re inching along, but hey, we’ll get there in time!) But to see them running right before our eyes and see that its very much possible - simply awesome! You know how its like when you watch movies or shows like CSI or Alias and they show you some techie stuff and you’re like ‘oh yeah, its a movie - science fiction!’ but this one is real - and fed and i saw it working real-time!
The following are the demos that we witnessed:
1. The direction detector that shows a video wherein people are in a meeting and there are arrows pointing to where the person is directing his attention to or who he’s conversing with - so you can see arrows following the movements of a person’s head.
2. The face-tracking system that identifies the eyes of the person (so Dr. Waibel presented a video where everytime a door opens and someone goes inside, you can see two circles where the person’s eyes are)
3. Directional sound system consisting of ultrasonic speakers that convolve with the air. Basically, how it works is that if the speaker is directed to you then only you can hear the sound. If its not directed to you, then you wont be able to hear a thing.
But the demo that fed and i can relate the most is the language translator. This project is a part of the CHIL (Computers in the Human Interaction Loop) project that aims to aid human to human interaction by breaking the language divide. Dr. Waibel showed us the ‘Long Tail of Language’ - a graph depicting the number of speakers of a certain language. On top of that list is Mandarin, English, Spanish with hundreds of millions of speakers, and then there’s a plunge on the number of speakers of the other languages. And there are 6000 languages in the world! Anyway, this project, like i mentioned aids human to human interaction by breaking the language barriers. And how this thing works may sound simple, but being a speech processing person myself, let me tell you, its not as easy as it sound
There are three legs to this system: (1) Speech recognition - where the system identifies what the speaker has said and outputs it in text form; (2) Translator - translates the text from one language to another; and (3) Speech synthesis - the translated text is converted to sound output. What’s amazing is that Dr. Waibel had a demo of an english-to-filipino translator
However, the synthesizer he used is Spanish so it sounded like a Spanish guy speaking Filipino words (close enough though!)
And so there comes an opportunity for us. Dr. Waibel is really interested to continue that project for the Filipino language and for the other languages in the Philippines - both on the recognition and the synthesis end. They already have the translator (one language to Filipino) but it still needs improvement since no one at their lab speaks Filipino. And that’s where we come in. I am proud to say that our lab is the pioneer in our country to do speech signal processing and we have the ability to help Dr. Waibel and the people at InterACT (International Center for Advanced Communication Technologies) to get this going (at least for the Filipino language). There already have been projects and theses on Filipino speech recognition and synthesis done at our lab. (And Dr. Waibel said that he’s impressed with our efforts in the lab, and he’s quite amazed that these researches are being done in our country. nice!)
So yes, im excited for all these opportunities to collaborate with great minds!
It will be great for the lab to be expanding our horizon, so to speak, and to be working with people who are very good at this field, at least this way, we can leap from where are now to where we want to be
Like i said, we’re inching along, but we’ll get there, with all the help we can possibly get!
all in all, this experience is truly awesome and one-of-a-kind!
oh wait, did i mention, Dr. Waibel’s hobby is flying helicopters?!
Here’s our picture with Dr. Alex Waibel 
